Working with grapheme clusters
ecmascript at lindenbergsoftware.com
Fri Oct 25 23:42:20 PDT 2013
On Oct 25, 2013, at 18:35 , Jason Orendorff <jason.orendorff at gmail.com> wrote:
> UTF-16 is designed so that you can search based on code units
> alone, without computing boundaries. RegExp searches fall in this
Not if the RegExp is case insensitive, or uses a character class, or ".", or a quantifier - these all require looking at code points rather than UTF-16 code units in order to support the full Unicode character set.
More information about the es-discuss