Working with grapheme clusters

Norbert Lindenberg ecmascript at lindenbergsoftware.com
Fri Oct 25 23:42:20 PDT 2013


On Oct 25, 2013, at 18:35 , Jason Orendorff <jason.orendorff at gmail.com> wrote:

> UTF-16 is designed so that you can search based on code units
> alone, without computing boundaries. RegExp searches fall in this
> category.

Not if the RegExp is case insensitive, or uses a character class, or ".", or a quantifier - these all require looking at code points rather than UTF-16 code units in order to support the full Unicode character set.

Norbert



More information about the es-discuss mailing list