Q: Lonely surrogates and unicode regexps

Marja Hölttä marja at chromium.org
Wed Jan 28 08:27:41 PST 2015


Cool, thanks for the clarifications!

Just to make sure: per the "intended semantics", we never allow splitting a
valid surrogate pair (i.e., matching only one of the surrogates but not the
other), and thus we'll differ from the Java implementation here:

/foo(.+)bar\1/u.test("foo\uD834bar\uD834\uDC00"); we say "false", Java says
"true". The group captures the lone \uD834, and the backreference is not
allowed to match just the first half of the pair \uD834\uDC00.

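(In addition, /^(.+)\1$/u.test("\uDC00foobar\uD834\uDC00foobar\uD834") ==
false, since the only way to split the string into two identical halves
would cut the pair \uD834\uDC00 in the middle.)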
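
For anyone who wants to try this in an engine, here is a minimal sketch of
both cases. The variable names and the comparison against the same patterns
without the u flag are my additions for illustration, not something from the
thread; without the flag, matching works on individual code units, so
splitting the pair is allowed there.

```javascript
// First case: a lone high surrogate, then a valid surrogate pair.
const loneThenPair = "foo\uD834bar\uD834\uDC00";
// Second case: the code-unit halves are identical, but the midpoint
// falls inside the pair \uD834\uDC00.
const pairAtMidpoint = "\uDC00foobar\uD834\uDC00foobar\uD834";

const surrogateResults = {
  // With the u flag the pair is one code point and must not be split.
  firstWithU: /foo(.+)bar\1/u.test(loneThenPair),
  // Without the u flag the backreference compares code units.
  firstWithoutU: /foo(.+)bar\1/.test(loneThenPair),
  secondWithU: /^(.+)\1$/u.test(pairAtMidpoint),
  secondWithoutU: /^(.+)\1$/.test(pairAtMidpoint),
};

console.log(surrogateResults);
```

Under the intended semantics, the two "WithU" results should be false and
the two "WithoutU" results true.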