New full Unicode for ES6 idea

Tab Atkins Jr. jackalmage at
Tue Feb 21 15:15:50 PST 2012

On Tue, Feb 21, 2012 at 3:11 PM, Brendan Eich <brendan at> wrote:
> Hi Mark, thanks for this post.
> Mark Davis ☕ wrote:
>> UTF-8 represents a code point as 1-4 8-bit code units
> "1-6".
> Lock up your encoders, I am so not a Unicode guru but this is what my
> reptile coder brain remembers.

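Only theoretically.  The original UTF-8 design allowed sequences of up
to 6 bytes, but UTF-8 has since been locked down to the same range that
UTF-16 has (RFC 3629), i.e. up to U+10FFFF, so the longest encoding of a
real character you'll see is 4 bytes, as that gives you exactly 21 bits
of data.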
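
To make the arithmetic concrete, here is a rough Python sketch. The
names PAYLOAD_BITS, MAX_CODE_POINT and utf8_length are made up for
illustration and don't come from any spec or library. It counts the
payload bits each sequence length carries and picks the shortest
length that fits a given code point.

```python
# Payload bits per UTF-8 sequence length: the bits left in the lead byte
# plus 6 bits for each continuation byte (10xxxxxx).
PAYLOAD_BITS = {
    1: 7,          # 0xxxxxxx
    2: 5 + 6,      # 110xxxxx + 1 continuation byte
    3: 4 + 2 * 6,  # 1110xxxx + 2 continuation bytes
    4: 3 + 3 * 6,  # 11110xxx + 3 continuation bytes = 21 bits
}

# Upper limit shared with UTF-16, per RFC 3629 (illustrative constant name).
MAX_CODE_POINT = 0x10FFFF


def utf8_length(code_point: int) -> int:
    """Return how many bytes UTF-8 needs for the given code point."""
    if not 0 <= code_point <= MAX_CODE_POINT:
        raise ValueError("outside the Unicode code point range")
    for length in (1, 2, 3, 4):
        # The shortest form whose payload can hold the value wins.
        if code_point < (1 << PAYLOAD_BITS[length]):
            return length
    raise AssertionError("unreachable: 21 bits cover U+10FFFF")
```

For instance, utf8_length(0x10FFFF) should come out as 4, which matches
what len(chr(0x10FFFF).encode("utf-8")) reports in Python 3.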

