UTF-16 vs UTF-32

Boris Zbarsky bzbarsky at MIT.EDU
Mon May 16 19:15:16 PDT 2011


On 5/16/11 9:07 PM, John Tamplin wrote:
> Personally, I think UTF16 is more prone to error than either UTF8 or
> UTF32 -- in UTF32 there is a one-to-one correspondence

One-to-one correspondence between string code units and Unicode 
codepoints.

Unfortunately, "Unicode codepoint" is only a useful concept for some 
scripts...  So you run into the same edge-case issues as UTF-16 does, 
but in somewhat fewer cases.

-Boris


More information about the es-discuss mailing list