Hi Mark, thanks for this post. Mark Davis ☕ wrote: > UTF-8 represents a code point as 1-4 8-bit code units "1-6". > UTF-16 represents a code point as 2 or 4 16-bit code units "1 or 2". Lock up your encoders, I am so not a Unicode guru but this is what my reptile coder brain remembers. /be