Russ Allbery writes:
However, UTF-8 penalizes non-ASCII characters spacewise, and is
somewhat more complex to parse and reason about than a pure multibyte
character set.
Have you ever written a program to handle Unicode characters correctly?
Do you realize that UTF-16 is not a ``pure multibyte'' encoding outside
the Basic Multilingual Plane? Do you realize that Unicode has zero-width
accents, so any ``byte count equals width'' rule can't possibly work?
The notion that UTF-16 is simpler than UTF-8 seems to come from broken
programs that (1) don't handle zero-width characters, (2) don't handle
double-width characters, and (3) are limited to the BMP.
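
A minimal Python sketch (not from the post, examples assumed) illustrating all three points: a non-BMP character forces UTF-16 into a surrogate pair, a combining accent makes code-unit count diverge from display width, and CJK characters occupy two columns:

```python
import unicodedata

# (3) Outside the BMP: U+1D11E MUSICAL SYMBOL G CLEF needs a
# surrogate pair in UTF-16 -- two 16-bit code units, 4 bytes --
# so UTF-16 is not a fixed-width ("pure multibyte") encoding there.
clef = "\U0001D11E"
assert len(clef.encode("utf-16-be")) == 4

# (1) Zero-width accents: 'e' + U+0301 COMBINING ACUTE ACCENT is two
# code points (4 bytes in UTF-16) but renders as a single column,
# so "byte count equals width" fails.
e_acute = "e\u0301"
assert len(e_acute) == 2
assert len(e_acute.encode("utf-16-be")) == 4

# (2) Double-width characters: east_asian_width reports 'W' (wide)
# for CJK ideographs, which occupy two terminal columns.
assert unicodedata.east_asian_width("\u6f22") == "W"
```

Any of the three checks breaks a program that equates code units with displayed columns, which is the point of the rebuttal.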
---D. J. Bernstein, Associate Professor, Department of Mathematics,
Statistics, and Computer Science, University of Illinois at Chicago