ietf-822
[Top] [All Lists]

Re: UTF-8 over RFC 2047 (Re: Call for Usefor to recharter)

2003-01-14 18:10:13

On 15 Jan 2003 00:38:02 -0000
"D. J. Bernstein" <djb(_at_)cr(_dot_)yp(_dot_)to> wrote:


Keith Moore writes:
I don't share your confidence that this happens in every environment,
for every program that generates header fields.

That isn't your problem. If you want C-normalized text on the wire,
specify C-normalized text on the wire. You don't need to know whether
software accomplishes this by having text C-normalized in the keyboard
interface (as everyone does now) or in the MUA (which would work too).

okay, whatever. 

Anyway, your hypothetical Unicode normalization problems would also
arise in normalizing RFC 2047, so you can't use normalization as an
argument against moving to UTF-8.
Which is precisely why 2047 was never intended for anything that needs
to be interpreted by machine.

Even with that (shortsighted) restriction, RFC 2047 was badly designed.
For example, you should have declared encoding in a separate header
field, not by inserting bytes into existing header fields. You should
have required 8-bit-clean software in 1991; pandering to 7-bit software
was a huge mistake. You should have required UTF-8 support in 1994. You
should have required UTF-8 as a default in 1998.

your naivete is showing even worse than usual -- about what was technically
feasible at the time, what was politically feasible at the time, what would
interoperate with the installed base at the time, what powers "I" had at 
any of these times, etc.  even with ten years of hindsight I'm not sure
how I could have made 1342 much better given the constraints we had to 
work with at the time.

but no matter, 1342/1522/2047 are water under the bridge anyway. 

Keith