ietf-822
[Top] [All Lists]

Re: encoded-words, parameter continuation and visually ordered charsets

2002-04-30 09:12:16

In <200204291229(_dot_)g3TCT0e14486(_at_)astro(_dot_)cs(_dot_)utk(_dot_)edu> 
Keith Moore <moore(_at_)cs(_dot_)utk(_dot_)edu> writes:

It should in any case be possible to decode all encoded-words, in
the order in they appear, into a single string of Unicode symbols and 
then present that string.  If the charset in any of those encoded
words is implicitly visually ordered then it will be necessary
for the corresponding sequence of Unicode symbols to be marked as 
visually ordered also.

Yes, I think that is a desirable property.

Consider the case where the material in question was originally a string
of Unicode symbols, including the proper codings to change the direction
from/to RTL at various points.

It is now passed to some RFC 2047 encoding mechanism (possibly in a
different application from the one that generated the string) in order
that it may become a valid RFC 2822 message. This encoding mechanism
chooses to split the string into encoded words at arbitrary points not
related to the direction changes (because it is a totally naive encoder
knowing nothing of the subtleties of Unicode - just trying to keep the
encoded words within that length limit).

At the far end, it has to be turned back into Unicode, and clearly we want
the overall encoding/decoding to be a nullop.

-- 
Charles H. Lindsey ---------At Home, doing my own thing------------------------
Tel: +44 161 436 6131 Fax: +44 161 436 6133   Web: http://www.cs.man.ac.uk/~chl
Email: chl(_at_)clw(_dot_)cs(_dot_)man(_dot_)ac(_dot_)uk      Snail: 5 
Clerewood Ave, CHEADLE, SK8 3JU, U.K.
PGP: 2C15F1A9      Fingerprint: 73 6D C2 51 93 A0 01 E7 65 E8 64 7E 14 A4 AB A5

<Prev in Thread] Current Thread [Next in Thread>