nmh-workers
[Top] [All Lists]

Re: Bug reported regarding Unicode handling in email address

2021-06-14 11:34:10
Steffen Nurpmeso wrote in
 <20210614162626(_dot_)VfjxT%steffen(_at_)sdaoden(_dot_)eu>:
 ...
 | 
<20210614121214(_dot_)84C16210F4(_at_)orac(_dot_)inputplus(_dot_)co(_dot_)uk>:
 ...
 ||Why not iconv(3) the input from the user's locale, the MIME part's
 ||charset, etc., to UTF-8, work internally, and then iconv() again on the
 ...
 |functions do not support graphemes, and __STDC_ISO_10646__ is an
 |option, so that you cannot simply code some tables on your own to
 |fill the gaps, because looking at the wchar_t codepoints may not
 |give you a Unicode "codepoint" (though maybe all do it like that
 |so in practice you could make this a precondition).  I had to

To add that if i recall correctly citrus for example does this,
using the upper bits of wchar_t for state info, but i have
forgotten whether that was done in an UTF-8 locale, or rather in
CJK or SHIFT-JS or whatever (my gut says the latter).

--steffen
|
|Der Kragenbaer,                The moon bear,
|der holt sich munter           he cheerfully and one by one
|einen nach dem anderen runter  wa.ks himself off
|(By Robert Gernhardt)

<Prev in Thread] Current Thread [Next in Thread>