Looking at the first 8-bit character in the excerpt, E9 in iso8859-1, that should have been converted to C3A9 in UTF-8. iconv correctly does that: $ printf '\xE9' | iconv -f iso-8859-1 -t utf-8 | hexdump -C 00000000 c3 a9 |..| Instead, it got converted to C383C2A9. I'm not sure why.
Hm. C3 83 is the UTF-8 for U+00C3. C2 A9 is the UTF-8 for U+00A9. And the UTF-8 for U+00E9 is C3 A9. Is it possible it was ran through iconv twice? --Ken
<Prev in Thread] | Current Thread | [Next in Thread> |
---|---|---|
|
Previous by Date: | Re: smtp authentication problem, Tony Stoneley |
---|---|
Next by Date: | Re: smtp authentication problem, Ken Hornstein |
Previous by Thread: | Re: mhfixmsg character set conversion, David Levine |
Next by Thread: | Re: mhfixmsg character set conversion, Steven Winikoff |
Indexes: | [Date] [Thread] [Top] [All Lists] |