Marco,
Thank you for elaborating my points.
On 2002.02.02, at 01:40, Marco Cimarosti wrote:
<< The entire former contents of this directory are obsolete and have
been
moved to the OBSOLETE directory. The latest information may be found
in the Unihan.txt file in the latest Unicode Character Database.
August 1, 2001. >>
And don't bother to download the 23 Mb
<http://www.unicode.org/Public/UNIDATA/Unihan.txt> file, because it
contains
only mappings for kanji's.
Yes. That's the point #0. Unihan.txt is no replacement for
MAPPINGS. Maybe I can come up with a script which generates a table out
of it but this kind of attitude is far from nice.
And Unihan.txt also lacks 8bit mappings like JISX-0201.
So, go directly to
<http://www.unicode.org/Public/MAPPINGS/OBSOLETE/EASTASIA/>, where you
can
find the old data, along with a note about mapping errors:
But this time, they are right about being OBSOLETE.
Below is some analysis by Asmus Freytag of specific problems raised by
T.
Kubota in this document:
http://www.debian.or.jp/~kubota/unicode-symbols.html
English version also available as
http://www.debian.or.jp/~kubota/unicode-symbols.html.en
And let me quote the part which is significant.
ASCII and JIS X 0201 Roman
When converting EUC-JP and Shift_JIS, handling of 0x5c and 0x7e can be
a problem. Since both encodings have long history and Japanese people
have lot of experience how to handle them, I now introduce it.
Solution is very simple. Just regard YEN SIGN and REVERSE SOLIDUS as a
different glyphs of the same character. Then, distinction between ASCII
and JIS X 0201 Roman can be neglected.
Has anyone of Unicode Consortium seen this one?
Dan