perl-unicode

[Encode] cp9??.ucm regenerated [Was: Re: [Patch] Encode.pm : euro sign missing in cp936.ucm]

2003-03-27 11:30:06
CJKT experts,

To address the issue first raised by SADAHIRO-san, the missing EURO SIGN in the CP936 map, which in turn cast doubt on the correctness of the original cp9?? data from both unicode.org and microsoft.com, I have regenerated the maps based upon

http://oss.software.ibm.com/cvs/icu/charset/data/ucm/

The regenerated maps are available as

http://www.dan.co.jp/~dankogai/ucm-cp9xx.tar.gz

Actually it is 100% identical to the ICU files from CHARMAP up to END CHARMAP (well, with comments fortified to the taste of the previous map). It passes make test and t/rt.pl. But since I have already suffered from the Elusive MS Document Syndrome, I would like you guys to evaluate the new mapping before I commit.

I found that the new map differs from the old one more than I expected. The biggest difference is that the new version does use the '|1' fallback (Unicode to code page only) in addition to '|3' (code page to Unicode only), which I believe better approximates what MS software products do.
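
For anyone who wants to check how many '|1' and '|3' entries a map contains, here is a minimal sketch in Python. It is not part of Encode or ICU. The names parse_charmap, flag_counts and SAMPLE are hypothetical, and the sample entries are illustrative only, not copied from the regenerated files. It assumes the usual UCM line shape of <UXXXX>, one or more \xHH bytes, then |N.

```python
import re
from collections import Counter

# One CHARMAP entry: <UXXXX> \xHH[\xHH...] |N  (assumed UCM line shape)
ENTRY = re.compile(
    r'^<U([0-9A-Fa-f]{4,6})>\s+((?:\\x[0-9A-Fa-f]{2})+)\s+\|(\d)'
)

def parse_charmap(text):
    """Return (codepoint, bytes, flag) tuples found between CHARMAP and END CHARMAP."""
    entries = []
    inside = False
    for line in text.splitlines():
        line = line.strip()
        if line == "CHARMAP":
            inside = True
            continue
        if line == "END CHARMAP":
            break
        # Skip the header, blank lines and comments.
        if not inside or not line or line.startswith("#"):
            continue
        m = ENTRY.match(line)
        if m:
            codepoint = int(m.group(1), 16)
            hex_bytes = re.findall(r'\\x([0-9A-Fa-f]{2})', m.group(2))
            raw = bytes(int(h, 16) for h in hex_bytes)
            entries.append((codepoint, raw, int(m.group(3))))
    return entries

def flag_counts(entries):
    """Tally entries per flag: 0 = roundtrip, 1 = fallback, 3 = reverse fallback."""
    return Counter(flag for _, _, flag in entries)

# Hypothetical sample, for illustration only.
SAMPLE = r"""
<code_set_name> "sample"
CHARMAP
<U0041> \x41 |0
<U20AC> \x80 |0
<UFF5E> \xA1\xAB |1
<U00B7> \xA1\xA4 |3
END CHARMAP
"""

if __name__ == "__main__":
    for flag, count in sorted(flag_counts(parse_charmap(SAMPLE)).items()):
        print(f"|{flag}: {count}")
```

Running the same tally over the old and new cp9??.ucm files should make the difference in '|1' usage easy to see at a glance.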

Oh, the EURO SIGN issue is resolved, at least :-P

Dan the Encode Maintainer

P.S.  Autrijus, please also check the Big5 issue.
Message-Id: <2624B7F4-5ECA-11D7-8C06-000393AE4244(_at_)dan(_dot_)co(_dot_)jp>
