perl-unicode

ucm/cp???.ucm will be updated

2002-10-18 10:30:06
Autrijus and others,

On Friday, Oct 18, 2002, at 22:21 Asia/Tokyo, Dan Kogai wrote:
[2] http://www.microsoft.com/typography/unicode/cscp.htm
[3] http://www.microsoft.com/typography/unicode/932.txt

[snip]

The URI [2] also has links to other code pages so I would also like to review them and if neccessary, update them. 8 bit code pages (CP12??) seem OK but other CJK (CP9??) needs reviews.

So I did to 932 (JP), 936 (CN), 949 (KR), and 950 (TW). The new maps generated via http://www.microsoft.com/typography/unicode/9??.txt all seem to pass roundtrip tests in t/CJKT.t but 936 and 950 fails in t/at-cn.t and t/at-tw.t.

Those are tests originally submitted as a patch to t/CJKT.t by Autrijus a long ago then wound up in where they are now.

I found those tests rather obsolete but I am no expert in those encodings tested there. So I would like you to review them at

http://www.dan.co.jp/~dankogai/bleedperl/cp-cjk/

You can also find my crude script that was used for conversion as

http://www.dan.co.jp/~dankogai/bleedperl/cp-cjk/ms2ucm.pl

Xie4Xie4Ge3Zuo1 !

Dan the Encode Maintainer