perl-unicode

Re: 5.8 roadmap and Encode

2002-02-28 20:04:53
On Thu, Feb 28, 2002 at 08:51:45PM +0200, Jarkko Hietaniemi wrote:
I think we should aim at the very last to keep up with
a Certain Language:

http://java.sun.com/j2se/1.4/docs/guide/intl/encoding.doc.html

Based on a quick check, we are missing the following compared
with the J2SE 1.4 list:

---

Big5_HKSCS      Big5 with Hong Kong extensions
Cp273           IBM Austria, Germany
Cp277           IBM Denmark, Norway
Cp278           IBM Finland, Swedene
Cp280           IBM Italy
Cp284           IBM Catalan/Spain, Spanish Latin America
Cp285           IBM United Kingdom
Cp297           IBM France
Cp420           IBM Arabic
Cp500           EBCDIC 500V1
Cp838           IBM Thailand extended SBCS
Cp868           MS-DOS Pakistan
Cp870           IBM Multilingual Latin-2
Cp871           IBM Iceland
Cp875           IBM Thai
Cp918           IBM Greek
Cp921           IBM Pakistan (Urdu)
Cp922           IBM Estonia (AIX, DOS)
Cp930           Japanese Katakana-Kanji mixed with 4370 UDC, superset of 5025
Cp933           Korean mixed with 1880 UDC, superset of 5029
Cp935           Simplified Chinese Host mixed with 1880 UDC, superset of 5031
Cp937           Traditional Chinese Host mixed with 6204 UDC, superset of 5033
Cp939           Japanese Latin Kanji mised with 4370 UDC, superset of 5035
Cp942           IBM OS/2 Japanese, superset of Cp932
Cp942C          Variant of Cp942
Cp943           IBM OS/2 Japanese, superset of Cp932 and Shift-JIS
Cp943C          Variant of 943C
Cp948           OS/2 Chinese (Taiwan) superset of 938
Cp949           PC Korean
Cp949C          Variant of Cp949
Cp964           AIX Chinese (Taiwan)
Cp970           AIX Korean
Cp1025          IBM Multilingual Cyrillic
Cp1026          IBM Turkey
Cp1046          IBM Arabic Windows
Cp1097          IBM Farsi/Persian
Cp1098          IBM Farsi/Persian (PC)
Cp1112          IBM Latvia, Lithuania
Cp1122          IBM Estonia
Cp1123          IBM Ukraine
Cp1124          IBM AIX Ukraine
Cp1140          Cp037 with Euro
Cp1141          Cp273 with Euro
Cp1142          Cp277 with Euro
Cp1143          Cp278 with Euro
Cp1144          Cp280 with Euro
Cp1145          Cp284 with Euro
Cp1146          Cp285 with Euro
Cp1147          Cp297 with Euro
Cp1148          Cp500 with Euro
Cp1149          Cp871 with Euro
Cp1381          IBM OS/2, DOS PRC
Cp1383          IBM AIX PRC
Cp33722         IBM euc-JP
GB18030         Simplified Chinese PRC
GBK             GBK Simplified Chinese
ISCII91         Indic scripts
ISO2022CN_CNS   CNS 11643 in ISO 2022 CN form
ISO2022CN_GB    GB 2312 in ISO 2022 CN form
JISAutoDetect   Detect and convert from Shift-JIS, EUC-JP, ISO 2022 JP
Johab           Korean
MacArabic       Arabic
MacHebrew       Hebrew
Big5_Solaris    Big5 plus even additional characters from Solaris zh_TW.BIG5

-- 
$jhi++; # http://www.iki.fi/jhi/
        # There is this special biologist word we use for 'stable'.
        # It is 'dead'. -- Jack Cohen

<Prev in Thread] Current Thread [Next in Thread>