perl-unicode

[FYI] JIS X 0213:2004

2004-02-29 08:30:05
Hello, all.

JIS X 0213 was revised on Feb 20, 2004 (as Amendment 1).

Short summary:

* The notorious parenthesized UCS code points
  (nonexistent UCS code points that were given in parentheses) are gone.

* 25 JIS characters are mapped to a sequence of two UCS characters.

These mappings are identical to the ones I reported here earlier:
cf. http://www.xray.mpe.mpg.de/mailing-lists/perl-unicode/2002-04/msg00024.html

* The mapping of JIS 2-93-27 (plane-row-cell) is corrected to U+9B1C,
  in accordance with Unicode 3.2.0.
  In JIS X 0213:2000, it was mapped to U+9B1D.

* Representative glyphs for 168 kanji are revised
  (their mappings to UCS are not changed).

* 10 kanji are added to level 3 (total: 11233 graphic characters).
Each new kanji is a variant of a level 1 or level 2 kanji.
These variant pairs have not been unified in UCS.

p-r-c     SJIS     UCS      /  its variant (level 1 or 2)
1-14-1    0x879F   U+4FF1   /   1-22-70, U+5036
1-15-94   0x889E   U+525D   /   1-39-77, U+5265
1-47-52   0x9873   U+20B9F  /   1-28-24, U+53F1
1-47-94   0x989E   U+541E   /   1-38-61, U+5451
1-84-7    0xEAA5   U+5653   /   1-17-19, U+5618
1-94-90   0xEFF8   U+59F8   /   1-53-11, U+598D
1-94-91   0xEFF9   U+5C5B   /   1-54-2,  U+5C4F
1-94-92   0xEFFA   U+5E77   /   1-54-85, U+5E76
1-94-93   0xEFFB   U+7626   /   1-33-73, U+75E9
1-94-94   0xEFFC   U+7E6B   /   1-23-50, U+7E4B
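
The SJIS column above can be reproduced from the row-cell numbers with the
usual Shift_JIS arithmetic for plane 1. The following is only a minimal
sketch: the function name plane1_to_sjis is made up for illustration, and
the formula is the conventional row/cell-to-byte mapping, not something
quoted from the standard's text.

```python
def plane1_to_sjis(row, cell):
    """Return the two-byte Shift_JIS code (as an int) for a plane 1 row-cell.

    Hypothetical helper for illustration; handles plane 1 only.
    """
    if not (1 <= row <= 94 and 1 <= cell <= 94):
        raise ValueError("row and cell must be in 1..94")

    # Two consecutive rows share one lead byte. Rows 1-62 use lead bytes
    # 0x81-0x9F; rows 63-94 use 0xE0-0xEF, leaving 0xA0-0xDF unused.
    if row <= 62:
        lead = (row + 0x101) // 2
    else:
        lead = (row + 0x181) // 2

    if row % 2 == 1:
        # Odd row: trail bytes 0x40-0x9E, skipping 0x7F.
        trail = cell + 0x3F + (1 if cell >= 64 else 0)
    else:
        # Even row: trail bytes 0x9F-0xFC.
        trail = cell + 0x9E

    return (lead << 8) | trail


# Row-cell pairs of the ten added kanji, taken from the table above.
NEW_KANJI = [(14, 1), (15, 94), (47, 52), (47, 94), (84, 7),
             (94, 90), (94, 91), (94, 92), (94, 93), (94, 94)]

for row, cell in NEW_KANJI:
    print("1-%d-%d  0x%04X" % (row, cell, plane1_to_sjis(row, cell)))
```

Plane 2 uses a different lead-byte assignment and is not covered by this
sketch.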

* Encoding names are revised.
(Oops, these names are suffixed with "2003",
 but the revision was published in 2004! Too late!)

(Traditional)   JIS X 0213:2000    JIS X 0213:2004
-----------------------------------------------------
Shift_JIS       Shift_JISX0213     Shift_JIS-2003
EUC-JP          EUC-JISX0213       EUC-JIS-2003
ISO-2022-JP     ISO-2022-JP-3      ISO-2022-JP-2003

* The escape sequence for JIS X 0213:2004 Plane 1 is revised
 (Is it still a proposal? I don't know when it will be registered as such).

      New final byte : 5/1 (ascii 'Q')
      Old final byte : 4/15 (ascii 'O')

  Final byte for Plane 2 is left as it was [5/0 (ascii 'P')].
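
For readers not used to the column/row notation, here is a small sketch
that turns "5/1" and friends into bytes and builds the full escape
sequences. Note the assumptions: the post only gives the final bytes; the
leading ESC 2/4 2/8 part is the general ISO/IEC 2022 form for designating
a multi-byte 94-character set to G0, as used by ISO-2022-JP-3, and the
function names are made up for illustration.

```python
ESC = 0x1B


def col_row(notation):
    """Convert ISO/IEC 2022 column/row notation such as '5/1' to a byte value."""
    col, row = notation.split("/")
    return int(col) * 16 + int(row)


def designate_g0(final):
    """Build ESC 2/4 2/8 F for the given final byte F (assumed G0 designation form)."""
    return bytes([ESC, col_row("2/4"), col_row("2/8"), col_row(final)])


PLANE1_NEW = designate_g0("5/1")   # JIS X 0213:2004 Plane 1
PLANE1_OLD = designate_g0("4/15")  # JIS X 0213:2000 Plane 1
PLANE2 = designate_g0("5/0")       # Plane 2, unchanged

for name, seq in [("Plane 1 (new)", PLANE1_NEW),
                  ("Plane 1 (old)", PLANE1_OLD),
                  ("Plane 2", PLANE2)]:
    # Show each sequence as hex bytes and as its printable ASCII tail.
    print(name, seq.hex(" "), "ESC " + seq[1:].decode("ascii"))
```

A decoder that wants to accept both old and new data would have to
recognize both Plane 1 sequences.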

SADAHIRO Tomoyuki
