Sadahiro-san and perl-unicode readers,
I am now working on Encode::JIS2K, an additional converter for JIS X
0213:2000. While studying JIS X 0213, I found that for euc-jp you can
make a single map that covers both JIS X 0212 and JIS X 0213. I had
assumed the two were mutually exclusive, but they are not. There are
some duplicates, however, so it was not as straightforward as
aggregating the two maps.
I have just finished making a new euc-jp.ucm that behaves as follows.
For euc-jp:
* Round-Trips for all JIS X 0201-kana, JIS X 0208 and JIS X 0212 (same
as before)
* Decode-only for those that appear only in JIS X 0213
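The round-trip half of this can be checked with a short script. This is only a minimal sketch, assuming the stock Encode module with its bundled euc-jp; the sample byte sequences (one per character set) and the helper name round_trips are my own picks for illustration, not part of the map itself.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# One illustrative byte sequence per character set that should round-trip.
my %sample = (
    'JIS X 0201 kana' => "\x8E\xB1",        # SS2 (0x8E) + one byte
    'JIS X 0208'      => "\xA4\xA2",        # two bytes
    'JIS X 0212'      => "\x8F\xB0\xA1",    # SS3 (0x8F) + two bytes
);

# Hypothetical helper: true when decoding and then re-encoding
# reproduces exactly the bytes we started with.
sub round_trips {
    my ($bytes) = @_;
    my $chars = decode('euc-jp', $bytes);
    return encode('euc-jp', $chars) eq $bytes;
}

for my $set (sort keys %sample) {
    printf "%-16s %s\n", $set,
        round_trips($sample{$set}) ? 'round-trips' : 'does not round-trip';
}
```

A decode-only entry would fail this check by design: decode() yields a character, but encode() of that character does not give the same bytes back.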
Please note that this new euc-jp.ucm is NOT THE SAME as euc-jp2k.ucm,
which is to be included in Encode::JIS2K.
For euc-jisx0213:
* Round-Trips for all JIS X 0201-kana and JIS X 0213 (both planes)
* Decode-only for those that appear only in JIS X 0212
* Where JIS X 0208 and JIS X 0213 plane 1 conflict, the JIS X 0213
definition is used. Only the following 3 code points differ (so JIS X
0213 plane 1 is ALMOST a superset of JIS X 0208).
euc-jp
<UFFE3> \xA1\xB1 |0 # FULLWIDTH MACRON
<U2015> \xA1\xBD |0 # HORIZONTAL BAR
<UFFE5> \xA1\xEF |0 # FULLWIDTH YEN SIGN
euc-jisx0213
<U203E> \xA1\xB1 |0 # OVERLINE
<U2014> \xA1\xBD |0 # EM DASH
<U00A5> \xA1\xEF |0 # YEN SIGN
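To see the three conflicting code points from Perl, something like the following should do. It is a sketch under assumptions: euc-jp comes from the stock Encode, while euc-jisx0213 is only tried if Encode::JIS2K happens to be installed (find_encoding returns undef otherwise). The helper name codepoints_for is made up for this example.

```perl
use strict;
use warnings;
use Encode qw(decode find_encoding);

# The three byte sequences whose meaning differs between the two maps.
my @conflicts = ("\xA1\xB1", "\xA1\xBD", "\xA1\xEF");

# Hypothetical helper: decode each byte sequence with the given encoding
# and return the resulting code points formatted as U+XXXX.
sub codepoints_for {
    my ($encoding, @byte_seqs) = @_;
    return map { sprintf 'U+%04X', ord(decode($encoding, $_)) } @byte_seqs;
}

my @encodings = ('euc-jp');
# euc-jisx0213 is not in core Encode; only use it when it can be loaded.
push @encodings, 'euc-jisx0213' if find_encoding('euc-jisx0213');

for my $enc (@encodings) {
    print "$enc: ", join(' ', codepoints_for($enc, @conflicts)), "\n";
}
```

With euc-jp this should print the code points from the first table; with Encode::JIS2K installed, the second line lets you compare against the euc-jisx0213 table.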
In short, apart from the three code points above, euc-jp and
euc-jisx0213 differ only in encode(), and both decoders can decode
euc-jp (1990) as well as euc-jisx0213.
If no one objects, I will use the new map for euc-jp in Encode-1.64 or
later, and Encode::JIS2K will follow.
Dan the Encode Maintainer