Hello, Jungshik!
1) GB 18030, CNS 11643-1992
JS> I guess the official designation is GB 18030-2000.
JS> I believe they're CNS 11643-1992 or CNS 11643-1986.
Ken's online has the later one CNS 11643-1992
(the reason I asked you, Junghik, was I was not sure if Ken's online
has the official names or some slang)
Wonder still if EUC-TW uses -1992 or -1986?
JS> You have to be careful with IANA registration. In a sense,
JS> it's like a sink that accepts everything thrown into it :-)
JS>
JS> ... IANA didn't consider much when something is given to them.
JS> They just add to the list almost whatever is given to them.
JS> ...
JS> You should not give too much weight to IANA registry.
Okay, let us practice statistical approach. It should work
well on a junk-yard :-)
94
KS C 5636-1993 -> KSC5636
KS C 5636-1989 -> KSC5636
94x94
JIS C 6226-1983 -> JIS_C6226-1983
JIS X 0208:1983 -> JIS_X0208-1983
JIS X 0208:1990
JIS X 0208:1997
KS C 5601-1987 -> KS_C_5601-1987
KS C 5601-1989 (revised)
KS C 5601-1992 (revised)
KS X 1001:1997 (reissued)
KS X 1001:1998 (two chars added including euro)
In the majority of cases the first ' ' becomes '_'.
This is in line with RFC 2047's "encode-word" syntax, in the ?Q mode
that allowes spaces to be =20 or _ :-)
The second space treatment is inconsistent, but fortunately
GB 18030-2000 and CNS 11643-1992 do not have it, so, by
interpolation, if IANA ever registers these an "educated" guess is
GB 18030-2000 -> GB_18030-2000
CNS 11643-1992 -> CNS_11643-1992
Thank you, Jungshik for helping me with this!
And I'm now putting the official standad's names into my
survey for JIS * and KS * series.
2)
JS> JIS C -> JIS X early 1980's,
JS> KS C -> KS X 1997
BTW, Ken's online also has a JIS X 0201-1976 entry.. Wonder if
it should have really been JIX X 0201:1976
3)
JS> Moreover, the year a standard is issued used to be preceded
JS> by '-', but now is preceded by ':' as in ISO standards
Oh, my!!! Thanks a lot! Would have never caught that on my own!
Don't know if the up-to-date printed book by Ken has it, but
his old cjk.inf available on like uses only '-'-s everywhere!!
I guess ":" is not allowed in the charset parameter of a MIME
name, so all IANA registrations I guess are going to have a '-'
in that position.
BTW have seen it many times people are writing JIS X0208..
P.S.
AT> Writing a bit of an article...
JS> I strongly recommend you get CJKV Information Processing by Ken Lunde.
Have read what all his on-line variant
http://www.oreilly.com/people/authors/lunde/cjk_inf.html
several years old, but very useful for putting on track!
My first reading on CJK :-)
JS> As Dan wrote in his dropped document on encodings, ISO-2022 standard
JS> (ECMA 35 at http://www.ecma.ch) itself is a great(?!) read :-)
Supported.pod still recommends that, I hope :-)