Hello Jungshik!
JS> MS should have registered CP949/950 as Windows-949/950
JS> instead of labeling them misleadingly as ks_c_5601-1987 and big5. In case
JS> of gb2312, gbk should be registered and used. I don't know about big5,
JS> but in Korean case, apparently they tried to pretend that they follow
JS> Korean Nat'l std. while they extended it in a proprietary way.
I used to think that the trouble is as follows:
ks_c_5601-1987 is a Coded Character Set, perhaps with an implied
raw 2-bytes-per-character, 7-bit Character Encoding Scheme
without any control characters and without the space,
as defined by RFC 1345, while
EUC-KR is a Character Encoding Scheme built upon
ks_c_5601-1987 that encodes ks_c_5601-1987 characters as
pairs of 0xA1-0xFE octets and uses 0x21-0x7E for ASCII;
so the trouble is not that they extended ks_c_5601-1987
but that they have used a Coded Character Set name
as an alias for a CES built on top of that CCS. Is that right?
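
To make the CCS/CES distinction concrete, here is a rough Python sketch
(not anything MS or the RFCs prescribe). It assumes Python's built-in
"euc_kr" codec; the helper names euc_kr_to_7bit and row_cell are made up
for illustration. It takes the EUC-KR octet pair for one Hangul syllable
and strips the high bit to get the raw 7-bit two-byte form I mean above.

```python
def euc_kr_to_7bit(pair: bytes) -> bytes:
    """Map a 2-octet EUC-KR code (0xA1-0xFE each) to the raw 7-bit form.

    Hypothetical helper: it only clears the high bit of each octet,
    which yields values in the 0x21-0x7E range.
    """
    if len(pair) != 2 or not all(0xA1 <= b <= 0xFE for b in pair):
        raise ValueError("not a 2-octet EUC-KR code for a KS C 5601 character")
    return bytes(b & 0x7F for b in pair)


def row_cell(pair: bytes) -> tuple[int, int]:
    """Return the (row, cell) position in the 94x94 table for an EUC-KR pair."""
    return pair[0] - 0xA0, pair[1] - 0xA0


if __name__ == "__main__":
    ga = "\uac00"                    # HANGUL SYLLABLE GA, first Hangul in the set
    euc = ga.encode("euc_kr")        # CES: pair of 0xA1-0xFE octets
    print(euc.hex(), euc_kr_to_7bit(euc).hex(), row_cell(euc))
    # prints: b0a1 3021 (16, 1)
    print("A".encode("euc_kr"))      # ASCII passes through as a single octet
    # prints: b'A'
```

The same character has one position in the character set (row 16, cell 1)
but different octets depending on the encoding scheme, which is why a
single name for both is confusing.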
- Anton