Untagged UTF-8 is good. It is more and more widely supported by
readers, and will eventually be the only format produced by writers.
It works.
In contrast, a mess of multiple character encodings---even with
tags--- is bad because of its unnecessary complexity. Throw away tags
and it's a disaster: you can't decode it.
I wish you every success in convincing the government of China that they
shouldn't have their own encoding.
--
Keith Moore http://www.cs.utk.edu/~moore/
27 February 1933 11 September 2001