ietf-822

Re: distinguishing between utf-8 and gb18030

2003-01-17 14:52:29

In <20030116213605(_dot_)023d303d(_dot_)moore(_at_)cs(_dot_)utk(_dot_)edu> 
Keith Moore <moore(_at_)cs(_dot_)utk(_dot_)edu> writes:

As for untagged encodings: There can be only one winner in the end

I do not share your faith that both encodings won't be widely used, or that
they will converge to utf-8 within our lifetimes.

but if both encodings do appear without tagging, and the difference can
reliably be determined by heuristics, I do have faith that vendors will
implement those heuristics.

It has been reported that Mozilla already implements a suitable heuristic,
at least so far as being able to spot and act upon UTF-8 (I don't think it
claims to be able to distinguish the Chinese stuff from other non-UTF-8).
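
A minimal sketch of the kind of heuristic under discussion, in Python. It is
not Mozilla's actual detector, and the function name looks_like_utf8 is
hypothetical. It assumes that text in another multibyte encoding is unlikely
to decode as strict UTF-8 by accident, because UTF-8 lead and continuation
bytes must follow a fixed pattern. Like the behaviour reported above, it only
answers "UTF-8 or not"; it makes no attempt to identify GB18030.

```python
def looks_like_utf8(data: bytes) -> bool:
    """Heuristic: True if data has non-ASCII bytes and is valid strict UTF-8.

    Hypothetical helper for illustration; not taken from any real mail
    client or browser.
    """
    # Pure ASCII is valid in both UTF-8 and GB18030, so it is no evidence.
    if all(b < 0x80 for b in data):
        return False
    try:
        data.decode("utf-8", errors="strict")
    except UnicodeDecodeError:
        # Malformed lead/continuation sequence: not UTF-8.
        return False
    return True


sample = "中文"
print(looks_like_utf8(sample.encode("utf-8")))
print(looks_like_utf8(sample.encode("gb18030")))
print(looks_like_utf8(b"plain ASCII text"))
```

The longer the untagged text, the more weight a positive answer carries; a
very short byte sequence in some other encoding could still happen to be
well-formed UTF-8.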

-- 
Charles H. Lindsey ---------At Home, doing my own thing------------------------
Tel: +44 161 436 6131 Fax: +44 161 436 6133   Web: http://www.cs.man.ac.uk/~chl
Email: chl(_at_)clw(_dot_)cs(_dot_)man(_dot_)ac(_dot_)uk      Snail: 5 Clerewood Ave, CHEADLE, SK8 3JU, U.K.
PGP: 2C15F1A9      Fingerprint: 73 6D C2 51 93 A0 01 E7 65 E8 64 7E 14 A4 AB A5