On Thu, 8 Jun 1995, Patrik Faltstrom wrote:
At 09.35 95-06-08, Harald(_dot_)T(_dot_)Alvestrand(_at_)uninett(_dot_)no
wrote:
This forbids, among others, ISO 10646 UCS-2 and EBCDIC as text/plain
character sets.
I assumed that what is called UCS-2 is the same thing as what
in "The Unicode Standard, Version 1.1, Appendix F", is called
FSS-UTF, i.e. Filesystem Safe UCS Transformation Format.
No, UCS-2 is the 2-byte form of ISO/IEC 10646
UCS-4 is the 4-byte form of ISO/IEC 10646
UTF-1 is the first 16-to-(n*8) bits method for UCS; is in the
standard but is proposed to be removed
UTF-8 corresponds to FSS-UTF (exactly?) ((32 or 16)-to-(n*8) bits)
UTF-16 is the proposed "enlargement" of BMP (~20-to-(2*16) bits)
I'm not updated on the formal state of the latter two
proposals.
And then there is the Internet-experimental UTF-7
(MIME charset UNICODE-1-1-UTF-7, RFC 1642), 16-to-(n*7) bits.
---
Peter Svanberg, Email:
psv(_at_)nada(_dot_)kth(_dot_)se
Dept of Num An & CS,
Royal Inst of Tech Phone: +46 8 790 71 40
S-100 44 Stockholm, SWEDEN Fax: +46 8 790 09 30