perl-unicode

Re: Encode's .enc files and a question

2000-10-25 09:28:49

    Nick> Following the first page will be all the other pages, each in the
    Nick> same format as the first: one number identifying the page followed
    Nick> by 256 double-byte Unicode characters.  If a character in the
    Nick> encoding maps to the Unicode character 0000, it means that the
    Nick> character doesn't actually exist.  If all characters on a page would
    Nick> map to 0000, that page can be omitted.

There may some day be a use for the Unicode codepoint 0x0000.  It might be
better to make this 0xFFFF, which is a guaranteed non-character in Unicode and
probably in ISO10646.
-----------------------------------------------------------------------------
Mark Leisher
Computing Research Lab            Cinema, radio, television, magazines are a
New Mexico State University       school of inattention: people look without
Box 30001, Dept. 3CRL             seeing, listen without hearing.
Las Cruces, NM  88003                            -- Robert Bresson