perl-unicode

Re: .enc docs comments [was Re: Encode's .enc files and a question]

2000-10-27 06:54:04
Philip Newton <newton(_at_)newton(_dot_)digitalspace(_dot_)net> writes:
On Thu, 26 Oct 2000, Mark Leisher wrote:

  Following the first page will be all the other pages, each in the same
  format as the first: one number identifying the page followed by 256
  double-byte Unicode (UCS-2) characters.  If a character in the encoding 
maps
  to the Unicode character 0000, it means that the character doesn't actually
  exist.  If all characters on a page would map to 0000, that page can be
  omitted.

This would mean that there is no good Unicode character to map ASCII 0x00
to. The obvious character is U+0000 "<control> = NULL", but that's
reserved here. So if I'm translating a string containing NULs, those
characters will be treated as "not-a-character"?

ASCII NUL 0x00, maps to U+0000 as a special case.


Cheers,
Philip
-- 
Nick Ing-Simmons <nik(_at_)tiuk(_dot_)ti(_dot_)com>
Via, but not speaking for: Texas Instruments Ltd.