perl-unicode

Re: .enc docs comments [was Re: Encode's .enc files and a question]

2000-10-27 03:49:35
On Thu, 26 Oct 2000, Mark Leisher wrote:

  Following the first page will be all the other pages, each in the same
  format as the first: one number identifying the page followed by 256
  double-byte Unicode (UCS-2) characters.  If a character in the encoding maps
  to the Unicode character 0000, it means that the character doesn't actually
  exist.  If all characters on a page would map to 0000, that page can be
  omitted.

This would mean that there is no good Unicode character to map ASCII 0x00
to. The obvious character is U+0000 "<control> = NULL", but that's
reserved here. So if I'm translating a string containing NULs, those
characters will be treated as "not-a-character"?

Cheers,
Philip
-- 
Philip Newton <newton(_at_)newton(_dot_)digitalspace(_dot_)net>