On Thu, 26 Oct 2000, Mark Leisher wrote:
Following the first page will be all the other pages, each in the same
format as the first: one number identifying the page followed by 256
double-byte Unicode (UCS-2) characters. If a character in the encoding maps
to the Unicode character 0000, it means that the character doesn't actually
exist. If all characters on a page would map to 0000, that page can be
omitted.
This would mean that there is no good Unicode character to map ASCII 0x00
to. The obvious character is U+0000 "<control> = NULL", but that's
reserved here. So if I'm translating a string containing NULs, those
characters will be treated as "not-a-character"?
Cheers,
Philip
--
Philip Newton <newton(_at_)newton(_dot_)digitalspace(_dot_)net>