perl-unicode

Re: Starnge characters when displaying html files saved in UTF-8 format

2001-12-17 00:25:20
On Tue, 11 Dec 2001 13:24:46 -0800, bstell(_at_)ix(_dot_)netcom(_dot_)com 
(Brian Stell)
wrote:

The BOM is valid as the *first* character. I'm not sure what the
spec says about subsequent chars.

As I understand it, 0xFEFF leads a double life: it's either "zero width
no-break space" or "byte order mark". If it's the very first character
in text, it's a BOM, otherwise, it's a zero width no-break space.

So it's a valid character in the middle of text, it just isn't a BOM. At
least, that's how I understand it.

Cheers,
Philip