Re: Encode::Guess fails on UTF-16BE string w/ newline characters

Jay,

Thanks for your report. I have to confess UTF-16(BE|LE) as possible suspects.


On Sunday, April 13, 2003, at 02:59  AM, Jay Lawrence wrote:

Points
- what is the best way to open and read data that might be: UTF-8, UTF-16, UTF-16BE, or UTF-16LE?
- is there a good way to chop the line endings reliably for the above 4 sets?
- maybe detecting the flavour of unicode is better left to a different process?
                Encode::Guess::Unicode?

One possible solution is to detect the presence of \x00 and, when detected, assume UTF-(16|32)(BE|LE). The ones with a BOM are already supported.
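
To make the \x00 idea concrete, here is a rough sketch in Python (for illustration only; this is not how Encode::Guess works, and the function name guess_bomless_utf is made up). It assumes BOM-less input that is mostly ASCII or Latin-1 text, so that the high-order bytes of each code unit are NUL. Text with few such characters (CJK in UTF-16, for example) contains few or no NUL bytes and would not be caught this way.

```python
def guess_bomless_utf(data: bytes):
    """Guess a BOM-less UTF-16/32 codec from NUL-byte positions, or return None.

    Hypothetical helper: assumes mostly ASCII/Latin-1 text without a BOM.
    """
    if len(data) < 2 or b"\x00" not in data:
        return None  # no NULs: probably UTF-8 or another byte-oriented encoding

    # Count NUL bytes at each offset modulo 4.
    nul = [0, 0, 0, 0]
    for offset, byte in enumerate(data):
        if byte == 0:
            nul[offset % 4] += 1

    units32 = len(data) // 4
    if len(data) % 4 == 0:
        # UTF-32 of BMP text: the two high-order bytes of every unit are NUL.
        if nul[0] == units32 and nul[1] == units32:
            return "utf-32-be"
        if nul[2] == units32 and nul[3] == units32:
            return "utf-32-le"

    if len(data) % 2 == 0:
        # UTF-16: NULs cluster on the high-order byte of each 2-byte unit.
        even = nul[0] + nul[2]
        odd = nul[1] + nul[3]
        if even > odd:
            return "utf-16-be"
        if odd > even:
            return "utf-16-le"
    return None


# Decode the whole buffer first, then split into lines, so that line
# endings are handled as characters rather than as raw bytes.
sample = "line one\nline two\n".encode("utf-16-be")
codec = guess_bomless_utf(sample)
lines = sample.decode(codec).splitlines() if codec else []
```

Decoding before splitting matters here: in UTF-16BE a newline is the byte pair 00 0A, so cutting the raw bytes at 0A leaves a stray 00 at the start of the next line.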

Plz advise - perhaps just documentation expansion is necessary and I can help w/ that based on this matter.


I definitely will.

Dan the Encode Maintainer
