perl-unicode

Re: BOM and principle of least surprise

2004-05-14 01:30:09
Jarkko Hietaniemi (jhi(_at_)iki(_dot_)fi) writes:
Nick Ing-Simmons wrote:
This thread started as complaint that perl5 can't read a 
script saved as UCS-2/UTF-16 or whatever Windows uses.

Uh, really?  Perl 5.8+ should be able to do that, automatically.

To be able to that, it would have have to understand byte-order marks (which 
it doesn't). I think there was a suggestion that you could specify an
option on the command-line to specify the encoding of the script file,
but I haven't tried. 
 
I thought the issue was about Perl not automatically guessing the
UTF-16 encoding of input data.

Both input data and the script. Just because the script has been saved
in UTF-8, does not mean that literals in the script are taken as UTF-8.


-- 
Erland Sommarskog, Stockholm, sommar(_at_)algonet(_dot_)se