perl-unicode

Re: BOM and principle of least surprise

2004-05-05 10:30:08

Ummm, why?  Giving fewer options to users has never been a strong Perl
tradition :-)  Besides, recognizing the various Unicode encodings is
pretty trivial, especially if we know something that's likely to be
present in the first line, like "perl".

Or the "#!" is pretty good marker too.

My mistake. When I saw "script" in the first line, I assumed we were 
talking about subsections of Unicode, not "script as in a program". 
You're right about doing a guess based on looking for "perl" being 
pretty definitive.

My hope for fewer options is for reading input. That is, I'd like the 
default encoding for all inputs and outputs to be UTF8, unless it has 

We tried this with perl 5.8.0 and the feedback was overwhelmingly
negative...  if people do "print chr 0xff" they do expect one byte,
not two.

been converted and that conversion is somehow flagged.


-- 
Jarkko Hietaniemi <jhi(_at_)iki(_dot_)fi> http://www.iki.fi/jhi/ "There is this 
special
biologist word we use for 'stable'.  It is 'dead'." -- Jack Cohen