Re: BOM and principle of least surprise

At 7:47 PM +0300 5/5/04, Jarkko Hietaniemi wrote:

> > My hope for fewer options is for reading input. That is, I'd like the
> > default encoding for all inputs and outputs to be UTF8, unless it has
>
> We tried this with perl 5.8.0 and the feedback was overwhelmingly
> negative... if people do "print chr 0xff" they do expect one byte,
> not two.

Of course, but that's not the only way to have a single encoding for
input and output. For example, "use utf8" (or some newly-named
equivalent) could have effects on all file input and output in the
scope. A different method would be to have all input and output be
binary, but to have standardized operator overload systems (this is
more cumbersome than the first suggestion).
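
As a rough sketch of the first suggestion, the existing "open" pragma
already works along these lines: it sets default I/O layers for every
open() in its lexical scope. The helper names and the temporary file
below are made up for illustration, and the sketch assumes that neither
PERL_UNICODE nor -C is changing the defaults. With no pragma in scope,
chr(0xff) should come out as one byte; inside the scoped sub, the same
open() call should produce two.

```perl
use strict;
use warnings;
use File::Temp qw(tempfile);

# Plain open(): no encoding is named anywhere, so Perl's default applies.
# (Hypothetical helper, for illustration only.)
sub size_with_default {
    my ($path) = @_;
    open(my $out, '>', $path) or die "open $path: $!";
    print {$out} chr(0xff);
    close($out) or die "close $path: $!";
    return -s $path;    # file size in bytes
}

# The same open() call, but the pragma sets the output encoding for
# everything opened inside this block. (Hypothetical helper.)
sub size_with_scoped_utf8 {
    my ($path) = @_;
    use open OUT => ':encoding(UTF-8)';    # lexical: ends with this block
    open(my $out, '>', $path) or die "open $path: $!";
    print {$out} chr(0xff);
    close($out) or die "close $path: $!";
    return -s $path;
}

my (undef, $path) = tempfile(UNLINK => 1);
my $default_size = size_with_default($path);
my $scoped_size  = size_with_scoped_utf8($path);
print "default: $default_size byte(s), scoped UTF-8: $scoped_size byte(s)\n";
```

Neither open() call mentions an encoding; only the enclosing scope
differs.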

What I don't want (and what we mostly have now) is a language where
the programmer has to remember to ask "what encoding am I using" for
every input or output command. If I have a text processing program,
it is likely that all input and output will be in my chosen encoding;
those that aren't need to be read/written using different tools (such
as subroutines that have the new encoding specified for their scope).
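
One way to picture the "different tools" case, again using the existing
"open" pragma rather than any new syntax: the program declares its
chosen encoding once, and a single subroutine overrides it for the one
input that differs. The helper name read_latin1, the sample text, and
the temporary files are assumptions made up for this sketch.

```perl
use strict;
use warnings;
use File::Temp qw(tempfile);
use open ':encoding(UTF-8)';    # chosen encoding for every open() in this file

# Hypothetical helper for the one legacy input that is not UTF-8.
sub read_latin1 {
    my ($path) = @_;
    use open IN => ':encoding(iso-8859-1)';    # override, this sub only
    open(my $in, '<', $path) or die "open $path: $!";
    local $/;                                  # slurp the whole file
    my $text = <$in>;
    close($in) or die "close $path: $!";
    return $text;
}

# Create a Latin-1 test file: "caf" followed by the single byte 0xE9.
# An explicit layer on open() takes precedence over the pragma default.
my (undef, $legacy) = tempfile(UNLINK => 1);
open(my $raw, '>:raw', $legacy) or die "open $legacy: $!";
print {$raw} "caf\xe9";
close($raw) or die "close $legacy: $!";

my $text = read_latin1($legacy);

# Everything else uses plain open() and gets the file-wide UTF-8 default.
my (undef, $copy) = tempfile(UNLINK => 1);
open(my $out, '>', $copy) or die "open $copy: $!";
print {$out} $text;
close($out) or die "close $copy: $!";

my $copy_size = -s $copy;
printf "read %d characters, wrote %d bytes\n", length($text), $copy_size;
```

The only place an encoding other than the program's default appears is
inside the subroutine that needs it.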
