perl-unicode

Re: utf8::valid and \x14_000 - \x1F_0000

2008-03-12 13:58:23
Chris Hall wrote 2008-03-12 20:49 (+0000):
  a. are you saying that characters in Perl are Unicode ?

Yes. They are called Unicode, at least. This is the view I prefer for
explanation and documentation.

  b. or are you agreeing that characters in Perl take values
     0..0x7FFF_FFFF (or beyond), which are generally interpreted as
     UCS, where required and possible ?

This too. This is the more technically accurate explanation, and the one
I prefer for implementation.
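
To make (b) concrete, here is a minimal sketch that was not part of the
original message. It assumes a reasonably recent perl, and
in_unicode_codespace is a hypothetical helper named only for this
illustration; Perl has no such built-in. It shows that chr and ord treat
an ordinal beyond 0x10_FFFF as an ordinary single character, while the
helper checks it against the official range.

```perl
use strict;
use warnings;
no warnings 'utf8';    # some perl versions may warn about these code points

# Hypothetical helper (not part of Perl): true if the ordinal lies
# inside the official Unicode code space, 0..0x10_FFFF.
sub in_unicode_codespace {
    my ($ord) = @_;
    return $ord <= 0x10_FFFF ? 1 : 0;
}

# The last official code point, the first one past it, and the upper
# bound quoted in (b).
for my $ord (0x10_FFFF, 0x11_0000, 0x7FFF_FFFF) {
    my $char = chr($ord);    # a single Perl character either way
    printf "0x%X: length %d, ord round-trips: %s, official Unicode: %s\n",
        $ord, length($char),
        (ord($char) == $ord         ? "yes" : "no"),
        (in_unicode_codespace($ord) ? "yes" : "no");
}
```

Whether any of these ordinals should also trigger a warning is a
separate question, and the answer may differ between perl versions.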

  If (a) then characters with ordinals beyond 0x10_FFFF should throw
  warnings (at least) since they clearly are not Unicode !

Perl just has a somewhat broad definition of "unicode", which is not
the same as the official Unicode character set.

It has its own utf8, it can have its own unicode too :)
-- 
Met vriendelijke groet,  Kind regards,  Korajn salutojn,

  Juerd Waalboer:  Perl hacker  <#####(_at_)juerd(_dot_)nl>  <http://juerd.nl/sig>
  Convolution:     ICT solutions and consultancy <sales(_at_)convolution(_dot_)nl>