Chris Hall wrote 2008-03-12 20:49 (+0000):
a. are you saying that characters in Perl are Unicode ?
Yes. They are called Unicode, at least. This is the description I prefer
for explanation and documentation.
b. or are you agreeing that characters in Perl take values
0..0x7FFF_FFFF (or beyond), which are generally interpreted as
UCS, where required and possible ?
This too. This is the more technically accurate explanation, and the one
I prefer for implementation.
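
To make (b) concrete, here is a minimal sketch, assuming a reasonably recent perl on an ASCII platform. The variable names are only for illustration. It shows that chr() and ord() accept ordinals past 0x10_FFFF and still treat each as a single character:

```perl
use strict;
use warnings;

# One past the highest code point that official Unicode defines (U+10FFFF).
my $beyond = chr(0x11_0000);

# Perl still sees exactly one character carrying that ordinal.
my $len = length $beyond;
my $ord = ord $beyond;

# The upper bound mentioned in (b).
my $big     = chr(0x7FFF_FFFF);
my $big_ord = ord $big;

# Only numbers are printed, so no output layer is needed.
printf "length=%d ord=0x%X big_ord=0x%X\n", $len, $ord, $big_ord;
```

Nothing here is Unicode-aware beyond the name: the string is simply a sequence of large integers until something needs to interpret them.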
If (a) then characters with ordinals beyond 0x10_FFFF should throw
warnings (at least) since they clearly are not Unicode !
Perl just has a somewhat broad definition of "Unicode", one that is not
the same as the official Unicode character set.
It has its own utf8, so it can have its own Unicode too :)
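
A small sketch of the "own utf8" point, assuming the Encode module that ships with perl. Encode treats "utf8" as Perl's lax encoding and "UTF-8" as the strict, official one. The variable names are made up for the example:

```perl
use strict;
use warnings;
use Encode qw(encode find_encoding);

# A character that official Unicode does not have.
my $beyond = chr(0x11_0000);

# Encode resolves the two spellings to two different encodings.
my $lax_name    = find_encoding('utf8')->name;
my $strict_name = find_encoding('UTF-8')->name;

# Lax utf8 encodes the ordinal as-is; strict UTF-8 will not produce
# those bytes for a code point above 0x10FFFF.
my $lax_bytes    = encode('utf8',  $beyond);
my $strict_bytes = encode('UTF-8', $beyond);

printf "%s: %s\n", $lax_name,    unpack('H*', $lax_bytes);
printf "%s: %s\n", $strict_name, unpack('H*', $strict_bytes);
```

So whether such a character survives depends on which of the two you ask for when the string leaves perl.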
--
Met vriendelijke groet, Kind regards, Korajn salutojn,
Juerd Waalboer: Perl hacker <#####(_at_)juerd(_dot_)nl>
<http://juerd.nl/sig>
Convolution: ICT solutions and consultancy
<sales(_at_)convolution(_dot_)nl>