perl-unicode

Re: good name for characters matching [^\0-\377]?

2007-10-18 12:24:42
Juerd Waalboer wrote:
E R skribis 2007-10-18  9:50 (-0500):
I'm preparing a presentation about Perl and Unicode support, and I'd
like to give a name for characters with ordinals above 255. Is there a
good name for that class?

They are "characters outside the latin-1 range".

Latin-1 has nothing to do with it.  There are countless legacy character
sets that use the code points from 32 to 255, and besides, what
maquerades as Latin-1 in various environments rarely is strict iso-8859-1

How about "extended characters"???

Bad name, because it would suggest an actual barrier, which in unicode
isn't there.

Bad name also because the legacy character sets are often referred to as
extensions to ASCII up to 255 or below.

Above that they are multi-byte characters, but that doesn't mean they're necessarily Unicode, since the CJK legacy character sets are also multi-byte.

JD