perl-unicode

Re: possible regexp feature for 5.6: "ignore diacritics"

1999-10-18 02:11:07
Russ Allbery <rra(_at_)stanford(_dot_)edu> writes:
Jarkko Hietaniemi <jhi(_at_)iki(_dot_)fi> writes:
Russ Allbery writes:

Remind me; under POSIX, isn't [=ss=] supposed to work and match ß?

But this is Ilya's \N{}, right? :-)

Honestly, I don't remember.  Darn it, I must get my hands to a copy of
1003.2...

Found where I'd read about it.  Friedl, pp. 80-81.

You might want to take a look at the Base Definitions of the UNIX 98
standard which is a superset of a lot of POSIX.1/2:

    http://opengroup.org/onlinepubs/7908799/

Some vendors also provide the source for their locale
definitions, e.g. on HP-UX 11.00 in /usr/lib/nls/loc/src
(after installing patches like e.g. 
 ftp://us-ffs.external.hp.com/hp-ux_patches/s700_800/11.X/PHCO_16492
 ftp://us-ffs.external.hp.com/hp-ux_patches/s700_800/11.X/PHCO_18125
for Japanese or German UTF-8 locales -- these depots are tar files).

And of course the Unicode standard and the technical reports deal
with it on http://www.unicode.org/unicode/reports/techreports.html


Markus