perl-unicode

Re: Converting string to UTF-16LE

2004-03-02 19:30:06
If I recall correctly, the case tables were added in response to the Unicode CaseFolding table (lib/unicore/CaseFolding.txt), which does indeed define language-independent foldings that are more complex than usual (mostly caused by encoding irregularities in Unicode). Maybe just the placement of those
tables is wrong.  Again, there is nothing language-specific in those.

... and following CaseFolding.txt is required by the Unicode regular expression guidelines (http://www.unicode.org/unicode/reports/tr18/), under "Default Loose Matches"
(http://www.unicode.org/unicode/reports/tr18/#Default_Loose_Matches)
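
To illustrate what "language-independent but more complex than usual" means in practice, here is a minimal sketch. It is in Python rather than Perl only because Python's built-in str.casefold() applies Unicode full case folding with no locale involved. The helper name loose_equal is made up for this example, and none of this is meant to show how Perl's own tables or regex engine are implemented.

```python
def loose_equal(a: str, b: str) -> bool:
    """Caseless comparison using Unicode full case folding.

    No locale or language setting is consulted: the folding comes
    from the Unicode case folding data alone.
    """
    return a.casefold() == b.casefold()


if __name__ == "__main__":
    sharp_s = "\u00df"      # LATIN SMALL LETTER SHARP S
    final_sigma = "\u03c2"  # GREEK SMALL LETTER FINAL SIGMA
    capital_sigma = "\u03a3"  # GREEK CAPITAL LETTER SIGMA

    # Simple lowercasing leaves sharp s alone, so it cannot match "SS".
    print("STRASSE".lower() == ("Stra" + sharp_s + "e").lower())

    # Full case folding maps sharp s to "ss", a one-to-two mapping.
    print(loose_equal("STRASSE", "Stra" + sharp_s + "e"))

    # Final sigma and capital sigma fold to the same character.
    print(loose_equal(capital_sigma, final_sigma))
```

The one-to-many mapping for sharp s is the kind of irregularity that a plain one-character-to-one-character lowercase table cannot express, which is why a separate folding table is needed for loose matching.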

--
Jarkko Hietaniemi <jhi(_at_)iki(_dot_)fi> http://www.iki.fi/jhi/ "There is this special
biologist word we use for 'stable'.  It is 'dead'." -- Jack Cohen