On Sat, Aug 11, 2001 at 12:15:17PM +0900, Martin Duerst wrote:
> At 11:01 01/08/10 -0500, Jarkko Hietaniemi wrote:
> > Reminder: please test (and comment) Sadahiro Tomoyuki's Unicode
> > normalization and collation modules, available at
> > http://homepage1.nifty.com/nomenclator/perl/indexE.htm
> > (There's also Unicode::Decompose by Simon Cozens, but unfortunately
> > the 0.01 release in CPAN is truncated and Simon is on the road for
> > a while.)
>
> Normalization (D, KD) covers decomposition. Or is this about something
> else?

The Unicode::Decompose module description says

    This module implements Unicode normalization forms D and C.

and the interface is

    use Unicode::Decompose qw(normalize order decompose recompose normalize_d);

    $norm = normalize($string);
    # OR:
    $decomp  = decompose($string);
    $ordered = order($decomp);
    $norm    = recompose($ordered);
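
For comparison, here is a minimal sketch of the same one-step and three-step pipeline using Sadahiro's Unicode::Normalize instead of Unicode::Decompose (whose truncated 0.01 release can't be checked here). It assumes a Unicode::Normalize that exports decompose, reorder and compose alongside NFD and NFC, as the versions bundled with current perls do. The sample string and the codepoints() helper are made up for illustration.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFD NFC decompose reorder compose);

# Illustrative input: "s" followed by COMBINING DOT ABOVE (U+0307) and
# COMBINING DOT BELOW (U+0323), deliberately not in canonical order.
my $string = "s\x{307}\x{323}";

# One-step interface: normalization forms D and C.
my $nfd = NFD($string);
my $nfc = NFC($string);

# Three-step interface, mirroring decompose / order / recompose.
my $decomp  = decompose($string);   # canonical decomposition only
my $ordered = reorder($decomp);     # canonical reordering of combining marks
my $norm    = compose($ordered);    # canonical composition

# Hypothetical helper: render a string as a list of code points.
sub codepoints {
    return join ' ', map { sprintf('U+%04X', ord($_)) } split(//, $_[0]);
}

print "NFD:       ", codepoints($nfd),  "\n";   # U+0073 U+0323 U+0307
print "NFC:       ", codepoints($nfc),  "\n";   # U+1E69
print "step-wise: ", codepoints($norm), "\n";   # same as NFC
```

The intermediate $ordered string equals the form D result, and composing it gives form C, so the three-step variant is just the one-step call taken apart.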

> Regards, Martin.

--
$jhi++; # http://www.iki.fi/jhi/
# There is this special biologist word we use for 'stable'.
# It is 'dead'. -- Jack Cohen