perl-unicode

Unicode::Normalize surprise with dotless i

2002-09-05 04:30:04
Hi, Tomoyuki,

Is this a bug in Unicode::Normalize or in my code? I expected that to
combine a circumflex with a small letter i, I'd have to use the
dotless i, but to my surprise, normalization (NFKC in the demo below)
refuses to compose the dotless i with the accent. Here's a demo program:

% perl -le '
use Unicode::Normalize;
use Encode;
use charnames ":full";
for my $e (qw(ascii)){
  print Encode::encode($e,
    NFKC("combining with i: i\N{COMBINING CIRCUMFLEX ACCENT}
combining with dotless i: \N{LATIN SMALL LETTER DOTLESS I}\N{COMBINING CIRCUMFLEX ACCENT}"),
    Encode::FB_PERLQQ); 
}
'
combining with i: \x{00ee}
combining with dotless i: \x{0131}\x{0302}
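
The same behaviour can be narrowed down without Encode by looking at the code points directly. The following is only a minimal sketch: it uses NFC and NFD instead of the NFKC call above, and the helper name codepoints is made up for illustration. It shows what U+00EE decomposes to and which of the two sequences composes back:

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFC NFD);

# Hypothetical helper: render a string as a list of U+XXXX code points.
sub codepoints {
    my ($s) = @_;
    return join ' ', map { sprintf('U+%04X', ord($_)) } split //, $s;
}

# Decompose the precomposed LATIN SMALL LETTER I WITH CIRCUMFLEX.
print 'NFD(U+00EE)        = ', codepoints(NFD("\x{00EE}")), "\n";

# Compose ordinary i + COMBINING CIRCUMFLEX ACCENT.
print 'NFC(U+0069 U+0302) = ', codepoints(NFC("\x{0069}\x{0302}")), "\n";

# Compose dotless i + COMBINING CIRCUMFLEX ACCENT.
print 'NFC(U+0131 U+0302) = ', codepoints(NFC("\x{0131}\x{0302}")), "\n";
```

If NFD(U+00EE) comes back as U+0069 U+0302, then the precomposed character is defined in terms of the ordinary dotted i, and composition would only be expected to reverse that pair, which would leave the dotless-i sequence unchanged.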


What do you think?

-- 
andreas
