perl-unicode

Re: Unicode::Normalize surprise with dotless i

2002-09-05 07:30:05
Andreas J. Koenig <andreas(_dot_)koenig(_at_)anima(_dot_)de> writes:
Hi, Tomoyuki,

Is it a bug in Unicode::Normalize or in my code? I expected that, to
combine a circumflex with a small letter i, I'd have to use the
dotless i, but to my surprise, NFC refuses to compose with the dotless
i. Here's a demo program:

% perl -le '
use Unicode::Normalize;
use Encode;
use charnames ":full";
for my $e (qw(ascii)){
 print Encode::encode($e,
   NFKC("combining with i: i\N{COMBINING CIRCUMFLEX ACCENT}
combining with dotless i: \N{LATIN SMALL LETTER DOTLESS I}\N{COMBINING CIRCUMFLEX ACCENT}"),
   Encode::FB_PERLQQ); 
}
'
combining with i: \x{00ee}
combining with dotless i: \x{0131}\x{0302}


What do you think?

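Makes sense to me. U+00EE is "LATIN SMALL LETTER I WITH CIRCUMFLEX",
not "LATIN SMALL LETTER DOTLESS I WITH CIRCUMFLEX". Its canonical
decomposition is U+0069 U+0302, so only the ordinary (dotted) i
composes with the circumflex.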
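
The behaviour can be checked with Unicode::Normalize itself. The sketch
below is not part of the original exchange, and the helper name
codepoints() is made up for illustration. It decomposes U+00EE with NFD,
then runs NFC over both candidate sequences and prints the resulting
code points.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFC NFD);

# Hypothetical helper: render a string as space-separated U+XXXX code points.
sub codepoints {
    my ($str) = @_;
    return join ' ', map { sprintf('U+%04X', ord($_)) } split //, $str;
}

my $dotted  = "\x{0069}\x{0302}";   # i + COMBINING CIRCUMFLEX ACCENT
my $dotless = "\x{0131}\x{0302}";   # dotless i + COMBINING CIRCUMFLEX ACCENT

# What the precomposed character decomposes to.
print 'NFD(U+00EE):            ', codepoints(NFD("\x{00EE}")), "\n";
# Dotted i + circumflex composes back to U+00EE.
print 'NFC(i + circumflex):    ', codepoints(NFC($dotted)), "\n";
# No precomposed character exists for this pair, so it is left alone.
print 'NFC(dotless i + circ.): ', codepoints(NFC($dotless)), "\n";
```

The first line should print U+0069 U+0302, the second U+00EE, and the
third U+0131 U+0302 unchanged, which matches the output in the demo
program above.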


-- 
Nick Ing-Simmons
http://www.ni-s.u-net.com/
