perl-unicode

Re: [perl #22111] perl::Encode doesn't handle UTF-8 NFD strings

2003-05-06 06:30:05

this gives a chance to workaround this bug (yes, I think it is).

And I think it is not.  Normalization should not magically be done.
If the Unicode string has been normalized to form D and some
originally composed characters have been decomposed, it is no more
the same string as the original.

well, see: from_to claims to convert from encoding1 to encoding2. 
encoding1 in this case is utf-8. Also the non-composed UTF-8 is 
perfectly valid UTF-8 and there's absolutely no reason, why 
from_to($string,"utf8","latin1") should not work just because I used 
the NFD form and not the NFC form. Your example is just a way to work 

You are assuming the equivalence of (pre)composed characters and
their composed forms.  Perl doesn't do this at any level.

around this bug but from_to should not care if the initial string is 
NFC or NFD.

-- 
Jarkko Hietaniemi <jhi(_at_)iki(_dot_)fi> http://www.iki.fi/jhi/ "There is this 
special
biologist word we use for 'stable'.  It is 'dead'." -- Jack Cohen