Re: [perl #22111] perl::Encode doesn't handle UTF-8 NFD strings

On 2003-05-06 at 21:21 +0900 Dan Kogai sent off:

If perl is an application like, say, a word processor, I would agreethat perl and Encode should handle Normalization internally andtransparently so "canonically-equivalent" strings compare as equal.But perl is a PROGRAMMING LANGUAGE so you have to be able to treatdifferent (though may be equivalent Unicode-wise) things different bydefault. Otherwise you can't even implement new normalization in perl.So I do not consider this as a bug since perl 5.8 comes with bothEncode and Unicode::Normalize.


this gives a chance to workaround this bug (yes, I think it is).

If you want to do it transparently, you can always use Encode::Encodingto implement your own. Here is an example.

well, see: from_to claims to convert from encoding1 to encoding2.encoding1 in this case is utf-8. Also the non-composed UTF-8 isperfectly valid UTF-8 and there's absolutely no reason, whyfrom_to($string,"utf8","latin1") should not work just because I usedthe NFD form and not the NFC form. Your example is just a way to workaround this bug but from_to should not care if the initial string isNFC or NFD.


Bjoern

<Prev in Thread]

Current Thread

[Next in Thread>

Previous by Date:

Re: [perl #22111] perl::Encode doesn't handle UTF-8 NFD strings, Jarkko Hietaniemi

Next by Date:

Re: [perl #22111] perl::Encode doesn't handle UTF-8 NFD strings, SADAHIRO Tomoyuki

Previous by Thread:

Re: [perl #22111] perl::Encode doesn't handle UTF-8 NFD strings, Dan Kogai

Next by Thread:

Re: [perl #22111] perl::Encode doesn't handle UTF-8 NFD strings, Jarkko Hietaniemi

Indexes:

[Date] [Thread] [Top] [All Lists]