perl-unicode

Re: [EXPERIMENTAL] 1st draft of Encode

2000-09-12 06:20:18
On Mon, Sep 11, 2000 at 08:25:37PM -0700, Ed Batutis wrote:
Please read Encode.pm.  Mainly I'm interested hearing comments whether
this is a good interface...

I like the interface. No complicated options. 

It might be more useful if the default for the non-utf-8 characters
were the system-defined default character encoding of the process
-rather than iso latin 1. I suspect there are various difficulties
with this, but it would be more useful for international users.

I can understand the request but the problem is that for this to work
the legacy eight-bit mappings must first be implemented.  (E.g. if
the native character encoding is DOS CPXXXX, or Latin 2, or KOI8-R,
before we can hope to map it to utf8, we must map it the full UCS-2
Unicode anyway...)

Perhaps it would be difficult to use ICU as a utility library for
Perl, but it seems on the surface to make sense. It is open source
(some flavor thereof anyway). There are a lot of converters
implemented and they are very fast. There is a mechanism to allow
loading only the converter tables that are required. (There's also
lots of locale support stuff there too, which might also be useful.)

Yes, this is noted down in the file Todo-5.6.  The biggest problem is that
the ICU will not be everywhere.

-- 
$jhi++; # http://www.iki.fi/jhi/
        # There is this special biologist word we use for 'stable'.
        # It is 'dead'. -- Jack Cohen

<Prev in Thread] Current Thread [Next in Thread>