perl-unicode

Re: [EXPERIMENTAL] 1st draft of Encode

2000-09-11 12:45:29
On Mon, Sep 11, 2000 at 02:13:33PM -0500, Jarkko Hietaniemi wrote:
There isn't anything to test this with (I did say 'very rough').
Please read Encode.pm.  Mainly I'm interested in hearing comments on whether
this is a good interface, something that could be used to replace
Unicode::Map8 (lots of table-driven conversions, for 8-bit legacy
character sets), and when we feel up to it, Unicode::Map (lots of
algorithmic conversions, for East Asian encodings).

I *like* the interface to Encode, I think that's what we want. I'll send
patches to the functionality and some tests later on.

I've been working on the other end of it, which is the conversion to and from
other character sets. Basically, the plan is to derive the data from the
Unicode conversion tables, but I haven't decided whether it's better to do
that in C or Perl. I've been doing it in C for speed, although putting all
that data in a header file bloats the resident memory size.

The legacy 8-bit stuff is trivial, and when my copy of CJKV Information
Processing arrives (Hi Jon!), I'll be able to finish off the Unihan business.
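
To make "trivial" concrete, here is a minimal sketch of a table-driven 8-bit
decoder, using ISO-8859-15 as the example. All names are hypothetical and
nothing here is taken from Encode.pm or its C sources. As a simplifying
assumption, the table is filled in at startup from the Latin-1 identity
mapping plus eight overrides, rather than being generated from the Unicode
conversion tables into a header file.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical names throughout; this is an illustration, not Encode's code. */

/* ISO-8859-15 matches ISO-8859-1 (bytes map to U+0000..U+00FF unchanged)
   except at these eight byte values. */
struct override { uint8_t byte; uint16_t ucs; };

static const struct override latin9_overrides[] = {
    {0xA4, 0x20AC}, /* EURO SIGN */
    {0xA6, 0x0160}, /* LATIN CAPITAL LETTER S WITH CARON */
    {0xA8, 0x0161}, /* LATIN SMALL LETTER S WITH CARON */
    {0xB4, 0x017D}, /* LATIN CAPITAL LETTER Z WITH CARON */
    {0xB8, 0x017E}, /* LATIN SMALL LETTER Z WITH CARON */
    {0xBC, 0x0152}, /* LATIN CAPITAL LIGATURE OE */
    {0xBD, 0x0153}, /* LATIN SMALL LIGATURE OE */
    {0xBE, 0x0178}, /* LATIN CAPITAL LETTER Y WITH DIAERESIS */
};

/* Fill a 256-entry "byte -> Unicode code point" table for ISO-8859-15. */
void build_latin9_table(uint16_t table[256])
{
    size_t i;
    assert(table != NULL);
    for (i = 0; i < 256; i++)
        table[i] = (uint16_t)i;
    for (i = 0; i < sizeof latin9_overrides / sizeof latin9_overrides[0]; i++)
        table[latin9_overrides[i].byte] = latin9_overrides[i].ucs;
}

/* Decode n bytes into code points with one lookup per byte.
   out must have room for n entries; returns the number written. */
size_t decode_8bit(const uint16_t table[256],
                   const unsigned char *in, size_t n, uint32_t *out)
{
    size_t i;
    assert(table != NULL && in != NULL && out != NULL);
    for (i = 0; i < n; i++)
        out[i] = table[in[i]];
    return n;
}
```

A table like this costs 512 bytes per character set, which is the memory
trade-off mentioned above once many sets are compiled in.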

Oh, and from_to gets fun if you try to do it by smart-combining the mappings
instead of going through an intermediary character set. :)
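
For the simplest case, two 8-bit sets, combining the mappings could look
roughly like the sketch below. The function names and the choice of '?' as
the substitute for unmappable characters are assumptions made for
illustration, not anything Encode defines. The idea is to compose
"source byte -> Unicode" with the inverse of "destination byte -> Unicode"
once, and then convert with a single lookup per byte.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical substitute byte for characters the destination set lacks. */
#define SUBSTITUTE 0x3F /* '?' */

/* Build a direct "source byte -> destination byte" table from two
   "byte -> Unicode code point" tables. Source bytes whose code point
   does not occur in the destination table map to SUBSTITUTE. */
void compose_8bit(const uint16_t src_to_ucs[256],
                  const uint16_t dst_to_ucs[256],
                  uint8_t direct[256])
{
    int s, d;
    assert(src_to_ucs != NULL && dst_to_ucs != NULL && direct != NULL);
    for (s = 0; s < 256; s++) {
        direct[s] = SUBSTITUTE;
        /* Linear search for the destination byte with the same code point. */
        for (d = 0; d < 256; d++) {
            if (dst_to_ucs[d] == src_to_ucs[s]) {
                direct[s] = (uint8_t)d;
                break;
            }
        }
    }
}

/* Convert a buffer in place using a composed table. */
void convert_direct(const uint8_t direct[256], unsigned char *buf, size_t n)
{
    size_t i;
    assert(direct != NULL && buf != NULL);
    for (i = 0; i < n; i++)
        buf[i] = direct[buf[i]];
}
```

Even here the unmappable characters need a policy, and the composed table is
needed per pair of character sets rather than per set. Multi-byte encodings
do not reduce to a flat 256-entry table at all.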

-- 
Complete the following sentence: People *ought* to weigh bricks, cats
and cinnamon in the same units because... - Ian Johnston