perl-unicode

Re: 5.8 roadmap and Encode

2002-03-04 07:35:07
On Mon, Mar 04, 2002 at 10:06:39PM +0800, Autrijus Tang wrote:
On Sat, Mar 02, 2002 at 08:14:12PM +0800, Autrijus Tang wrote:
Thanks! I'm done with conversion and tested against libiconv. Patch follows;
files are available at <http://autrijus.org/ucm.tar.gz>.

Also, the resulting file size is quite hefty:

-rw-r--r--  1 root  512  1688107 Mar  2 19:51 euc-tw.ucm
-rw-r--r--  1 root  512  1543333 Mar  2 19:51 gb18030.ucm

And they add ~600k to the compressed perl distribution. Is that acceptable?

That's bit too much, I'm afraid.

So, what's the verdict on this? Should I make these two into a CPAN module?

Yes.

I wonder what the name should be... Encode::HanPack?

Encode::HanExtra?

In somewhat related news, I've made two additional maps (big5-simp and 
euc-cn-trad) available as Encode::HanConvert on CPAN; they maps the 
traditional
& simplified encodings into the *opposite* unicode characters, to acheive
a more lossless mapping than directly possible via from_to(). Just FYI. :)

/Autrijus/



-- 
$jhi++; # http://www.iki.fi/jhi/
        # There is this special biologist word we use for 'stable'.
        # It is 'dead'. -- Jack Cohen

<Prev in Thread] Current Thread [Next in Thread>