On Mon, Mar 04, 2002 at 10:06:39PM +0800, Autrijus Tang wrote:
On Sat, Mar 02, 2002 at 08:14:12PM +0800, Autrijus Tang wrote:
Thanks! I'm done with conversion and tested against libiconv. Patch follows;
files are available at <http://autrijus.org/ucm.tar.gz>.
Also, the resulting file size is quite hefty:
-rw-r--r-- 1 root 512 1688107 Mar 2 19:51 euc-tw.ucm
-rw-r--r-- 1 root 512 1543333 Mar 2 19:51 gb18030.ucm
And they add ~600k to the compressed perl distribution. Is that acceptable?
That's a bit too much, I'm afraid.
So, what's the verdict on this? Should I make these two into a CPAN module?
Yes.
I wonder what the name should be... Encode::HanPack?
Encode::HanExtra?
In somewhat related news, I've made two additional maps (big5-simp and
euc-cn-trad) available as Encode::HanConvert on CPAN; they map the
traditional and simplified encodings onto the *opposite* Unicode
characters, to achieve a less lossy mapping than is directly possible
via from_to(). Just FYI. :)
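
To illustrate the point about lossiness, here is a rough sketch in Python rather than Perl, using Python's standard `big5` and `gb2312` codecs as stand-ins for the Encode maps. It is not how Encode::HanConvert is implemented; the `transcode` helper and the one-entry `TRAD_TO_SIMP` table are hypothetical and exist only for this example. The assumption is that a traditional character such as U+9AD4 has no code point in GB2312, so a plain decode-then-encode round trip through Unicode cannot carry it over, while substituting the simplified counterpart first can.

```python
# Sketch only: why transcoding Big5 -> GB2312 straight through Unicode loses text.
TRADITIONAL = "\u9ad4"  # traditional form, encodable in Big5
SIMPLIFIED = "\u4f53"   # simplified counterpart, encodable in GB2312


def transcode(octets: bytes, src: str, dst: str) -> bytes:
    """Decode from src and re-encode to dst; unmappable characters become '?'."""
    return octets.decode(src).encode(dst, errors="replace")


big5_octets = TRADITIONAL.encode("big5")

# Direct route: the traditional character does not exist in GB2312,
# so the encoder can only substitute a replacement character.
direct = transcode(big5_octets, "big5", "gb2312")

# Hypothetical one-entry table standing in for a full
# traditional-to-simplified mapping.
TRAD_TO_SIMP = {ord(TRADITIONAL): SIMPLIFIED}

# Mapped route: swap in the simplified character before encoding.
converted = big5_octets.decode("big5").translate(TRAD_TO_SIMP).encode("gb2312")

print(direct)
print(converted.decode("gb2312") == SIMPLIFIED)
```

A real mapping needs far more than a lookup of single characters, since the traditional/simplified correspondence is not one-to-one; the table above only shows the shape of the idea.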
/Autrijus/
--
$jhi++; # http://www.iki.fi/jhi/
# There is this special biologist word we use for 'stable'.
# It is 'dead'. -- Jack Cohen