perl-unicode

Re: iso-2022-jp, adding encodings..

2001-06-16 00:43:12
on 01.6.15 9:04 AM, Edward Peschko at edwardp(_at_)excitehome(_dot_)net wrote:
How exactly do you add a new charset map to Unicode::Map? Where do you get the
encodings from? Where are they defined?

I saw your reference to ftp://ftp.unicode.org/MAPPINGS, but that just points
to a file, not a directory of mapping sets.

All I'm trying to do is convert from UTF8 to iso-2022-jp ( the form of shift
jis that is used in email...) any help on how to do this would be greatly
appreciated...

  If that is the deal, just use Jcode available via CPAN.  And

    use Jcode;

    $iso_2022_jp = jcode($utf8, 'utf8')->iso_2022_jp;

  Unfortunately, Unicode::String (and Unicode::Map8) cannot handle 16-bit
CJKV strings (and that's the reason why I developed Jcode).
  Also note iso-2022-jp is NOT shift jis.  It is jis encoding without
so-called hankaku kana.
  If you want to learn more about CJKV encodings, see Ken Lunde's book

http://www.oreilly.com/catalog/cjkvinfo/

Dan the Man with Too Many Charsets to Handle

--
_____  Dan Kogai
  __/ ____   CEO, DAN co. ltd.
 /__ /-+-/  2-8-14-418 Shiomi Koto-ku Tokyo 135-0052 Japan
   /--/--- mailto: dankogai(_at_)dan(_dot_)co(_dot_)jp / http://www.dan.co.jp/ 
---------
__/  /    Tel:+81 3-5665-6131   Fax:+81 3-5665-6132
         PGP Key: http://www.dan.co.jp/~dankogai/dankogai.pgp.asc


<Prev in Thread] Current Thread [Next in Thread>