on 01.6.15 9:04 AM, Edward Peschko at edwardp(_at_)excitehome(_dot_)net wrote:
How exactly do you add a new charset map to Unicode::Map? Where do you get the
encodings from? Where are they defined?
I saw your reference to ftp://ftp.unicode.org/MAPPINGS, but that just points
to a file, not a directory of mapping sets.
All I'm trying to do is convert from UTF8 to iso-2022-jp ( the form of shift
jis that is used in email...) any help on how to do this would be greatly
If that is the deal, just use Jcode available via CPAN. And
$iso_2022_jp = jcode($utf8, 'utf8')->iso_2022_jp;
Unfortunately, Unicode::String (and Unicode::Map8) cannot handle 16-bit
CJKV strings (and that's the reason why I developed Jcode).
Also note iso-2022-jp is NOT shift jis. It is jis encoding without
so-called hankaku kana.
If you want to learn more about CJKV encodings, see Ken Lunde's book
Dan the Man with Too Many Charsets to Handle
_____ Dan Kogai
__/ ____ CEO, DAN co. ltd.
/__ /-+-/ 2-8-14-418 Shiomi Koto-ku Tokyo 135-0052 Japan
/--/--- mailto: dankogai(_at_)dan(_dot_)co(_dot_)jp / http://www.dan.co.jp/
__/ / Tel:+81 3-5665-6131 Fax:+81 3-5665-6132
PGP Key: http://www.dan.co.jp/~dankogai/dankogai.pgp.asc