perl-unicode

Re: Encode.pm question

2003-05-16 09:30:13
because it is there, UTF-7, though hardly ever used, is a UTF.  jhi has 
recently looked around if there is any more interesting encodings that 
Encode should support but he told me "nothing interesting".   I think I 

"Interesting" was probably a bad choice of words... what I did is that
I basically looked around for lists of character sets beyond those of
Encode and found a few, most importantly GNU recode, and tried to find
some _obvious_ missing character sets.  I found two big legacy
"supersets" (the European ISO-646 variants from the eighties, like
e.g. ISO-646-DK, and the few dozen EBCDIC variants), and some cute
ones (from the historical viewpoint, like the DEC MCS and the HP
calculator character sets), but nothing obvious was missing.
(Disclaimer: since I don't know that much about non-European character
sets, I might do some disservice here to e.g. Arabic or Indic users.)

can squeeze UTF-7 support in the next release.

One of the reasons I procrastinated from adding UTF-7 support was that 
I had no raw UTF-7 data to compare to and no application to see the 
result of.  Now that I know Unicode::String supports that I feel more 
obliged to add the support.

But before I commit the release I will definitely release a patch here 
so "no one" like you can test it.  So be my (alpha|beta) tester, please.

-- 
Jarkko Hietaniemi <jhi(_at_)iki(_dot_)fi> http://www.iki.fi/jhi/ "There is this 
special
biologist word we use for 'stable'.  It is 'dead'." -- Jack Cohen