[Encode] Encode::Perl?

On Monday, March 25, 2002, at 06:59 , Nick Ing-Simmons wrote:

It should not be too hard to take the .ucm file parsing from 'compile'
and teach Encode::Tcl-like all-perl code to read .ucm-s.
We can then rename it Encode::Perl ;-)

I am considering that kind of option but I am not sure if it should goto the perl dist. Thanks to your compile script, Encode is now smartenough to handle most of the major encodings without a help ofEncode::Tcl (ISO-2022 types are so far indivisually handled by perlmodules, such as Encode::JP::JIS).We can go even wilder. I am thinking of developing something likeUnicode::DataBase to implement full support for ISO-2022-(INT|JP-2).The current problem to implement ISO-2022 is encoding; You have to towhat character set a given (Unicode) character maps to but thanksto the character unification rule, this is impossible just by looking atthe character.The solution is to have a database and lookup each character to findwhat character sets have corresponding codepoints, then pick one up by agiven precedence (for instance, you go like "try JIS X 0208, then GB2312, then KSC 5601 for ISO-2022-JP-2). But we need a database to beginwith....


Dan the Man with Too Many Encodings to Support

<Prev in Thread]

Current Thread

[Next in Thread>

Previous by Date:

Re: [Encode] 8.3 rules sucks! check83.pl is obsolete!, Autrijus Tang

Next by Date:

The -san is normal addressing in Japanese like Mr., Dan Kogai

Previous by Thread:

Re: [Encode] Proposal; Make them all .ucm and detach Encode::Tcl, Nick Ing-Simmons

Next by Thread:

Re: [Encode] 8.3 rules sucks! check83.pl is obsolete!, Nick Ing-Simmons

Indexes:

[Date] [Thread] [Top] [All Lists]