perl-unicode

Re: Inverse of /\p{script}/

2003-08-28 10:30:06
On Thu, Aug 28, 2003 at 03:16:20PM +0100, nick(_at_)ing-simmons(_dot_)net wrote:

Does the existing perl5.8.* Unicode support have a way to efficently 
determine which script(s) or block (in unicode sense) a code point belongs
to?

        use Unicode::UCD qw(charscript charblock);
        print charscript(0x0388);
        print charblock (0x30a0);

It seems to make sense to have a hash which maps script names to 
probable (font) encodings 

 (Hiragana | Katakana | Han) => 'jisx0208.1990-0'
 (Greek)                     => 'iso8859-7',  

I dunno about script->font mappings...

So give a (1 character) string how do I get Unicode script/block it is in?

-- 
Jarkko Hietaniemi <jhi(_at_)iki(_dot_)fi> http://www.iki.fi/jhi/ "There is this 
special
biologist word we use for 'stable'.  It is 'dead'." -- Jack Cohen

<Prev in Thread] Current Thread [Next in Thread>