On Thu, Aug 28, 2003 at 03:16:20PM +0100, nick(_at_)ing-simmons(_dot_)net wrote:
Does the existing perl5.8.* Unicode support have a way to efficently
determine which script(s) or block (in unicode sense) a code point belongs
to?
use Unicode::UCD qw(charscript charblock);
print charscript(0x0388);
print charblock (0x30a0);
It seems to make sense to have a hash which maps script names to
probable (font) encodings
(Hiragana | Katakana | Han) => 'jisx0208.1990-0'
(Greek) => 'iso8859-7',
I dunno about script->font mappings...
So give a (1 character) string how do I get Unicode script/block it is in?
--
Jarkko Hietaniemi <jhi(_at_)iki(_dot_)fi> http://www.iki.fi/jhi/ "There is this
special
biologist word we use for 'stable'. It is 'dead'." -- Jack Cohen