perl-unicode

Re: filtering out non-Japanese

2004-12-15 08:30:05
At 12:39 pm +0100 15/12/04, Marco Baroni wrote:

where can I find the hexadecimal hiragana, katakana and kanj ranges?


Here's a very quick way (at least in Mac OS X):


for (@INC) { m~system.+5[^/]~i and $f = "$_"."/unicore/blocks.txt";}
open F, $f;
for (<F>) { m~hiragana|katakana|cjk~i and print }


JD

<Prev in Thread] Current Thread [Next in Thread>