I wrote a transformation that uses unparsed-text() and regex processing to
create an XML version of the Unicode database;
interesting this is also what
http://www.w3.org/2003/entities/2007xml/unicodedata.xsl
does (although uniocde.xml also has a lot of extra data from other
sources, eg entity name info)
you can easily look up what code block a particular character falls
into because it's part of the data for each character.
The character class is, (represented as
<character id="U00022" dec="34" mode="text" type="other">
<unicodedata category="Po"
by the above, but that's not quite the same as the code block though is
it?
David
________________________________________________________________________
The Numerical Algorithms Group Ltd is a company registered in England
and Wales with company number 1249803. The registered office is:
Wilkinson House, Jordan Hill Road, Oxford OX2 8DR, United Kingdom.
This e-mail has been scanned for all viruses by Star. The service is
powered by MessageLabs.
________________________________________________________________________
--~------------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--