xsl-list

Re: [xsl] XSLT script to report Unicode characters and code blocks in file?

2008-05-29 14:14:06

 Has anyone invented this wheel already?

not the whole wheel but you may be able to borrow a few spokes from the
xml file and stylesheets in
http://www.w3.org/2003/entities/2007xml/

unicode.xml has some xml with the unicode block ranges, which
characters.xsl uses to make table headings in tables such as

http://www.w3.org/2003/entities/2007doc/020.html

rather than use lots of xsl:choose coding you can just take the 5-digit
hex value of the character code, then pick up the last block with a
start value (lexicographically) less than or equal to that, in a sequence
pulled from this fragment of unicode.xml

   <unicodeblocks>
      <block start="00000" end="0007F" name="C0 Controls and Basic Latin"/>
      <block start="00080" end="000FF" name="C1 Controls and Latin-1 Supplement"/>
      <block start="00100" end="0017F" name="Latin Extended-A"/>
      <block start="00180" end="0024F" name="Latin Extended-B"/>
      <block start="00250" end="002AF" name="IPA Extensions"/>
...


characters.xsl has various utility functions for getting to and from
hex strings
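
As a rough illustration of the lookup described above, here is a minimal
XSLT 2.0 sketch. It is not the code in characters.xsl: the functions f:hex
and f:hex5, the urn:example:local namespace, and the assumption that a local
copy of unicode.xml sits next to the stylesheet are all made up for this
example. It also assumes the block elements appear in ascending order of
start value and that start values are upper-case hex, as in the fragment
shown.

```xslt
<xsl:stylesheet version="2.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    xmlns:f="urn:example:local"
    exclude-result-prefixes="xs f">

  <xsl:output method="text"/>

  <!-- Assumption: unicode.xml has been saved alongside this stylesheet -->
  <xsl:variable name="blocks"
                select="doc('unicode.xml')//unicodeblocks/block"/>

  <!-- Hypothetical helper: non-negative integer to upper-case hex digits -->
  <xsl:function name="f:hex" as="xs:string">
    <xsl:param name="n" as="xs:integer"/>
    <xsl:sequence select="
      concat(if ($n ge 16) then f:hex($n idiv 16) else '',
             substring('0123456789ABCDEF', $n mod 16 + 1, 1))"/>
  </xsl:function>

  <!-- Hypothetical helper: zero-pad the hex string to five digits -->
  <xsl:function name="f:hex5" as="xs:string">
    <xsl:param name="n" as="xs:integer"/>
    <xsl:variable name="h" select="f:hex($n)"/>
    <xsl:sequence select="
      if (string-length($h) ge 5) then $h
      else substring(concat('00000', $h), string-length($h) + 1)"/>
  </xsl:function>

  <!-- Report each distinct character in the input with its block name -->
  <xsl:template match="/">
    <xsl:for-each select="distinct-values(string-to-codepoints(string(.)))">
      <xsl:sort select="."/>
      <xsl:variable name="hex" select="f:hex5(.)"/>
      <!-- last block whose start compares le the 5-digit hex string -->
      <xsl:value-of select="
        concat('U+', $hex, ' ',
               ($blocks[@start le $hex])[last()]/@name, '&#10;')"/>
    </xsl:for-each>
  </xsl:template>

</xsl:stylesheet>
```

Two caveats on this sketch: code points at or above 100000 hex have six
digits, so the plain string comparison against five-digit start values no
longer orders correctly for them, and the end attribute is not checked, so a
character that falls in a gap between blocks would be reported under the
preceding block.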

David


