x02DB is a spacing ogonek. You want a combining ogonek, which is x0340.
Michael Kay
-----Original Message-----
From: owner-xsl-list(_at_)lists(_dot_)mulberrytech(_dot_)com
[mailto:owner-xsl-list(_at_)lists(_dot_)mulberrytech(_dot_)com] On Behalf Of
M V
Sent: 03 November 2003 16:08
To: XSL-List(_at_)lists(_dot_)mulberrytech(_dot_)com
Subject: [xsl] Re: U+ conversion to Unicode characters
If anyone can offer guidance on this it would be greatly
appreciated. I
know encoding questions are very common, and are oftentimes
answered in the
FAQ, but I can't seem to find anything on how to handle this type of
instance.
I've tested a few instances with an XSL conversion using
UTF-8, for example
& # x0061;& # x02DB, and the charcters display separately --
lowercase a
followed by an ogonek. The behavior seems perfectly logical,
but I was
expecting (hoping for) something different -- lowercase a
with an ogonek
attached ( or & # x00105;).
Many thanks,
-m
I'm getting some XML documents that use U+ notation ready for browser
display so I'm converting the notation -- U+0107 to & # x0107;.
Things were moving along fine until I ran into U+0065+U+02DB
sequence
(and
many others like it) in a document on Poland.
Is the "proper" way to convert these instances to just run the
converted
Unicode characters together and drop the middle +? Will
these characers
display properly if an XSLT encodes the document as
ISO-8859-1 or UTF-8?
Many thanks,
-m
_________________________________________________________________
See when your friends are online with MSN Messenger 6.0.
Download it now
FREE! http://msnmessenger-download.com
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list