If anyone can offer guidance on this it would be greatly appreciated. I
know encoding questions are very common, and are oftentimes answered in the
FAQ, but I can't seem to find anything on how to handle this type of
instance.
I've tested a few instances with an XSL conversion using UTF-8, for example
& # x0061;& # x02DB, and the charcters display separately -- lowercase a
followed by an ogonek. The behavior seems perfectly logical, but I was
expecting (hoping for) something different -- lowercase a with an ogonek
attached ( or & # x00105;).
Many thanks,
-m
I'm getting some XML documents that use U+ notation ready for browser
display so I'm converting the notation -- U+0107 to & # x0107;.
Things were moving along fine until I ran into U+0065+U+02DB sequence (and
many others like it) in a document on Poland.
Is the "proper" way to convert these instances to just run the converted
Unicode characters together and drop the middle +? Will these characers
display properly if an XSLT encodes the document as ISO-8859-1 or UTF-8?
Many thanks,
-m
_________________________________________________________________
See when your friends are online with MSN Messenger 6.0. Download it now
FREE! http://msnmessenger-download.com
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list