I've got XML documents, marked up to a DTD, and calling character entity
sets. When I run through the XSLT processor (xalan) to output another XML
file I find the entities have been converted to something different, and
fairly inconsistently.
What I would like to achieve is having “ ü in my input xml, and
these entities still being untouched in my output. Can anyone advise how I
achieve this please?
What I'm getting are (“, ü), or (“ and ü), or (“
and ü), depending on character encoding settings and entity sets used. Am I
missing something?
“ ü are not predefined character entities.
http://www.w3.org/TR/REC-xml#sec-predefined-ent
They apear as literal text strings
'&' 'l' 'd' 'q' 'u' 'o' ';'
and so when searialized to XML the '&' character is replaced by '&' giving
“
if you are making an HTML document and want these character entities you should
specify the correct character entity and put.
<xsl:output method="html" version="1.0" encoding="ISO-8859-1">
Edward Middleton
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list