xsl-list
[Top] [All Lists]

[xsl] CDATA with different encoding

2009-08-26 04:06:10
Hi,

I've searched the archives with CDATA and Encoding but nothing resulted
so here's my post, maybe trivial, really sorry...

We're receiving xml from a supplier encoded in ISO-8859-1 as specified
also by the direvtive:

<?xml version="1.0" encoding="ISO-8859-1" ?>

but the tags body are encoded in UTF-8.

This would cause the parser to fail:

http://www.w3schools.com/xmL/xml_encoding.asp

So the supplier has surrounded the tag bodies with CDATA. 

<tag>[CDATA[ ...utf-8 ... ]]</tag>

Is this correct? I.e. is it possible to have a differnt encoding inside
a CDATA section from that of the xml?

I've googled a bit but havent found a clear response

We've built a parser with xmlbean last stable version, but the parser
complain about characters inside the tags that are UTF-8, but are
illegal in ISO-8859-1.

com.siap.DPKWebServices.Util.OTA_literal_HttpPost.queryHttp caught an
exception: 29047814 org.apache.xmlbeans.XmlException
 e.toString():org.apache.xmlbeans.XmlException: error: Illegal XML
character: 0x1c
org.apache.xmlbeans.impl.piccolo.io.IllegalCharException: Illegal XML
character: 0x1c
        at
org.apache.xmlbeans.impl.piccolo.xml.XMLReaderReader.read(XMLReaderReader.java:169)
        at
org.apache.xmlbeans.impl.piccolo.xml.PiccoloLexer.yy_refill(PiccoloLexer.java:3474)

Many thanks

Best regards...



-- 
Bartolomeo Nicolotti
SIAP s.r.l.
www.siapcn.it
v.S.Albano 13 12049
Trinità(CN) Italy
ph:+39 0172 652553
centralino: +39 0172 652511
fax: +39 0172 652519


--~------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--

<Prev in Thread] Current Thread [Next in Thread>