xsl-list

RE: Encoding problem

2005-08-25 05:11:07
> I have a xml with utf-8 encoding, I am parsing that xml and then
> converting it to pdf using FOP.
> While reading the xml file I am providing the encoding as windows-1252,

I suspect that when you say the XML file has "utf-8 encoding", you mean that
it starts with an XML declaration that specifies the encoding as being
utf-8; but that you suspect the encoding is really windows-1252.

Or it could be even worse: perhaps the file started life as windows-1252,
and was then put through an iso-8859-1 to utf-8 conversion. If so you're in
a real mess, and the best hope is to try and reverse the whole process.
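
If the file really did take that route (windows-1252 bytes misread as iso-8859-1 and then written out as utf-8), reversing it might look roughly like the Python sketch below. The function name `repair_double_encoding` is made up for illustration, and the whole thing assumes that exact sequence of mistakes; a file damaged some other way needs a different reversal.

```python
def repair_double_encoding(data: bytes) -> str:
    """Undo a windows-1252 -> (misread as iso-8859-1) -> utf-8 conversion.

    Hypothetical helper: assumes the damage happened exactly this way.
    """
    # Step 1: undo the final utf-8 encoding.
    text = data.decode("utf-8")
    # Step 2: undo the wrong iso-8859-1 decoding, recovering the original octets.
    # Raises UnicodeEncodeError if the text holds characters above U+00FF,
    # which would mean the file was not damaged in the assumed way.
    original_octets = text.encode("iso-8859-1")
    # Step 3: decode the octets as what they really were.
    # Raises UnicodeDecodeError for the few octets windows-1252 leaves undefined.
    return original_octets.decode("cp1252")


# Simulate the damage on a string containing a right single quote (U+2019).
damaged = "It\u2019s".encode("cp1252").decode("iso-8859-1").encode("utf-8")
repaired = repair_double_encoding(damaged)
```

An exception at step 2 or step 3 is useful information in itself: it says the assumed history of the file is wrong.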

First step is to analyze the problem: find one of the "unusual" characters
in the file and tell us (a) what the character is supposed to be, and (b)
the sequence of hex octets used to encode it.
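
To illustrate what that analysis is looking for, here is a small Python sketch showing the octets one character produces under each of the scenarios above. The right single quote (U+2019) is only an example character picked for illustration, and `hex_octets` is a hypothetical helper, not part of any library.

```python
def hex_octets(data: bytes) -> str:
    """Format a byte string as space-separated hex octets."""
    return " ".join(f"{b:02X}" for b in data)


ch = "\u2019"  # right single quote, chosen as an example "unusual" character

# The character stored as genuine windows-1252.
as_cp1252 = ch.encode("cp1252")
# The character stored as genuine utf-8.
as_utf8 = ch.encode("utf-8")
# windows-1252 octets misread as iso-8859-1, then converted to utf-8.
as_mangled = as_cp1252.decode("iso-8859-1").encode("utf-8")

print(hex_octets(as_cp1252))   # 92
print(hex_octets(as_utf8))     # E2 80 99
print(hex_octets(as_mangled))  # C2 92
```

Comparing the octets actually found in the file (for example in a hex editor) against these three patterns suggests which scenario applies.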

Badly encoded XML files are not really in scope for this list, but since
it's so hard to locate where in the system an encoding problem has its root
cause, we're usually tolerant...

Michael Kay
http://www.saxonica.com/





