xsl-list
[Top] [All Lists]

Re: [xsl] Character encoding/representation from ISO-8859-1 to UTF-8

2016-10-14 06:22:23

On 11 Oct 2016, at 21:00, Bridger Dyson-Smith 
bdysonsmith(_at_)gmail(_dot_)com 
<xsl-list-service(_at_)lists(_dot_)mulberrytech(_dot_)com> wrote:

<?xml version="1.0" encoding="iso-8859-1"?>
<documents>
      <document>The reality of the effect of natural ventilation in a 
residential attic cavity has been the topic of many debates and scholarly 
reports since the 1930’s.</document>
</documents>

It looks very much like
1) in the XML header you claim the document is ISO-8859-1 encoded, while really
2) it is not. I can see that one character, that ‘ , was decoded as three 
(’). Had the document really been encoded with ISO-8859-1, any decoding would 
have ended up with at most one character (because ISO-8859-1 does not use 
multibyte characters).

try to replace “iso-8859-1” in the xml header with “utf-8”, does that work?

Regards, Soren
--~----------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
EasyUnsubscribe: http://lists.mulberrytech.com/unsub/xsl-list/1167547
or by email: xsl-list-unsub(_at_)lists(_dot_)mulberrytech(_dot_)com
--~--
<Prev in Thread] Current Thread [Next in Thread>