xsl-list
[Top] [All Lists]

Re: [xsl] Character encoding/representation from ISO-8859-1 to UTF-8

2016-10-11 14:55:43
Hi Eliot

On Tue, Oct 11, 2016 at 3:36 PM, Eliot Kimber ekimber(_at_)contrext(_dot_)com <
xsl-list-service(_at_)lists(_dot_)mulberrytech(_dot_)com> wrote:

The characters are not just the ASCII bytes.

I think you will need to match on the characters in question and replace
them with the desired character, e.g.:

<xsl:template match="text()[contains(., '’')]">
<xsl:value-of select="replace(., '’', '’')"/>
<xsl:template/>

And then use a more complete identity transform that handles the text
nodes:

Thank you for the response. I'm afraid I'm guilty of providing an
incomplete picture of my issue: I'm not sure what malformed(?) characters
are in the input documents. My apologies for leaving that detail out, but
seems like it would present a fairly significant problem for doing a
replace().


Cheers,

Eliot

Again, thank for your time and trouble.
Bridger


--
Eliot Kimber
http://contrext.com



From: "Bridger Dyson-Smith bdysonsmith(_at_)gmail(_dot_)com" 
<xsl-list-service@lists.
mulberrytech.com>
Reply-To: <xsl-list(_at_)lists(_dot_)mulberrytech(_dot_)com>
Date: Tuesday, October 11, 2016 at 2:59 PM
To: <xsl-list(_at_)lists(_dot_)mulberrytech(_dot_)com>
Subject: [xsl] Character encoding/representation from ISO-8859-1 to UTF-8

<?xml version="1.0" encoding="iso-8859-1"?>
<documents>
<document>The reality of the effect of natural ventilation in a
residential attic cavity has been the topic of many debates and scholarly
reports since the 1930’s.</document>
</documents>
XSL-List info and archive <http://www.mulberrytech.com/xsl/xsl-list>
EasyUnsubscribe <-list/1230532> (by
email <>)

--~----------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
EasyUnsubscribe: http://lists.mulberrytech.com/unsub/xsl-list/1167547
or by email: xsl-list-unsub(_at_)lists(_dot_)mulberrytech(_dot_)com
--~--
<Prev in Thread] Current Thread [Next in Thread>