How about:
<xsl:template match="content">
<xsl:analyze-string select="." regex="<.*?>">
<xsl:non-matching-substring>
<xsl:value-of select="."/>
</xsl:non-matching-substring>
</xsl:analyze-string>
</xsl:template>
which when applied to:
<content><![CDATA[
<p>The <strong>keyword</strong> is nice to have but is not needed to
include in a solr feed</p>]]></content>
give this:
The keyword is nice to have but is not needed to
include in a solr feed
cheers
andrew
On 1 February 2010 14:06, bw <bwakkie(_at_)gmail(_dot_)com> wrote:
Hi Michael,
This is exactly why I want to remove it ;-). I was even thinking about
some fancy perl script command to remove it now.
On 29/01/2010, Michael Ludwig <milu71(_at_)gmx(_dot_)de> wrote:
bw schrieb am 29.01.2010 um 12:02:10 (+0100):
Hello,
I have a big xml feed out of my content management system that
includes wysiwyg html tags inside CDATA tags.
I am looking for a way to remove the CDATA and only get the text.
<content><?\s?[Ww](ak{2}ie)?$
--~------------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail:
<mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--
--
Andrew Welch
http://andrewjwelch.com
Kernow: http://kernowforsaxon.sf.net/
--~------------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--