On 06/12/2011 22:42, Karlmarx R wrote:
Hello David,
Yes, I do process the content in 2 stages: I preprocess into one form
of XML and then further process that into my final XML form. BUT, BOTH
are done in XSL with one single file, and the problem that I reported
is in the first-stage conversion itself. To make things even clearer,
here is a rough skeleton and explanation of my process. I get the
entire content of the input into a variable $input-text, and then
tokenize it to get each line of data into another variable, as
below.
<xsl:variable name="lines" select="tokenize($input-text, '\r?\n')"/>
Ah, so perhaps something like:
<xsl:stylesheet version="2.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:d="data:,dpc"
    exclude-result-prefixes="d">

  <xsl:import href="http://web-xslt.googlecode.com/svn/trunk/htmlparse/htmlparse.xsl"/>

  <xsl:variable name="in"><![CDATA[
Line one text <b>within valid node</b> and like <II .> Title etc
Line two with <1a .> Title etc, <i>within</i> <b>something</b> etc
another line can be just normal text
]]></xsl:variable>

  <xsl:template name="main">
    <out>
      <xsl:sequence select="d:htmlparse($in,'',false())"/>
    </out>
  </xsl:template>

</xsl:stylesheet>
which makes
saxon9 -it main a.xsl
<?xml version="1.0" encoding="UTF-8"?><out>
Line one text <b>within valid node</b> and like <II .> Title etc
Line two with <1a .> Title etc, <i>within</i> <b>something</b> etc
another line can be just normal text
</out>
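
If the line-by-line structure from the tokenize step needs to be kept, the same call could be applied to each line in turn. What follows is only a rough sketch under some assumptions: the `input-uri` parameter, the `input.txt` file name and the `line` wrapper element are hypothetical names chosen for illustration, and it assumes that no element spans a line break in the input.

```xml
<xsl:stylesheet version="2.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:d="data:,dpc"
    exclude-result-prefixes="d">

  <xsl:import href="http://web-xslt.googlecode.com/svn/trunk/htmlparse/htmlparse.xsl"/>

  <!-- Hypothetical parameter: location of the plain-text input file -->
  <xsl:param name="input-uri" select="'input.txt'"/>

  <!-- Read the whole input as a string, then split it into lines -->
  <xsl:variable name="input-text" select="unparsed-text($input-uri)"/>
  <xsl:variable name="lines" select="tokenize($input-text, '\r?\n')"/>

  <xsl:template name="main">
    <out>
      <!-- Skip blank lines; "line" is a hypothetical wrapper element -->
      <xsl:for-each select="$lines[normalize-space()]">
        <line>
          <!-- Same call as in the stylesheet above, applied to one line -->
          <xsl:sequence select="d:htmlparse(., '', false())"/>
        </line>
      </xsl:for-each>
    </out>
  </xsl:template>

</xsl:stylesheet>
```

It would be run the same way, with `-it main`, and a different file could be supplied by setting the `input-uri` parameter on the command line.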
David