
Re: [xsl] For-each-group group-starting-with drops text between inline elements

2020-09-03 10:27:43
On 03.09.2020 at 17:20, Terry Ofner tdofner(_at_)gmail(_dot_)com wrote:
I have a document with the following structure:
<span class="itemNum">(1)</span>First <b>sentence</b> of the passage.
<span class="itemNum">(2)</span> Second sentence of the passage.
<span class="itemNum">(3)</span> Third sentence of the passage.

     I need to chunk this into separate items:

<p class="item" itemNum="(1)"><b>(1)</b> First <b>sentence</b> of the passage.</p>
<p class="item" itemNum="(2)"><b>(2)</b> Second sentence of the passage.</p>
<p class="item" itemNum="(3)"><b>(3)</b> Third sentence of the passage.</p>

     If there were no nodes in the text between spans, I could use
tokenize, which I do on such occasions.
     With sets such as the one above, I have been trying to use
for-each-group. But I am unable to capture the text between the span
     Here is the relevant section of my current stylesheet (3.0 Saxon-PE


To include text nodes or any nodes in the grouping population you need
to use /node() instead of /* in the path for the select.

The only issue might be the text node before the first span. Either
select /node()[normalize-space()] to drop whitespace-only nodes, or,
inside the for-each-group, check whether you have a "real" group
starting with a "span" or have just collected leading text.
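
A minimal sketch of both suggestions combined, since the original stylesheet is not shown here. It assumes the spans and the surrounding text are children of a wrapper element called "passage"; that name and the template around the for-each-group are hypothetical and need adapting to the real input.

```xslt
<xsl:stylesheet version="3.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <!-- Assumption: "passage" is a hypothetical wrapper element whose
       children are the itemNum spans, inline elements and text. -->
  <xsl:template match="passage">
    <!-- node() puts text nodes into the grouping population; the
         predicate drops nodes whose string value is only whitespace. -->
    <xsl:for-each-group select="node()[normalize-space()]"
        group-starting-with="span[@class = 'itemNum']">
      <xsl:choose>
        <!-- A "real" group: its first node is an itemNum span. -->
        <xsl:when test="self::span[@class = 'itemNum']">
          <p class="item" itemNum="{.}">
            <b><xsl:value-of select="."/></b>
            <!-- Everything in the group except the leading span. -->
            <xsl:copy-of select="current-group() except ."/>
          </p>
        </xsl:when>
        <!-- Otherwise the group is content found before the first span. -->
        <xsl:otherwise>
          <xsl:copy-of select="current-group()"/>
        </xsl:otherwise>
      </xsl:choose>
    </xsl:for-each-group>
  </xsl:template>

</xsl:stylesheet>
```

Note that the normalize-space() predicate also removes empty elements from the population, so if the input can contain something like an empty "br" between spans, a narrower predicate such as node()[not(self::text()[not(normalize-space())])] may be the safer choice.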

