Just curious Jeni,
Using:
<xsl:output method="xml" indent="yes"/>
<xsl:variable name="regex" select="'(([^_]*)_PARA)'"/>
<xsl:variable name="input" select="'ABC_PARA__PARA'"/>
<xsl:template match="/">
<xsl:analyze-string select="$input" regex="{$regex}">
<xsl:matching-substring>
<RULE_MATCH>
Input <xsl:value-of select="$input" />
</RULE_MATCH>
<REGEX><xsl:copy-of select="$regex" /></REGEX>
<GROUP1><xsl:value-of select="regex-group(1)" /></GROUP1>
<GROUP2><xsl:value-of select="regex-group(2)" /></GROUP2>
<GROUP3><xsl:value-of select="regex-group(3)" /></GROUP3>
.
.
</xsl:matching-substring>
<xsl:non-matching-substring>
<mismatch><xsl:value-of select="."/></mismatch>
</xsl:non-matching-substring>
</xsl:analyze-string>
</xsl:template>
produces
<?xml version="1.0" encoding="UTF-8"?>
<RULE_MATCH>
Input ABC_PARA__PARA</RULE_MATCH>
<REGEX>(([^_]*)_PARA)</REGEX>
<GROUP1>ABC_PARA</GROUP1>
<GROUP2>ABC</GROUP2>
<GROUP3/>
.
.
<mismatch>_</mismatch>
<RULE_MATCH>
Input ABC_PARA__PARA</RULE_MATCH>
<REGEX>(([^_]*)_PARA)</REGEX>
<GROUP1>_PARA</GROUP1>
<GROUP2/>
<GROUP3/>
.
.
Why are there two lots of output please?
regards DaveP
-----Original Message-----
From: Jeni Tennison
If you're using Saxon 7.6, why not use the built-in regular expression
support rather than Java extensions?
Example: if my regex is defined as:-
(([^_]*)_PARA)|((.*?)(PARA)(.*?))
and my input is "ABC_PARA"
Then I need to know what portions of the input matched
each (if any) part of the regex groups, ([^_]*) and (.*?) etc,.
in terms of group number and matching string.
If you use <xsl:analyze-string> then within the
<xsl:matching-substring>, the regex-group() function gives you access
to the values of the regex groups. Try:
<xsl:for-each select="$rule//Type">
<xsl:variable name="tdlType" select="." />
<xsl:analyze-string select="." regex="{$regex}">
<xsl:matching-substring>
<RULE_MATCH>
<xsl:value-of select="$rule//@name" />
</RULE_MATCH>
<REGEX><xsl:copy-of select="$regex" /></REGEX>
<GROUP1><xsl:value-of select="regex-group(1)" /></GROUP1>
<GROUP2><xsl:value-of select="regex-group(2)" /></GROUP2>
<GROUP3><xsl:value-of select="regex-group(3)" /></GROUP3>
...
</xsl:matching-substring>
</xsl:analyze-string>
</xsl:for-each>
Of course you need to know what regular expression is being used to be
able to tell which regex groups you're actually interested in...
By the way, I'm not sure why you're using "//" in $rule//@name -- this
will return all the name attributes of all the elements that are
descendants of $rule, as well as of $rule itself, and return them in a
whitespace-separated list (this being XSLT 2.0). Is that what you
really want?
Cheers,
Jeni
---
Jeni Tennison
http://www.jenitennison.com/
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
-
NOTICE: The information contained in this email and any attachments is
confidential and may be legally privileged. If you are not the
intended recipient you are hereby notified that you must not use,
disclose, distribute, copy, print or rely on this email's content. If
you are not the intended recipient, please notify the sender
immediately and then delete the email and any attachments from your
system.
RNIB has made strenuous efforts to ensure that emails and any
attachments generated by its staff are free from viruses. However, it
cannot accept any responsibility for any viruses which are
transmitted. We therefore recommend you scan all attachments.
Please note that the statements and views expressed in this email
and any attachments are those of the author and do not necessarily
represent those of RNIB.
RNIB Registered Charity Number: 226227
Website: http://www.rnib.org.uk
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list