If you want to write queries that handle all the nuances of natural
language text, I would strongly recommend using a text retrieval
language rather than XPath. Many XQuery implementations have free text
retrieval modules.
Michael Kay
Saxonica
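
For readers who have not seen a full-text query, here is a minimal sketch using the W3C "XQuery and XPath Full Text 1.0" syntax. It assumes a processor that implements that specification (support varies, and many XPath processors do not offer it) and that $text is already bound to the text being searched. Whether the 'œ' ligature is matched against 'oe' depends on the processor's tokenizer and is not guaranteed by the options shown.

```xpath
(: Sketch only: requires a processor with XQuery and XPath Full Text 1.0 support. :)
(: Matches the token regardless of letter case and of diacritical marks.          :)
$text contains text 'Lecœur'
  using case insensitive
  using diacritics insensitive
```

The match works on tokens rather than raw substrings, which is why line breaks and surrounding punctuation matter less here than with contains().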
On 18/01/2013 22:12, Costello, Roger L. wrote:
Hi Folks,
I want to determine if 'Lecœur' occurs in $text.
A naïve solution is this XPath expression:
contains($text, 'Lecœur')
However, that does not take into account many important factors:
1. Perhaps 'Lecœur' occurs, but in $text it is in uppercase
2. Perhaps instead of the 'œ' ligature, $text uses 'oe'
3. Perhaps in $text 'Lecœur' is split over two lines and thus is hyphenated
4. Perhaps 'Lecœur' is slightly misspelled in $text and therefore requires
fuzzy matching
And there are many other important factors.
Do you have an XPath solution to this problem that takes into account the many
important factors?
/Roger
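
Factors 1 to 3 in the list above can be partly handled in plain XPath 2.0 by normalizing the text before searching. The expression below is a sketch under stated assumptions, not a complete solution: it assumes an XPath 2.0 processor, assumes the search term is already written in lowercase with 'oe', and treats every hyphen at the end of a line as a soft hyphen. It does nothing for factor 4 (fuzzy matching), which has no built-in XPath function.

```xpath
(: Sketch only, XPath 2.0. Normalization is applied from the inside out:   :)
(: 1. lower-case() folds case, so 'LECŒUR' becomes 'lecœur'                :)
(: 2. the inner replace() rejoins words hyphenated across a line break     :)
(: 3. the outer replace() expands the 'œ' ligature to 'oe'                 :)
contains(
  replace(
    replace(lower-case($text), '-\s*\n\s*', ''),
    'œ', 'oe'),
  'lecoeur')
```

Because the line-break step removes any hyphen followed by a newline, a genuinely hyphenated word that happens to wrap at the hyphen is also joined, which may or may not be wanted.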
--~------------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--