xsl-list
[Top] [All Lists]

[xsl] Tokenize question: tokenize on words, spaces and punctuation

2011-03-16 19:34:17
Hi there,

This is really a question for XPath regex gurus:

I need to tokenize a string of text such that words, punctuation and spaces are split. So from this:

Oh, what a great day!

I need to get:

('Oh', ',', ' ', 'what', ' ', 'a', ' ', 'great', ' ', 'day', '!')

I've been hacking away at this for a while, but regexps aren't my strong suit. Can anyone help?

Cheers,
Martin


--~------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--