I'm stuck trying to eliminate an opening tag from an XML
file. Keep in mind: I cannot change the format of the source
file. It has HTML embedded in a CDATA section. I am running a
series of regex replaces and concats to clean up the source.
Presumably your "Source" below is this "HTML embedded in a CDATA section"?
It doesn't look much like HTML to me!
But occasionally the source file has a "<?xml:namespace
prefix = lbxrt />" tag stuck in the HTML section that I need
to simply delete. I am trying to use a regex group replace to
do this, but regex always leaves "<?" no matter what I do.
It's as if it starts the match after the "?".
You haven't escaped the ? in your regex. "<?" in a regex matches "<" or
nothing, it doesn't match "<?". You need "<\?".
Note that writing the ? as an XML character reference doesn't escape it.
Your ? is translated to a ? long before the regex parser sees it.
Michael Kay
http://www.saxonica.com/
--~------------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--