Also Re: Saxon 9.4 <bold></bold> Transformed to (newline)</bold> Problem
My suggestion would be to flatten New Lines (etc.) to the Unicode "picture"
(␤, I think). Other whitespace can be handled the same way.
This may help[1].
Hyperlinks and "plain text" are not always what they seem once the Search
Engines search and Lawyers lawyer. So, the "link engineering" you might want
to handle another way[2].
And in the real (American English, anyway) world you have *nix heads who never
met a (possibly irreversible un-documentable) short-cut they didn't like.
Before you allow (lazy) people to flatten all diacritics out of Plain Text, try
this XSLT[3]. You should download a copy and look at the source because some
of the C1 Controls do not
display in browsers, when in fact they have been hijacked by Office Suites for
use as proprietary meta characters and need to be promoted/demoted to lose the
proprietary meaning.
--Gannon
[1] http://www.rustprivacy.org/2013/egov/penserv/
[2] http://www.rustprivacy.org/2013/egov/gateway/
[3] http://www.rustprivacy.org/2013/weather/escapeHTMLent.xsl
--~------------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--