Have you considered running the files through HTML Tidy in a batch process? It
would take about ten minutes to script the batch job, and all your messy HTML
would be transformed into well-formed XML.
--
Charles Knell
cknell(_at_)onebox(_dot_)com - email
-----Original Message-----
From: "Jaques, Yves (FIDI)" <Yves(_dot_)Jaques(_at_)fao(_dot_)org>
Sent: Wed, 13 Aug 2003 15:36:53 +0200
To: "'XSL-List(_at_)lists(_dot_)mulberrytech(_dot_)com'"
<XSL-List(_at_)lists(_dot_)mulberrytech(_dot_)com>
Subject: [xsl] pass malformed HTML through the parser?
Our site is XML/XSL, however we are about to receive a thousand pages of old
static html that we will never have the time to turn into XHTML. I would
like to be able to spit it through the parser without parsing it so that I
can wrap our site template around the static html as I do for our other
content. Is this possible?
I have tried the following:
-- using the document() function but the parser just ignored the file as it
was not XML.
Yves Jaques
Information Officer FIDI
FIDI - room F-320
United Nations Food and Agriculture Organisation
viale delle Terme di Caracalla - 00100 Roma - ITALIA
_________________________________________
yves(_dot_)jaques(_at_)fao(_dot_)org
tel. : (39) 0657056058 fax.: (39) 0657052476
Please visit our web site at:
HTTP://WWW.FAO.ORG/FI
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list