It is surprisingly possible to reverse engineer Word's HTML/CSS
export/import and derive a target for an XSLT transform. If done carefully
enough, one can even get some styles in Word (which are wonderfully useful).
It's a finicky job though, and pushes the edges when you get to things like
headers and footers (though much is possible in theory). Yet dreadfully
satisfying to "tame" Word. Don't expect to get the XML back though without
help.
I also like Joerg's other ideas.
Cheers,
Wendell
At 08:00 PM 10/14/2002, Joerg wrote:
Two more possibilities:
- using MS Word XML format - I don't know anything about it, but it
exists. In the future there will be XDocs
(http://www.microsoft.com/office/xdocs/default.asp).
- using an XML framework like Cocoon (http://xml.apache.org/cocoon/),
serialize the output to RTF and open this with Word. So you don't need any
additional Java coding, only XSLT.
___&&__&_&___&_&__&&&__&_&__&__&&____&&_&___&__&_&&_____&__&__&&_____&_&&_
"Thus I make my own use of the telegraph, without consulting
the directors, like the sparrows, which I perceive use it
extensively for a perch." -- Thoreau
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list