xsl-list
[Top] [All Lists]

RE: Converting HTML to plain text

2004-06-22 08:11:03
At 09:09 PM 6/21/2004, Jelmer wrote:
>Generally, XSLT can't be used on arbitrary HTML. A poor man's solution is
>to use a tool like HTML Tidy to make XML for XSLT from the HTML, but I
>don't know if that could be adapted to your requirement for "a platform
>independent way" (IIRC it is compiled for different platforms).

http://jtidy.sourceforge.net/

Also there's a hint on John Cowan's "Tag Soup" page that suggests that Saxon's parser includes his utility to handle HTML tag soup. Worth a look.

Cheers,
Wendell


======================================================================
Wendell Piez                            
mailto:wapiez(_at_)mulberrytech(_dot_)com
Mulberry Technologies, Inc.                http://www.mulberrytech.com
17 West Jefferson Street                    Direct Phone: 301/315-9635
Suite 207                                          Phone: 301/315-9631
Rockville, MD  20850                                 Fax: 301/315-8285
----------------------------------------------------------------------
  Mulberry Technologies: A Consultancy Specializing in SGML and XML
======================================================================



<Prev in Thread] Current Thread [Next in Thread>