xsl-list
[Top] [All Lists]

RE: Converting HTML to plain text

2004-06-23 06:53:09
You can use lynx from a shell script like this

lynx -dump http://www.google.com >google.txt

This is useful if you want to convert a large number of pages

Though you might be able to do it by calling the ie activex object as well,
but it's less trivial than simply calling it from a shell script

-----Original Message-----
From: Daniel Joshua [mailto:daniel(_dot_)joshua(_at_)gridnode(_dot_)com] 
Sent: woensdag 23 juni 2004 12:45
To: xsl-list(_at_)lists(_dot_)mulberrytech(_dot_)com
Subject: RE: [xsl] Converting HTML to plain text

M$ IE 6.0 has a "Save As" feature which you can as text (.txt)... what does
Lynx do better?


Regards,
Daniel


-----Original Message-----
From: David(_dot_)Pawson(_at_)rnib(_dot_)org(_dot_)uk 
[mailto:David(_dot_)Pawson(_at_)rnib(_dot_)org(_dot_)uk]
Sent: Wednesday, 23 June, 2004 5:13 PM
To: xsl-list(_at_)lists(_dot_)mulberrytech(_dot_)com
Subject: RE: [xsl] Converting HTML to plain text


I've heard that the best way is to use the Lynx browser,
which has a save as text option.

HTH DaveP>

--
DISCLAIMER:

NOTICE: The information contained in this email and any attachments is
confidential and may be privileged. If you are not the intended
recipient you should not use, disclose, distribute or copy any of the
content of it or of any attachment; you are requested to notify the
sender immediately of your receipt of the email and then to delete it
and any attachments from your system.

RNIB endeavours to ensure that emails and any attachments generated by
its staff are free from viruses or other contaminants. However, it
cannot accept any responsibility for any  such which are transmitted.
We therefore recommend you scan all attachments.

Please note that the statements and views expressed in this email and
any attachments are those of the author and do not necessarily represent
those of RNIB.

RNIB Registered Charity Number: 226227

Website: http://www.rnib.org.uk

--+------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--+--



--+------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--+--




<Prev in Thread] Current Thread [Next in Thread>