procmail
[Top] [All Lists]

Re: Need pointers for HTML --> ASCII conversions

1998-06-20 21:33:58
On Sat, 20 Jun 1998, Stan Ryckman wrote:

At 05:37 PM 6/19/98 +0300, Jari Aalto+list.procmail wrote:
|Fri 1998-06-19 Timothy J Luoma 
<procmail(_at_)Unix(_dot_)luomat+Lists(_dot_)nokia(_dot_)com> list.procmail
| I'm sure someone has written a program script with lynx to convert HTML to 

| regular ASCII text, but can't find any references to it
Here is perl one liner:
   perl -0777 -pe 's/<[^>]*>//g'
It also attacks non-HTML very well, too.
In fact, it will even wreck any copies of itself that it finds  :)


I guess it could be called a quick and dirty solution, but it also misses
the nasty &nbsp things that some htmlisers sprinkle so liberally around the
text.


-- _______________________________________________________________________
  /         <*>   I'll design a proper signature real soon               /
 /__/ . / /_ _/  BillT  mailto:twa(_at_)np(_dot_)ac(_dot_)sg  
http://www.np.ac.sg/~twa   /
/__/_/_/_/ _/__________________________________________________________/

<Prev in Thread] Current Thread [Next in Thread>