On Sat, Dec 3, 2011 at 11:51 PM, Alex Teslik wrote:
OpenWebMail has HTML handling and HTML to text conversions specifically for
email. They are tested and could probably be integrated into mhonarc with
It would be interesting to see what kind of test data has
been used to verify how good it is at sanitizing data and
how well it handles specially crafted large emails.
A quick scan at some of the regexes indicate some things
may still get through. If you are interested, you can
examine the comments of mhonarc's mhtxthtml.pl filter to
get an idea of the crap one has to deal with.