On Fri, Jun 20, 2003 at 01:51:47AM -0500, Earl Hood wrote:
> On June 19, 2003 at 23:16, Todd Slater wrote:
>> I just started playing with namazu and love it!
>> I indexed the html files of a Mailman (pipermail) archive. When I do a
>> search, the summary contains only the mail/archive stuff, like from,
>> next message, previous message etc. and nothing of the message body. Is
>> there a way to skip x lines for the summary, or is there a better way to
>> index the archive?
> You will need to create a custom filter that understands pipermail
> HTML pages. Namazu already has one for MHonArc archives, but
> I do not think there are any filters for other mail archivers.
> It appears that pipermail does not clearly delimit message header
> information, but at a minimum, you should be able to get namazu
> to only index the message body.
> --ewh
Thanks for the response. I'm not sure about writing a new filter, as I
don't know Perl; perhaps it would be easier if I tried to use MHonArc
instead.
For Mailman I'm using the rpm for Mandrake 9.1. If anybody has any tips
for getting MHonArc working with that I'd appreciate it.
Cheers,
Todd
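
For anyone who would rather not write a Perl filter, one possible workaround is to pre-process the archive pages before indexing, so that only the message body remains. The following is a rough sketch in Python, not a Namazu filter. It assumes the pipermail pages wrap the message body in `<!--beginarticle-->` and `<!--endarticle-->` comments; check your own archive's HTML before relying on that. The function name `extract_body` is made up for illustration.

```python
import html
import re

# Assumption: the pipermail page marks the message body with these
# comments. Verify against your own archive's HTML.
BEGIN = "<!--beginarticle-->"
END = "<!--endarticle-->"


def extract_body(page: str) -> str:
    """Return only the message body text of one archive page.

    If the markers are not found, the page is returned unchanged so
    that nothing is silently dropped from the index.
    """
    start = page.find(BEGIN)
    end = page.find(END, start + len(BEGIN)) if start != -1 else -1
    if start == -1 or end == -1:
        return page
    fragment = page[start + len(BEGIN):end]
    # Strip the remaining tags (e.g. <PRE>, <A HREF=...>) and decode
    # entities such as &amp; so the summary reads as plain text.
    text = re.sub(r"<[^>]+>", "", fragment)
    return html.unescape(text).strip()


if __name__ == "__main__":
    sample = (
        "<HTML><BODY><H1>subject</H1><B>Todd</B>\n"
        "<!--beginarticle-->\n<PRE>I just started\nplaying &amp; testing\n</PRE>\n"
        "<!--endarticle-->\n<LI>Next message</LI></BODY></HTML>"
    )
    print(extract_body(sample))
```

The cleaned text could be written to a separate directory and that directory indexed with mknmz instead of the original archive, at the cost of search results pointing at the cleaned copies rather than the original pages.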