mharc-users

Re: [approved] NEWBE: Immediate rebuilt of archive

2003-05-08 12:17:53
On May 8, 2003 at 14:20, Steffen Kaiser wrote:

The default MHonArc setup lacks a nice user interface and does not
break the archive into monthly periods, so I picked up mharc as well,
ran web-archive, and finally had some nice archive lists.

However, I now want a nice replacement for pipermail for archiving and
cataloguing the mails (which mharc does nicely), but I have not yet
reached the point where I can pull MHonArc and mharc together:

When Mailman delivers a mail to the archive, the posts I have read say
to call "mhonarc -add xyz", but that way I cannot use the mrc files
supplied with mharc. How can I add an arriving post to just the one
mailing list archive whose name (and therefore directory) I already know?

You do not want to use this approach with mharc.  Mharc is designed to
work independently of the list management software.

The model mharc uses is to have a special user account that you
subscribe to the lists you want archived.  Mharc, via cron jobs, then
processes the incoming mail and filters it according to the
information you provide in <mharc-root>/lib/lists.def.

Now, it is possible to support alternative "input" methods into mharc.
The ORGMAIL variable in <mharc-root>/lib/config.sh allows you to specify
any "incoming" mailbox file.  Also, there are the mh-month-pack and
mbox-month-pack scripts that can be used.  If you use these techniques,
you would not use the read-mail and filter-spool script components, but
just the web-archive script.

For your case, you could have Mailman invoke a script for each message
it receives that appends a copy to a mailbox file of your choosing (you
can use procmail to ensure safe delivery).  Then, set the ORGMAIL
variable in config.sh to that spool file (also make sure to set
IS_MAIL_SPOOL to the proper value).  The cron scripts will do the rest
as long as you define lists.def properly.
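
As a rough illustration of the "append a copy to a mailbox file" step,
here is a minimal Python sketch.  It is not part of mharc or Mailman;
the function name append_to_mbox and the command-line convention (one
message on stdin, mailbox path as the only argument) are assumptions
made for this example.  It uses the standard library mailbox module,
which takes a lock before writing, in place of procmail.

```python
import mailbox
import sys


def append_to_mbox(raw_message: bytes, mbox_path: str) -> None:
    """Append one raw message to an mbox file while holding a lock."""
    box = mailbox.mbox(mbox_path, create=True)
    box.lock()  # raises mailbox.ExternalClashError if already locked
    try:
        # add() writes a "From " separator line if the message has none
        # and escapes body lines that begin with "From ".
        box.add(raw_message)
        box.flush()
    finally:
        box.unlock()
        box.close()


# Hypothetical invocation: append_copy.py /path/to/incoming-mailbox
# with the message piped in on stdin by the list software.
if __name__ == "__main__" and len(sys.argv) == 2:
    append_to_mbox(sys.stdin.buffer.read(), sys.argv[1])
```

The mailbox path passed to the script would be the same file that
ORGMAIL points at.  How Mailman is told to pipe each message to such a
script depends on your Mailman setup and is not shown here.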

Do I really need to keep all posts of a mailing list in the flat file UNIX
mailbox and have the mharc "view" of this mailbox rebuilt via cron job? I
was actually assuming that the mail is in the archive, hence I do not
need the unix mailbox anymore; and when I lose interest in the posts of
1995, I just remove the subdirs, re-create the search index, and I'm done
with it.

mharc maintains the raw mailbox files to facilitate HTML archive
recovery and rebuilds.  The crontab is set up to compress, with gzip,
mailbox files that have not been touched in a long time.

You are correct that if you do not want the old data, you can remove
the unix mailbox files that you no longer want.
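
To make the compression step mentioned above concrete, here is a small
Python sketch of the idea: gzip any mailbox file whose modification
time is older than some threshold.  This is not the script mharc ships
in its crontab; the function name compress_stale_mboxes, the flat
directory layout, and the age threshold are all assumptions for
illustration.

```python
import gzip
import os
import shutil
import time


def compress_stale_mboxes(directory, max_age_days):
    """Gzip regular files in `directory` not modified for max_age_days.

    Returns the names of the files that were compressed.
    """
    cutoff = time.time() - max_age_days * 86400
    compressed = []
    for name in sorted(os.listdir(directory)):
        path = os.path.join(directory, name)
        if name.endswith(".gz") or not os.path.isfile(path):
            continue  # already compressed, or not a plain file
        if os.path.getmtime(path) >= cutoff:
            continue  # touched recently, leave it alone
        with open(path, "rb") as src, gzip.open(path + ".gz", "wb") as dst:
            shutil.copyfileobj(src, dst)
        os.remove(path)  # remove the original only after the copy is written
        compressed.append(name)
    return compressed
```

Dropping old data is then just a matter of deleting the compressed (or
uncompressed) mailbox files you no longer care about.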

--ewh
