mharc-users

Re: Weird header format - and how to get .catch messages newly scanned and archived?

2005-05-06 10:39:07
On May 6, 2005 at 10:26, "Uwe E. Bilger" wrote:

> The problem is that yahoo obfuscates parts of the headers. At least
> that's what I think the problem is. The real problem is that a lot of
> the messages don't get archived properly: They get sorted in the
> .catch folder, and there, they don't even have subjects. One example
> of such a message header:

Since the domain portion of the addresses is not present, the procmail
recipes used by mharc cannot properly sort the messages.

One possible solution is to not use the .newmail import technique.
If you have grouped your mbox files so that each file contains
messages from a single list, you can use the mbox-month-pack script,
<http://www.mhonarc.org/mharc/doc/bin/mbox-month-pack.html>, provided
in mharc to break up each mbox file into monthly files for direct
import into the raw mbox area.

Once messages are in the raw mbox area, you can run the web-archive
script directly to update the HTML archives.
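
To picture what the monthly split amounts to, here is a rough Python
sketch. It is not the mbox-month-pack script and does not reproduce its
options or file naming; month_key and group_by_month are hypothetical
names, and using the Date header to pick the month is an assumption
made only for illustration.

```python
import email
from collections import defaultdict
from email.utils import parsedate_to_datetime


def month_key(msg):
    """Return 'YYYY-MM' taken from the Date header, or 'unknown'."""
    raw = msg.get("Date")
    if raw is None:
        return "unknown"
    try:
        return parsedate_to_datetime(raw).strftime("%Y-%m")
    except (TypeError, ValueError):
        # Unparseable Date header (the exception type varies by Python version).
        return "unknown"


def group_by_month(messages):
    """Group parsed messages into a dict keyed by month."""
    groups = defaultdict(list)
    for msg in messages:
        groups[month_key(msg)].append(msg)
    return dict(groups)


if __name__ == "__main__":
    sample = [
        email.message_from_string("Date: Fri, 06 May 2005 10:39:07 -0500\n\none\n"),
        email.message_from_string("Date: Mon, 04 Apr 2005 08:00:00 +0200\n\ntwo\n"),
        email.message_from_string("Subject: no date here\n\nthree\n"),
    ]
    for key, msgs in sorted(group_by_month(sample).items()):
        print(key, len(msgs))
```

For a real import, use the script itself; the sketch only shows the
grouping idea, with each group corresponding to one monthly file.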

Note that your lists.def settings do appear to account for the
missing domains, via your last Address option:

  Name: kirchenmusik
  Description: Kirchenmusikarchiv
  Address: kirchenmusik(_at_)yahoogroups(_dot_)de
  Address: kirchenmusik(_at_)(_dot_)*

However, if the "kirchenmusik(_at_)(_dot_)*" entry was added after you had
already processed your mail once via .newmail, then reprocessing the
same mail will not work.
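
As a rough illustration of why the second Address entry helps, here is
a small Python sketch. It assumes that "(_at_)" and "(_dot_)" are the
archive's obfuscation of "@" and ".", and that the Address values
behave like regular expressions matched against the whole address.
How mharc actually turns them into procmail conditions may differ;
matches_list is a hypothetical helper, not part of mharc.

```python
import re

# Assumed de-obfuscated forms of the two Address lines shown above.
FULL_ADDRESS = r"kirchenmusik@yahoogroups.de"
MISSING_DOMAIN = r"kirchenmusik@.*"


def matches_list(address, patterns):
    """Return True if any pattern matches the whole address."""
    return any(re.fullmatch(p, address, re.IGNORECASE) for p in patterns)


if __name__ == "__main__":
    stripped = "kirchenmusik@"  # domain removed, as in the obfuscated headers
    print(matches_list(stripped, [FULL_ADDRESS]))
    print(matches_list(stripped, [FULL_ADDRESS, MISSING_DOMAIN]))
```

Under those assumptions, an address whose domain has been stripped
only matches once the second, looser pattern is in place.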

Why? Mharc keeps a cache of message-IDs so if a message with the
same message-ID is encountered, it is silently dropped.  This is done
to avoid archiving duplicate messages.

If you want to force mharc to refilter the messages, you will need
to delete the msgid.cache file at the root of your mharc install.
Then, if you do the .newmail import trick again, the messages will
be processed.
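
The effect of the cache can be sketched in Python as follows. This is
only a model of the behaviour described above, not mharc's code: the
cache is represented as an in-memory set rather than the real
msgid.cache file (whose format is not shown here), and filter_new is a
hypothetical name.

```python
import email


def filter_new(messages, seen_ids):
    """Return messages whose Message-ID is not in seen_ids, recording new IDs."""
    kept = []
    for msg in messages:
        msgid = msg.get("Message-ID")
        if msgid is not None and msgid in seen_ids:
            continue  # duplicate: silently dropped
        if msgid is not None:
            seen_ids.add(msgid)
        kept.append(msg)
    return kept


if __name__ == "__main__":
    batch = [
        email.message_from_string("Message-ID: <a@example.org>\n\nfirst\n"),
        email.message_from_string("Message-ID: <b@example.org>\n\nsecond\n"),
    ]
    cache = set()
    print(len(filter_new(batch, cache)))  # first pass: both are new
    print(len(filter_new(batch, cache)))  # second pass: both already cached
    cache.clear()                         # stands in for deleting the cache file
    print(len(filter_new(batch, cache)))  # both are processed again
```

Clearing the set plays the role of deleting the cache file: until that
happens, a second pass over the same messages yields nothing.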

--ewh

---------------------------------------------------------------------
To sign-off this list, send email to majordomo(_at_)mhonarc(_dot_)org with the
message text UNSUBSCRIBE MHARC-USERS
