-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
On Mon, 23 Oct 2000, Earl Hood wrote:
Has anyone already written a tool to turn Mhonarc archives back into mbox
format?
Not that I am aware of.
OK, I guess I have a job to do. I'm kind of surprised that noone has ever
asked for this capability before, though. I guess everyone (except me) is
smart enough to keep the original mbox around just in case. I hope that
the people over at www.mail-archive.com are doing that in case VA Linux
ever decides to pull the plug. They are managing over 5,000 mailing lists
with Mhonarc!
The tool is non-trivial since dealing with attachments is the ugly
part, and there will always be some loss of information (which may
be acceptable in your case).
It should be alright. We've always had a pretty strict "text-only" policy
and plenty of net-cops to politely explain to offenders how to
re-configure their mail clients not to send HTML-formatted messages.
As a start, look at mhmsgfile.pl that is part of the MHonArc distribution
(used by mha-dbrecover).
Thanks, that was a good starting point for extracting header info from the
messages. Is there also a subroutine for extracting bodies? If so, I
hardly need to do any work! :-)
POSSIBLE BUG:
In glancing through the file mhtxtenrich.pl, I ran across the following
line of code (line 58 of mhtxtenrich.pl 2.3 99/06/25 14:18:01):
$data =~ s|<<|\<|gi;
Didn't you mean
$data =~ s|<|\<|gi;
?
I did a grep for '<<' and didn't find it anywhere else except in file
operations. I suspect a typo.
By the way, what revision control system are you using? I've only used
CVS before and I was wondering about the @(#) prefix in the file IDs.
Erik Rossen ^
rossen(_at_)freesurf(_dot_)ch /e\
http://www.multimania.com/rossen --- GPG key ID: 2935D0B9
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.0.0 (GNU/Linux)
Comment: Made with pgp4pine 1.75
iD8DBQE59VQUY88aPik10LkRAn5jAKDLSyJcSIzS98A/SSkHe5sEgDoRzACg0QU1
GYXoAenM6uyjpUDKjPNH1Vg=
=vB3k
-----END PGP SIGNATURE-----