At 5:42 PM -0700 2003/08/12, Justin Mason wrote:
> OVERALL%   SPAM%     HAM%     S/O    RANK   SCORE  NAME
>  495260   343948   151312    0.694   0.00    0.00  (all messages)
Hmm. This is the total number of messages, and not the total
number of IP addresses to look up, right? Do you have any idea how
many IP addresses there are per message that you have to look up?
My own spam archive is pretty short. It only goes back to Sat
Mar 02 00:16:31 2002, comprises 130,338,812 bytes, contains 49,848
messages, and has 20,411 unique IP addresses that I've been able to
find.
My "ham" archive is quite a bit larger. Some folders go back as
far as 1995; it comprises 1,041,157,733 bytes and 506,115 messages,
with supposedly 64,164 unique IP addresses (I suspect the command line
I used to find the unique IP addresses in the spam archive could not
handle the amount of input from the ham archive).
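
For illustration only, here is a minimal sketch of one way to count
unique IP addresses without running into input-size limits. It is not
the command line referred to above. It assumes the archive is in mbox
format, that only IPv4 addresses in Received: headers are of interest,
and the names unique_received_ips and ipv4_addresses are hypothetical.
It reads one line at a time, so memory use grows with the number of
unique addresses rather than with the size of the archive.

```python
import re

# Dotted quad that is not part of a longer dotted number such as 1.2.3.4.5.
IPV4_RE = re.compile(r"(?<![\d.])(\d{1,3}(?:\.\d{1,3}){3})(?!\d|\.\d)")


def ipv4_addresses(text):
    """Yield dotted-quad strings in text whose octets are all <= 255."""
    for match in IPV4_RE.finditer(text):
        candidate = match.group(1)
        if all(int(octet) <= 255 for octet in candidate.split(".")):
            yield candidate


def unique_received_ips(lines):
    """Return the set of IPv4 addresses seen in Received: headers.

    `lines` is any iterable of text lines from an mbox file (assumed
    format: each message starts with a "From " separator line).
    """
    seen = set()
    in_headers = False
    in_received = False
    for line in lines:
        if line.startswith("From "):      # mbox message separator
            in_headers, in_received = True, False
            continue
        if not in_headers:                # skip message bodies
            continue
        if line.strip() == "":            # blank line ends the headers
            in_headers = in_received = False
        elif line[0] in " \t":            # folded continuation line
            if in_received:
                seen.update(ipv4_addresses(line))
        else:                             # start of a new header field
            in_received = line.lower().startswith("received:")
            if in_received:
                seen.update(ipv4_addresses(line))
    return seen
```

Passing an open file object (opened with errors="replace" to tolerate
odd encodings) as `lines` and taking len() of the result would give the
unique count for one folder; the sets from several folders can be
merged with set union.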
I am curious -- is there a reason why you tested with a much
larger spam archive than your ham archive?
--
Brad Knowles, <brad(_dot_)knowles(_at_)skynet(_dot_)be>
"They that can give up essential liberty to obtain a little temporary
safety deserve neither liberty nor safety."
-Benjamin Franklin, Historical Review of Pennsylvania.