On Wed, Mar 10, 2004 at 08:35:54AM -0600, the entity calling itself Skip
Montanaro stated:
A number of things come to mind:
1. Most people have little or no persistent public presence on the
net other than sending/receiving email. (The members of this
list probably represent a small minority of net users.) For
... <snipped for brevity>
on the few false negatives that slip through and shift the spamprob
of each of the hammy words in the paragraph in the direction of spam.
That paragraph will cease to be effective and the spammer will have
to find a new one.
I'm not familiar with the SpamBayes project you contribute to, or the
subtleties wrt how you group words, phrases, etc. for classification. No
doubt it's a valuable tool in the ongoing struggle to control spam.
But again, including these "word salad"/"zombie"/"hammy" inputs in the
training corpus for a Bayesian classifier will have an effect on how it
performs. It would be snake oil to claim otherwise.
Respectfully,
Jay
_______________________________________________
procmail mailing list
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail