Re: Why Spam is a problem

stanislav shalunov wrote:

Further, in its *present* form, where unfamiliar words are given 0.2
spam probability, easily defeated by just adding a lot of randomly
generated `words' like 9nscS9Ft, iuiF0kKw, 6AycPEbU, nsUdjGeP, etc.
Given enough of these, the Bayesian probability formula will declare
even a piece of mail that consists of a sales pitch for a pornographic
web site have a probability of being spam that is arbitrarily close to
0.2.

Is that true? The approach described looks only at the 15 words furthestfrom 0.5; it seems likely that most messages that would rank at 0.9 orabove would have enough spam-words that words at 0.2 wouldn't show up.

One thing that would be necessary, and that the author doesn't mention,would be to decode content-encodings before applying the filter;otherwise spammers could just base64 all their messages.


--
/=============================================================\
|John Stracke      |jstracke(_at_)centivinc(_dot_)com                    |
|Principal Engineer|http://www.centivinc.com                  |
|Centiv            |My opinions are my own.                   |
|=============================================================|
|*BOOM* "Thank you, Beaker. Now we know that is definitely too|
|much gunpowder." -- Dr. Bunsen Honeydew                      |
\=============================================================/

<Prev in Thread]	Current Thread	[Next in Thread>
Re: Why Spam is a problem, (continued) Re: Why Spam is a problem, Brian Bisaillon Re: Why Spam is a problem, Fred Baker Re[2]: Why Spam is a problem, Richard Welty Re: Why Spam is a problem, Keith Moore Re: Why Spam is a problem, Vernon Schryver Re: Why Spam is a problem, Bill Cunningham Re: Why Spam is a problem, Valdis . Kletnieks Re: Why Spam is a problem, Alex Audu Re: Why Spam is a problem, Frank Solensky Re: Why Spam is a problem, stanislav shalunov Re: Why Spam is a problem, John Stracke <= Re: Why Spam is a problem, stanislav shalunov Re: Why Spam is a problem, John Stracke Re: Why Spam is a problem, Perry E. Metzger Re: Why Spam is a problem, John Stracke Re: Why Spam is a problem, stanislav shalunov Re: Why Spam is a problem, Fred Baker Re: Why Spam is a problem, John Stracke Re: Why Spam is a problem, Keith Moore Re: Why Spam is a problem, Perry E. Metzger Re: Why Spam is a problem, Dave Crocker