Re: Why Spam is a problem

Frank Solensky <fsolensky(_at_)premonitia(_dot_)com> writes:

Just posted on slashdot: a Bayesian approach to the problem that reports
to have rates of 0.5% on false positives and 0% false negative:
http://www.paulgraham.com/spam.html


Nice short-term approach.

Unfortunately, easily defeated with just appending (perhaps as an HTML
comment) a long innocent-looking fragment (e.g., a 30KB piece from a
random book).

Further, in its *present* form, where unfamiliar words are given 0.2
spam probability, easily defeated by just adding a lot of randomly
generated `words' like 9nscS9Ft, iuiF0kKw, 6AycPEbU, nsUdjGeP, etc.
Given enough of these, the Bayesian probability formula will declare
even a piece of mail that consists of a sales pitch for a pornographic
web site have a probability of being spam that is arbitrarily close to
0.2.

-- 
Stanislav Shalunov              http://www.internet2.edu/~shalunov/

"Which one is worse?  Both are worse."          -- V. I. Lenin

<Prev in Thread]	Current Thread	[Next in Thread>
Why Spam is a Problem, (continued) Why Spam is a Problem, Bill Cunningham Re: Why Spam is a problem, Brian Bisaillon Re: Why Spam is a problem, Fred Baker Re[2]: Why Spam is a problem, Richard Welty Re: Why Spam is a problem, Keith Moore Re: Why Spam is a problem, Vernon Schryver Re: Why Spam is a problem, Bill Cunningham Re: Why Spam is a problem, Valdis . Kletnieks Re: Why Spam is a problem, Alex Audu Re: Why Spam is a problem, Frank Solensky Re: Why Spam is a problem, stanislav shalunov <= Re: Why Spam is a problem, John Stracke Re: Why Spam is a problem, stanislav shalunov Re: Why Spam is a problem, John Stracke Re: Why Spam is a problem, Perry E. Metzger Re: Why Spam is a problem, John Stracke Re: Why Spam is a problem, stanislav shalunov Re: Why Spam is a problem, Fred Baker Re: Why Spam is a problem, John Stracke Re: Why Spam is a problem, Keith Moore Re: Why Spam is a problem, Perry E. Metzger

Previous by Date:	Re: Why Spam is a problem, Frank Solensky
Next by Date:	Re: Why Spam is a problem, Brian Bisaillon
Previous by Thread:	Re: Why Spam is a problem, Frank Solensky
Next by Thread:	Re: Why Spam is a problem, John Stracke
Indexes:	[Date] [Thread] [Top] [All Lists]