procmail
[Top] [All Lists]

Re: Anti-SPAM recommendations

1998-07-09 04:30:29
On Wed, 08 Jul 1998 17:26:05 +0200, Harald Hanche-Olsen
<hanche(_at_)math(_dot_)ntnu(_dot_)no> wrote:
- Jacques Gauthier <jacques_g(_at_)yahoo(_dot_)com>:
| if sender is the same as the receiver then it's spam
I don't think I have ever seen spam with this characteristic.

FWIW, here's some statistics for those who are wondering. These are
grepped out of the last week or so of the spam-list mailing list. More
information at <http://www.spam-archive.org/>

 $ grep -c '^From ' ~/scratch/inbox/spam-filtered*
 /h/3/ykiel/reriksso/scratch/inbox/spam-filtered:19
 /h/3/ykiel/reriksso/scratch/inbox/spam-filtered.980704:25
 /h/3/ykiel/reriksso/scratch/inbox/spam-filtered.980705:181
 /h/3/ykiel/reriksso/scratch/inbox/spam-filtered.980706:89
 /h/3/ykiel/reriksso/scratch/inbox/spam-filtered.980707:169
 /h/3/ykiel/reriksso/scratch/inbox/spam-filtered.980708:93

 $ grep 'To: equals From: ' ~/scratch/inbox/spam-filtered.980705 | head -3
 X-Rejected: To: equals From: jou1(_at_)prodigy(_dot_)com
 X-Rejected: To: equals From: ghollowell(_at_)hotmail(_dot_)com
 X-Rejected: To: equals From: 
jeu33(_at_)wani(_dot_)dohilab(_dot_)dnj(_dot_)ynu(_dot_)ac(_dot_)jp

 $ sed -e '/^X-Rejected: /!d' -e 's/ [^ @]*(_at_)(_dot_)*//' -e 's/\[.*/\*/' \
 ~/scratch/inbox/spam-filtered* | sort | uniq -c | sort -rn | head
     351        X-Rejected: body contains ugly words *
     265        X-Rejected: To: equals From:
     223        X-Rejected: From .com *
     205        X-Rejected: Received: after From: 
     197        X-Rejected: Received: from x.A (y.B) by z.A
     134        X-Rejected: Short and sweet, many phone numbers *
     127        X-Rejected: Received: from a dialup
     118        X-Rejected: Over 6000 bytes *
     108        X-Rejected: Spam score 3 *
      68        X-Rejected: No valid Message-Id

 $ grep -h '^X-Rejected: ' ~/scratch/inbox/spam-filtered* | wc -l
    2344

Those marked with a "*" are based on additional heuristics, such as a
certain amount of "ugly words" in the header, or being from a
"suspect" domain, etc.

The spam-list spam is obviously only representative for the spam that
people who feed the list are receiving. (If you don't presently feed
the list, please consider signing up. There's no need to receive
anything, but you help people for whom receiving others' spam is
important.) 

/* era */

-- 
 Paparazzi of the Net: No matter what you do to protect your privacy,
  they'll hunt you down and spam you. <http://www.iki.fi/~era/spam/>

<Prev in Thread] Current Thread [Next in Thread>