procmail
[Top] [All Lists]

Re: Those pesky s*bscr*b*/uns*bscr*b* messages

1995-10-20 13:23:53
I've been working on a recipe to filter out those pesky s*bscribe and
uns*bscribe messages from mailing lists, and I'm posting what I have
so far.  As an aside, it also filters out very short messages, which
I've found are usually some sort message meant for list owner/request
address.

I give heavy weight to Subjects starting with (un)?s*bscribe, with
also pretty heavy weight to Subjects containing either of those words.
I then give heavy weight to the body of messages starting with those
words, and a lighter weight to lines starting with them.  Then
multiple occurrences get some weight too, up to a point.  Then I count
the words in the message against all that.


:0B
*  1^0
* 30^0 H ?? ^Subject: +(un)?subscribe\>
* 20^0 H ?? ^Subject:.*\<(un)?subscribe\>
* 20^0 ^^([      ]|$)*(un)?subscribe\>
* 10^0 ^([       ])*(un)?subscribe\>
* 8^.4 \\<(un)?subscribe\>
* -.4^1 \\<[A-Za-z]+\>
junk/.


Here is the score for the current message:

procmail: Score:       1       1 ""
procmail: Score:       0       1 "^Subject: +(un)?subscribe\>"
procmail: Score:       0       1 "^Subject:.*\<(un)?subscribe\>"
procmail: Score:       0       1 "^^([   ]|$)*(un)?subscribe\>"
procmail: Score:       0       1 "^([    ])*(un)?subscribe\>"
procmail: Score:      12      13 "\\<(un)?subscribe\>"
procmail: Score:     -47     -33 "\\<[A-Za-z]+\>"

--
Rodger Anderson