I've been working on a recipe to filter out those pesky s*bscribe and
uns*bscribe messages from mailing lists, and I'm posting what I have
so far. As an aside, it also filters out very short messages, which
I've found are usually some sort message meant for list owner/request
address.
I give heavy weight to Subjects starting with (un)?s*bscribe, with
also pretty heavy weight to Subjects containing either of those words.
I then give heavy weight to the body of messages starting with those
words, and a lighter weight to lines starting with them. Then
multiple occurrences get some weight too, up to a point. Then I count
the words in the message against all that.
:0B
* 1^0
* 30^0 H ?? ^Subject: +(un)?subscribe\>
* 20^0 H ?? ^Subject:.*\<(un)?subscribe\>
* 20^0 ^^([ ]|$)*(un)?subscribe\>
* 10^0 ^([ ])*(un)?subscribe\>
* 8^.4 \\<(un)?subscribe\>
* -.4^1 \\<[A-Za-z]+\>
junk/.
Here is the score for the current message:
procmail: Score: 1 1 ""
procmail: Score: 0 1 "^Subject: +(un)?subscribe\>"
procmail: Score: 0 1 "^Subject:.*\<(un)?subscribe\>"
procmail: Score: 0 1 "^^([ ]|$)*(un)?subscribe\>"
procmail: Score: 0 1 "^([ ])*(un)?subscribe\>"
procmail: Score: 12 13 "\\<(un)?subscribe\>"
procmail: Score: -47 -33 "\\<[A-Za-z]+\>"
--
Rodger Anderson