Re: recipe blocking mail with attachements

At 11:13 2005-06-21 -0500, Damian Menscher wrote:

Procmail wasn't exactly designed to filter spam. You should, at minimum,investigate the scoring features. Or, if you're intelligent, check intospamassassin.

Using or not using SA has no thing to do with intelligence. Severalcontributors here on the procmail list get along fine without it. I've nothad any spam to my inbox in a week, and that's when I added a couple ofrefinements to my filters to deal with some false pozzies on a few lists.

The irony of the regexp being used is that the body keywords will result inhis *OWN* message to this list not arriving back in his inbox.


To the OP:

Scoring would be appropriate if you're going to use simplistic terms -require some number of them to appear before considering the messagejunk. I score on a lot of terms which appear in regular conversation butwhich are more frequently used in spam - they're just not scored ultra-high.

You should set up a sandbox and place your rules in there then pump a lotof your email at it (old saved email, from BEFORE you started using thefilters, or grab up mailboxes from your spam account), so you can see justwhat will be affected. Refer to the VERBOSE logfile to see what conditionsare matching the messages. If you use the MATCH construct:


* \/(term|anotherterm|yetanotherterm)

  ^^ this bit here

then the logfile will end up including a variable assignment to MATCHshowing exactly which of multiple regexp components on a single line wasthe actual matched term (versus merely stating that the whole conditionmatched somehow).

That would allow you to more easily identify what terms are entirely toobroad in your expression.

Perfectly legitimate siglines on some messages will contain toll freenumbers. re*move is a legitimate english word, some of the other terms aretoo short and will (as already indicated by other replies) result inmatches in uu/base64 encoded files, and a unit of measurement isn't a wisechoice of singular word terms either.

Your conditions also make it clear that you seem to believe that they'rematched with case-sensitivity. They're not - unless you add the 'D'flag. So, the bracketed character classes are unnecessary.

Your first rule has multiple condition lines, which *ALL* have to match inorder for the message to be caught by that ruleset. Break it out intoseparate rules (one for the MessageID, another for the Subject, another forthe From:, etc). or use scoring - prefix each condition line like so:


* 9876543210^0 condition

that curious numeric is simply an easy to remember "maximal" value -greater than 2^31 (signed 32 bit value), which says "when this conditionmatches, disregard the rest of the scored tests and consider this messagematched" or something to that effect. If you have non-scored conditions,they'll still have to be evaluated as TRUE for the rule to succeed. Read'man procmailsc'.

See the URL in my sigline for links to the sandbox I publish (which willalso automatically redirect the forwards you do in your recipes).


---
 Sean B. Straw / Professional Software Engineering

 Procmail disclaimer: <http://www.professional.org/procmail/disclaimer.html>
 Please DO NOT carbon me on list replies.  I'll get my copy from the list.


____________________________________________________________
procmail mailing list   Procmail homepage: http://www.procmail.org/
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail