Re: Another Newbie Question.

Thank you for the reply.

I have no problem against filtering a message from the body, but what my
question was, is filtering messages against a word list that I can keep
in a file, instead of adding more and more rules in my /etc/procmailrc

That was the question in short I don't know what even my explanation on
what I was looking for didn't clear this.






On Thu, 13 Mar 2003 08:58:45 -0800, PSE-L(_at_)mail(_dot_)professional(_dot_)org
(Professional Software Engineering) wrote:

At 22:05 2003-03-12 -0800, Multimedia Fan wrote:

My etc/procmailrc is growing and is becoming hard to track.


Break it into component parts and use INCLUDERC to include them into the 
central file.  This is detailed in 'man procmailrc'

Is there a way that I organize the spam detection in the BODY of the
message to fgrep certain words that I was told to filter out?


'man procmail'

then, hit:

        /the body

and press enter (or, if you're adventurous, just search for 'body').

Let's assume that I have SPAMMERS file containing words that we need to
filter.


"SPAMMERS" as a filename rather implies that it is spammer identifications, 
not, say "SPAMTERMS"

How do I check against the message body and not the headers?
Looking at the above, is it possible to check against the subject line
too,  (-x Subject:)?


If you refer to the procmail list archvies and do a few searches, or check 
the sandbox referred to in my .sigline disclaimer, you'll find that some 
people extract certain headers right up front, then they have them in 
variables which they can check easily, like so:

* SUBJECT ?? some|list|of|words

You already know the formail invocation of checking against specific 
headers if that's the route you want to take.

BTW, pursuant to an exchange which you had with someone else here, if 
spending the time to learn procmail, or manage the scripts becomes too much 
of a bother, you might consider just using one of the managed spam 
filtering packages such as SpamAssassin.  That isn't my approach, but if 
you're intent on dealing with spam effectively, you'll find yourself 
spending a lot of _time_ keeping on top of your rules.

Further, simple keyword rulesets are going to give you a LOT of grief - if 
you had a body keyword filter running, your OWN post asking for assistance 
with writing the filter would have been rejected, as would any of the 
replies containing those keywords.  Several may appear in legitimate 
discussions as well.

If you really insist on managing it all yourself, you might take the _time_ 
to investigate 'man procmailsc' - scoring.  Where you judge a message as 
being 'spammish' based on a number of criteria, individually which may be 
insufficient to mark a message as spam, but measured together, identify the 
message as crap.

Another thing to consider: DO NOT SIMPLY /dev/null SUSPECTED MESSAGES.  You 
sure as hell shouldn't do that from a recipe running in /etc/procmailrc 
that you're _testing_.  Unless you're the sole user of your system, and you 
don't mind losing legitimate mail while you're not learning procmail.

---
 Sean B. Straw / Professional Software Engineering

 Procmail disclaimer: <http://www.professional.org/procmail/disclaimer.html>
 Please DO NOT carbon me on list replies.  I'll get my copy from the list.


_______________________________________________
procmail mailing list
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail




_______________________________________________
procmail mailing list
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail