forged localhost sender test

I periodically review the trickle of junk that gets through my spamfilters. I've put it off for a while since I recently moved and have a LOTof stuff to deal with - ignoring the spam doesn't take too much time sinceI already filter the heck out of my mail and spam generally ends up in theotherwise empty unfiltered box, or in my "random contacts to my listaddress" which is my singularly most spammed address. In returning to workthough, the cruft is annoying.

In recent months the technique of submitting messages using a forged SMTPEHLO greeting seems to have taken a sharp rise. This technique has beenused for about three years now, but nowhere near as often as recently, andI mean for SPAM - viruses have used it for a while as well, though I filterfor viruses at a global procmailrc and so they're not part of thestatistics I see for "junk".

Via this method, the submitter APPEARS to be submitting from your host(even though the actual host and IP as looked up during the SMTP exchangedo not match), whether the From: or even envelope addresses claim to befrom your host/domain. Often, they're using the IP as the hostname in theEHLO instead of the hostname itself. In any event, we can't simply discardmail because the EHLO doesn't match the IP - DNS being what it is, there'sa number of reasons this could occur and still not be a positive indicationof forgery (though it really should never occur from your OWN host ofcourse!). Sendmail already flags such mismatches with "may be forged", andI have a small spam score elevation for that event.

Some months ago, I added a "RELAYHOST" determination to my standardprocmail config, which I used for subsequent DNSBL checks. I've sinceupdated the sandbox template as published on my site, which is where youcan see the RELAYHOST extraction process (an invocation of formail to nabthe topmost received: header, and a few match operations, nothing magic atall). If you've got some hinkey MTA, you may need to change the syntaxused in the match constructs - they're based on the very common sendmailReceived: format.

The forged localhost submission recipe is quite basic -- you check theRELAYHOSTEHLO variable against known IPs and hostnames for yourlocalhost. For people running their own little host, this is VERYstraightforward, but if even if you're on some commercial hosting service,chances are the IPs you're expected to receive email on (AT THAT HOST) arelimited, as are the hostnames the system is known by (try 'host -t MXyour_domain' - and the lowest MX score should be the one where your mail isbeing processed at, 'host -t A your_mx_host' to get the one or more IPaddresses associated with that host, and 'host -t PTR your_mx_host_ip' foreach IP to get the hostname associated with the RDNS for the IPs). If themessage appears to have this host in the greeting, then the recipeprogresses to the second condition - checking that the known host IPs matchthe IP used in the connection for the mail host. 'localhost' should beincluded in the RELAYHOSTEHLO check, and 127.0.0.1 (the localhost loopbackIP) should be included in the RELAYHOSTIP check, since local submissionsshould be checked as well.

Users on some dyndns setup (running SMTP on dyndns? ick!) will likely havea config nightmare for this since your IP isn't consistent. This config isinappropriate for fetchmail setups where you're retrieving your messagesfrom a remote POP/IMAP host and re-injecting them locally (in which case,the topmost received should always be your host and will be legit, orperhaps it'd be your ISP mailhost, which still should be legit). If youtinkered with the RELAYHOST variable extractions to get the correctReceived: header to work with, then the recipe would be functional forfetchmail setups.



# 20041212 - forged localhost submission (claiming to be from the MTA system,
# but in actuality being submitted from a foreign IP).
# This recipe relies upon the RELAYHOST variables having previously been
# initialized.
# the RELAYHOSTELHO should match against all known aliases for your host
# - the IP(s), the hostname(s), and localhost.  RELAYHOSTIP should be the
# IPs and localhost IP (127.0.0.1).
:0
* RELAYHOSTEHLO ?? (w\.x\.y\.z|(mail|smtp)\.somedomain\.tld|localhost)
* ! RELAYHOSTIP ?? (w\.x\.y\.z|127\.0\.0\.1)
{
        SPAMVAL="+200"
        SPAMMISHNESS="${SPAMMISHNESS}${SPAMVAL}"

SPAMNOTES="${SPAMNOTES}SPAM: ${SPAMVAL} foreign sender using ourhostname or IP for submission${NL}"

Running this in a sandbox against my spam corpus and regular mailboxresulted in a 0 false-positive rate. Running it against my virus corpusresulted in a hit rate in excess of 45%.

I'm _not_ making the claim that this recipe is going to get rid of gobs andgobs of your spam. What it does for me is isolates a number of the handfulof messages that manage to slip past the rest of my spam filters (and sinceI'm using it as part of a "SPAMMISHNESS" score, it works in conjunctionwith other rulesets, not just on its own). Somehow, last month I ended upwith 44 spam messages which weren't caught by my filters because I've notbeen managing them for serveral months. So far this month, 15 already. Athird of those get nabbed by this recipe (and another third are legitimateforwards from an account on another host which doesn't pre-filter, so theydon't match the criteria in any event).


Comments?
---
 Sean B. Straw / Professional Software Engineering

 Procmail disclaimer: <http://www.professional.org/procmail/disclaimer.html>
 Please DO NOT carbon me on list replies.  I'll get my copy from the list.


____________________________________________________________
procmail mailing list   Procmail homepage: http://www.procmail.org/
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail