Off-topic: mydnsbl (my "too many failures BL") moving from investigation

This doesn't have to do with SPF, but may be interesting to some folkshere. If you are interested in more info, please reply to me off-list.

The most interesting thing about this project is that I found out thatabout 20% of our mail comes from about 2500 IPs which have sent us 9 out of10 bad transactions within the last two hours. Since we get a LOT of bogustraffic, I have been thinking of some ways we can harness the power of allthat crap for good instead of evil :)

ObSPF: It will be interesting to see which domains these "almost certainlyzombies" use to send from. If they pass SPF even from a zombie's location,that probably means the domain should be blacklisted along with the IP. Ifthey fail SPF, even more reason to block that IP (especially if they forgelots of different IPs).

This is the latest draft of "mydnsbl," which is a personal project I'vebeen working on. It's been about half work time and half personal time. Asof now, the software seems to work OK, and the docs are pretty complete(for testing purposes anyway). I will soon be moving on to the moredaunting task of trying to test my new DNSBL with actual user mail, if Ican convince management that it's safe.

I'm posting here in case any of you are interested in doing something likethis at your site, and also to get feedback, comments on things I mighthave missed, hints on how to phase it in and test it, etc.

(I cut this back a bit for posting... the full explanation of the projectis at <http://www.livejournal.com/users/gconnor/121193.html>)

"mydnsbl" is a script that reads syslog activity from a mail server, andcreates a DNSBL based on the "bad" activity. The idea is that I want tokeep track of the last 10 transactions from each IP, and if 9 of the last10 transactions were user unknown, then that IP should go on a local DNSBLfor something like 2 hours.

"Bad" activity in this case is considered to be user unknown, mailing tointernal-only recipients, known spam, known virus, and basically anythingthat results in a failed transaction at your mailer for reasons not yourfault :) This bad activity is offset by "good or neutral" activity, such asdelivered OK, possible spam but not sure, and anything that results in anactual delivery.



Currently there are two pieces of the puzzle working.

http://www.nekodojo.org/~gconnor/mydnsbl/myscanner

The "myscanner" script tails a logfile where my mailservers send theirsyslog output. It takes multiple lines with the same transaction ID andputs the pieces back together, so that the output contains one line pertransaction, telling the IP and the result. This is good for mailserversthat output activity "as it happens" rather than one line per transaction.If you can convince your mailer to output (IP,result) on one line, youprobably don't need myscanner.

Currently "myscanner" only understands Barracuda output, but a similarframework could be used to make sendmail logs into transactional output. Itis currently highly dependent on our specific output though. (There aresimilar programs or perl modules out there that produce summarized outputfor Sendmail and maybe others.)

(cut most details of this piece, see<http://www.livejournal.com/users/gconnor/121193.html> for full version)

Note that if your mail program already reports the result (Sent OK, Unknownuser, spam, unknown domain, etc) on the same line as the IP, you probablydon't need myscanner; just alter mycollect to interpret the log format ofyour mailer.




http://www.nekodojo.org/~gconnor/mydnsbl/mycollect

mycollect keeps track of every IP seen so far in a hash, and with each IPit keeps the result of the most recent 10 transactions, where "result" iseither bad, ok or wtf. If 9 of the last 10 transactions are "bad" then theIP is added to the internal "blocked list". The current blocked list andits expire times are kept in memory, and dumped out to a disk file every 5minutes. The output is just a list of IP addresses, so it will work withrbldnsd but I will eventually add a preamble and some formatting.

mycollect also keeps detailed statistics, which was its main job during theinvestigation phase. I wanted to get detailed info about how many IPs wouldbe blocked, and how many messages that made it through would have beenblocked.

Statistics are reported to STDERR at every 100,000 transactions (if youlike) or when the program receives USR1 signal. Output looks like:


# kill -USR1 %1
#
From: Jan 16 00:46:07  To: Jan 16 02:48:48
total = 300000 (100%) (rbl = 65, ok = 1, bad = 32)
would block = 40027 (13%) (rbl = 0, ok = 0, bad = 12)
cache size = 18887, blocks size = 2609
usertime=329.18, systime=10.11, SZ:RSS=4940:4487
... (later) ...
From: Jan 16 00:46:07  To: Jan 16 06:53:16
total = 1000000 (100%) (rbl = 63, ok = 1, bad = 35)
would block = 176401 (17%) (rbl = 0, ok = 0, bad = 16)
cache size = 19346, blocks size = 2535
usertime=1374.61, systime=33.5, SZ:RSS=5344:4898

This indicates that after 300,000 transactions (about 2 hrs), 2609 IPswould be added to the blocked list, and 40,000 of those transactions wouldhave been avoided, if the DNSBL had been really used. Later, after 1Mtransactions, we have 2535 entries on the BL, and would have blocked176,000 transactions (17%).

It looks like most of the mail that would have been blocked would haveresulted in "User unknown" or would have been caught by other tests anyway,but the real test will be to compare the messages that would have gottenthrough (ok) but are now stopped, to see if the current "ok" number dropssignificantly.

Any feedback is appreciated. Right now it is pretty customized to myenvironment, but should be pretty easy to adapt to other types of input. Ifyou feel like playing with it and running your own output through it,please feel free. I would be interested to see what kind of numbers youcome up with for your site :)


Thanks for taking the time to read!
gregc
--
Greg Connor <gconnor(_at_)nekodojo(_dot_)org>

Off-topic: mydnsbl (my "too many failures BL") moving from investigation to testing