[asrg] 6. proposal of solution: Using Relay Honeypots to Reduce Spam


ASRG and the end of spam

This started as a honeypot proposal and still ends with a honeypotproposal. Along the way it grew. the growth portion is now first.

ASRG was created to deal with the spam problem. The proper way to dealwith the spam problem is to end it, to end spam. ASRG has considerableinfluence that it can use to help bring about such an end. People expectit to make a proposal to end spam: if the proposal appears reasonable andworkable people and organizations that want spam to end will cooperate inthe effort to end spam. This means that if ASRG provides a framework foraction at several levels by several kinds of participant the action is veryprobably going to happen. the ASRG proposal can anticipate cooperationfrom more than just those participating in ASRG and the organizationsrepresented by those participating in ASRG. As an important corollary, ifthe spammers see that ASRG has formulated a plan and that internet forcesare cooperating to bring that plan to fruition the spammers should concludethat the ASRG will succeed and that they will ultimately be stopped fromspamming by the ASRG plan. That's perhaps difficult to assert or believein the present environment but it should be clear that the describedsituation (spammers see no hope) should be the goal, if it can beattained. Part of the purpose of the ASRG is to determine a plan ofaction, part of the purpose is to determine how successful that plan ofaction will be and part of the purpose is to maximize the results of theplan of action, as finally formulated. What follows is presented as thoughit were that plan. What finally results may bear a great resemblance tothis proposal or may completely supplant it. What matters is whether thefinal plan will be very likely to work, and will attract support from thosewhose support may be necessary for success, will convince the spammers thatthe days of spam are ending.

It is important to think of the scope of operations that ASRG canstimulate. The spam problem can be analyzed into its component parts, ASRGcan make proposals for each entity that might act against a particularcomponent of spam. There need not be a single, monolithic approach toending spam. If the proposed actions are reasonable (and part of the ASRGtask is to search for the most powerful reasonable proposal) then theentities asked to take the actions very probably will take action. (Notethat the same reasoning suggested for the spammers applies to the ISPs thatare in any degree spam supporters: there is very great reason to not wantto be the ISP that was last to end its spam-support activities. One ISPwill be: the goal is to motivate such ISPs to change behavior well beforethe end. If the end is coming and the profit can be seen to be coming toand end anyway the smarter path is to become vividly anti-spam.)

The ASRG solution can be comprehensive, can cover several stages ofoperation. ASRG can anticipate that some spammers, at least for a while,will resist the effort to end spam and will attempt to overcome the ASRGmethods. While the ASRG proposal should be comprehensive enough toanticipate such moves the details of such moves cannot beanticipated. ASRG must, therefore, be prepared to modify or augment theirprogram, as new evidence of spammer evasion becomes known. Again, thestandard for action must be that it is action that will succeed: the newproposals of the ASRG should be convincing to those who will be asked toextend their efforts in order to thwart the modified spammer behavior. Inshort, the ASRG should continue watching the spam process and should bequick to analyze new spammer tricks and to devise ways to combat thosetricks and incorporate whatever changes are needed into their plan.

"Entities" denotes classes of those who can take actions specific to theirposition to help end spam. ISPs that harbor spammers are an entity, ISPsin control of resources spammers abuse are an entity, ISPs that are at thetarget end of spam are an entity. End users whose systems are abused tosend spam are an entity, anti-virus companies are an entity. Softwarecompanies and freeware providers may have a function in the final ASRG plan- they could make available any tools called for by the ASRGproposal. These entities obviously may overlap. the important point isthat different actions can be taken at different points in the spam pathwayand that the ASRG solution can, if necessary, call for action at manypoints, with the actions being those specific to the point. It is quitelikely that part of the final ASRG comprehensive plan will be a call forthe continuation and perhaps intensification of existing anti-spamtechniques. The goal is not novelty but is the eradication of spam. ASRGcan provide a framework and mechanism to enhance the interaction ofanti-spam forces and to intensify the targeting of specific aspects of thespam problem. While not an "entity" as defined above the press and mediamay be important to the ASRG solution. It's not enough that principles andmethods be agreed upon: action must be taken by sufficient numbers of thepeople and organizations with the power to take action that success results.

The first analysis of spam is easy: spam has two types, direct, andnon-direct. Direct spam should be completely controllable by use of blocklists, at least direct spam from spam-only sources. (Filters may alsoidentify and stop direct spam - there seems little reason to considerabandonment of filters as anti-spam tools and the redundancy of the filtersprovides backup for the blocklists in the case that the spammer gets a new IP.)

The rest of spam, non-direct spam, has a dual nature: it is both spam andis also the product of abuse. Much of the abuse is possible because of theoriginal model of the internet in which each user of the internet wasassumed to be trustworthy. That trust is now often misplaced. Theattempted solution to the loss of trust has mostly been to make individualsystems no longer trusting (firewalls, secure MTAs, etc.) That has worked(for the most part) for individual systems but has not cured the overallproblem of abuse. The common protections against abuse are passive andactionless. As such they create little incentive for the abandonment ofabusive behavior. Perhaps ASRG can make a significant contribution byexamining whether this single-system-centered passive approach is effectiveand appropriate. (Looking ahead) honeypots are also passive but differfrom the traditional approach in that there are actions inherent in theoperation of the honeypot and that further actions may be taken based onthe results of the honeypot. (In the broadest view honeypots are actionagainst a specific form of abuse and can expand beyond the single-systemlimit.)

A honeypot (particularly an open proxy honeypot) can be a man-in-the-middledefense. The spammer attempts to contact some resource through the openproxy honeypot. The open proxy honeypot could simulate that contact or itcould allow it. In the SMTP dialog with the spammer's intended target theopen proxy honeypot could function exactly as desired, with oneexception. After the data has been transferred to the SMTP servercontacted through the honeypot the honeypot could silently add an RSET tothe transaction (other schemes to disrupt the communication are possible -the point is that it looks real and is real, with the exception that theresult is nil. The open proxy honeypot could RSET after all recipients arespecified and then create a single recipient of abuse@<the target>.) Tothe spammer it will appear as though everything functioned as normal - itdid. The important exception is that the spam did not go through - it wasobliterated. (For completeness the open proxy honeypot should fully logthe transactions form the spammer-sender.) Whether the open proxy honeypotis being used to contact an open relay or is being used to send the spamdirectly to the victim doesn't matter: either way there is no spamdelivered. The Open proxy honeypot need only detect that port 25 is beingcontacted through it and to modify the dialog such that no spam is transmitted.

The long-term ASRG plan may include a replacement or strengthening of theSMTP protocol. What ASRG does need not be limited to a singleaction. Protocol enhancement may be the key part of the ASRG plan but thecurrent intense level of spam calls for immediate action yo substantiallyreduce the amount of spam that reaches all users, as a whole.

What follows was written in response to a message from Paul Judge. Itstarts by quoting what he suggested should be incorporated in a fullhoneypot proposal. It isn't all there, but most of it is. To me the mostimportant point is the point made at the start of this posting: ASRG cantake major action. Relay spam honeypots, as done, have been of the form ofminor isolated action. If you broaden the concept to include action for anentire network segment the power of the approach is increased. While thedescription of honeypots is useful the power, to me, is in the idea oftaking concerted action to end the abuses committed by spammers to sendspam. Whether or not this results in a solution that looks like honeypotsis unimportant. What matters is that spam be ended.


Paul's suggestions:

You should describe the benefits of R.H.
Describe how to deply one from scratch
        -who should do it? Large corporations? Individuals? A single
non-profit?
Give some metrics.
        -how much spam will one see
        -how many are needed to make a difference
Discuss countermeasures
Discuss deployment issues

Put this in the form of a document that someone can pick up and read. It
should convince people to run relay honeypots and show them how to do so. It
should include all of the ideas that you have thrown around in the different
emails.



Background:

That spam comes by abuse pathways is well known. Spammer Alan Ralsky isreported to control 190 email servers, including 160 in the US.(http://www.freep.com/money/tech/mwend22_20021122.htm) Ralsky does not senddirect spam - he must therefore send spam via an abuse pathway. Similarly,many other spammers also send abuse spam.

Historically the abuse spam was sent via open relays almostexclusively. Recently these have been supplemented by open proxies and byopen proxy - open relay combinations. Open relays are still used, arestill a problem. When open relay DNSBL services shut down because theyhave no function it will be known that open relay abuse is no longer asignificant part of the spam problem. Similarly, when open proxy DNSBLsshut down for a similar reason open proxy abuse will no longer be acomponent of the spam problem. Neither of these shutdowns seems imminent.

That these spammers use abuse means two things: there is a constant flow ofabuse packets from their servers and these packets are directed to systemsvulnerable to abuse that the spammers have discovered. That they discovervulnerable systems means that they look for vulnerable systems. No doubtdifferent spammers have different practices - some may seek abusablesystems only overseas, some may seek them only in the US, some may doboth. Little data has been collected to determine what these practices arein general or for particular spammers (or groups of spammers, ifany.) Operators who set up relay spam honeypots are generally successfulin detecting and delivering relay test messages. Many operators of emailsystems report frequent log entries for rejected relay messages. Theevidence is that spammers appear to check for open relays essentiallyeverywhere. If it is necessary to the spammers then the practice creates avulnerability on the part of the spammers. the intnet is to fully exploitthat vulnerabilty.

Whether or not the abuse meets the federal standard for action the abuse istheft of service. It is appropriate to report such abuse to the ISP fromwhich it originates, it is appropriate for the ISP to terminate service tothe customer guilty of the abuse. Surprisingly, some ISPs seem completelyunaware of the abuse and of its implications.

Honeypots, in this context, are systems set up to appear to the spammer tobe vulnerable to abuse but not be vulnerable - some key part of the abuseis intercepted, usually delivery of spam. Functions of honeypots vary butinclude detection of the tests done by spammers to discover and verifyabusable systems and delivering such tests so that the spammer deceiveshimself into believing the tested system is vulnerable. When the spammerso deceives himself he may send relay spam to the system, spam which thesystem makes sure is not delivered. The basic honeypot is a single systemwith a single IP address. Those in charge of larger aggregates (e.g., fullnetwork segments of /24 or larger size) may be able to create gianthoneypots, in which port 25 (or proxy port) traffic directed to IPs thatdon't service the port is diverted to a master honeypot.

This document will emphasize open relay honeypots. Similar thinking canlead to design and implementation of open proxy honeypots.

An open relay honeypot can be, broadly speaking, one of two types. It canbe a standard MTA configured or altered to make it a honeypot or it can benew code, specifically written to function as a honeypot. The designcriteria of an ideal honeypot are simple: always look like an open relay tothe spammer, never deliver spam. Additional criteria may be chosen thatdefine the mode of operation of the honeypot. It might be designed to befully automatic, for example, or might be partially automatic and partiallymanual. At the simplest level an open relay honeypot accepts and deliversthe relay tests sent by spammers. As the honeypot is supposed to deliverno spam and as the act of delivering a spammer relay test usually leads toa spammer sending spam the honeypot needs to have a way to distinguishrelay tests from spam. Several approaches have been used. In oneapproach each new message in the mail queue is examined automatically forrelay-test-like character. The easiest mark of such character is thepresence, in the message, of the IP address of the honeypot itself - the IPaddress of the open relay is the payload of the test message. This addresscan either be plaintext (standard dotted quad) or it can be encoded. Avery frequently seen encoding is to re-express the IP address in itsdecimal ascii form and to put that before the "@" character in themessage-ID. A third form of IP encoding is one in which the periods in thedotted quad representation are replaced by slashes and each digit of therepresentation is replaced by the next higher digit. Thus, 192.168.10.200would become 2:3/279/21/311 (the : is what has been observed as the symbolfor 9 + 1.) As these addresses always appear with the string MAILINF0 thatstring could be just as appropriate for recognizing relay tests, at leastbefore spammers get clever.

A user deploys a honeypot by one of several methods. If the honeypot isbased on a standard MTA the honeypot is installed on a compatible systemthat has no MTA but does have a network connection and an ethernet cardwith an IP address. If there is no such system an older unit may perhapsbe pressed into service. Successful honeypots have been based on a 100 MHz486 DX4, on a 120 MHz PENTIUM, and on a Vaxstation 4000/90. The honeypotneed do nothing compute-intensive - it can be an older system of limitedcapability and still succeed. If the honeypot is one usinghoneypot-specific software it should be one compatible with the software,with nothing else using port 25, and no vital function that is put atgreater risk by the implementation of the honeypot. An example of such aprogram is Jackpot: http://jackpot.uk.net/

A very effective honeypot is one that is substituted for an open relay thatis already being abused. If the open relay is on a system that has no realneed for an MTA the existing MTA can be stopped and an appropriate honeypotinstalled and run. If the system has an email function then its IP numbermight be re-assigned and the honeypot put on a new system that is given theIP number taken from the old system. Regular email follows DNS - spammersgenerally use IP numbers to contact the systems they abuse, so subsitutinga different system for the one previously at a particular IP does workagainst the spammers.

A relay spam honeypot can be successfully run by anyone connected to a netsegment the spammers test for open relays. This appears to include some orall IPs in the US, great Britain, Korea, China, Taiwan, Denmark, Germany -anyplace the spammers look for open relays is appropriate for ahoneypot. This means top-level ISPs can do it, it also means home userswith DSL or cable connections can do it. It is important that honeypots berun by more than just anti-spammers The real power of honeypots comes whenthey exist in large numbers - this requires that they be implemented beyondthe anti-spam community. Honeypots can be simple enough that this isreadily possible.

Honeypot users will see varying amounts of spam (if they deliver relaytests.) What they see depends on which spammer or spammers discover anddecide to use the apparent open relay. Some spammers will hit a relay witha flood of as much spam as they can pump out. Others send meteredamounts. At one time it appeared a volume of around fifty 20-recipientmessages/day was typical for a honeypot (and an open relay) in the US(based on a less-than-representative sample of one.). Last February theMoscow honeypot started working and trapping orders of magnitude morespam. Another foreign honeypot received spam at high levels, with burstsof well over 1 million recipients/day for the trapped spam. (In the firstyear of operation that system stopped spam to 281 million recipients, withan average of less than 1 million recipients/day.) The trapped spam isn'tall that is important - the relay tests are the problem and the key to theaction of the honeypot. The real purpose of honeypots is to disrupt theability of spammers to find open relays. A natural consequence is thatmost honeypots will also trap spam - that's because they disrupt theability to find open relays by masquerading as open relays. If they don'taccept spam the masquerade isn't very good. Nonetheless the goal is tomake discovering open relays so difficult that the spammers give uptrying. That will lead to the quicker end of relay spam. (Similarly foropen proxies.)

While it is a secondary function the spam-stopping power of individualhoneypots is important, does make a small difference. Compared to thetotal daily spam volume any small number of honeypots has little realimpact. That partly works to the advantage of current honeypot operators:if they make no real difference the spammers won't notice that honeypotsexist. Stopping spam is a side-effect of the real power of honeypots. RFC2505 says that securing open relays is not an approach to ending spam. Thereason is that spammers will continue to discover open relays, so that evena 95% success in securing open relays won't stop spam. The key word isdiscover: the problem is that spammers can discover open relays. Anyonecan: try to relay an email message through a million IPs and you'll findsome that will. Spammers work the same way: they look for IPs that willrelay. When the only relay-level anti-spam countermeasure is to secureopen relays the spammers see a simple division of systems on the internet:those that don't deliver their test messages and aren't open relays, thosethat do deliver their test messages and are open relays. It's very simplefor the spammers: if a system delivers a test message it is an open relaywith near 100% certainty. That's what makes detection of open relayssimple for spammers. The real purpose of honeypots is to disrupt thedetection of open relays. It can be regarded this way: as long asanti-spammers can build a good list of open relays spammers can do thesame. In a way the spammers and the anti-spammers work cooperatively tobuild a good list of open relays. Spammers can and have used anti-spammerrelay lists to find usable open relays to send spam. Anti-spammers add totheir lists any open relays discovered by the spammers that become knownthrough relay spam reports.

To really disrupt the spammers may take a number of honeypots equal to orgreater than the number of open relays. If the numbers were equal then thehoneypots would be expected to be receiving roughly half the spam, a 50%cut in spam volume. At that level the spammers need only double theiroutput to keep the same delivery level. (Someday the headroom forincreasing volume will run out - then nothing the spammers can do will keepup the volume.) This analysis neglects the complaints that many honeypotoperators can send - complaints about attempted theft of service,complaints about relay test messages. These complaints multiply theeffectiveness of honeypots since they help disrupt the entire spammingoperation.

While this description is written in terms of open relay honeypots theadvantage may be with open proxy honeypots: these days an open proxyhoneypot is more likely to receive a direct connection from a spammer(making it possible to track the abuse to its source.) The design of atleast one kind of open proxy honeypot is simple: intercept all port 25traffic for other IPs and direct it to an SMTP honeypot (integral to theopen proxy honeypot or external - whatever works.) Other designs arepossible. The important considerations are, as with open relay honeypots,that the honeypot deceive the spammer and that the honeypot not deliver spam.

The prime countermeasure a spammer can take is to stop sending spam to thehoneypot, once he discovers it is a honeypot. In other words, the spammerdoesn't send spam. That's the overall goal, for all destination IPs(0.0.0.0/0) - the spammer is doing the right thing if he stops sendingspam. The real issue would appear to be that of how easily the spammer candiscover the honeypot. If the goal is to make discovery of true openrelays too difficult for the spammer to tolerate then if honeypots are usedthey have to be difficult to detect, have to look very much like true openrelays.

An isolated honeypot should be easily detected, if the spammer tries. Heneed only send some spam addressed to his own dropbox and use the fact ofnon-delivery to establish that an IP is a honeypot (other detection schemesare possible.) Creating a situation in which it is necessary for thespammer to detect honeypots already has made open relay detection moreexpensive to the spammer, more difficult. The goal is to increase thedifficulty until the spammer gives up. The original deception that got thespammer to send spam was to deliver his test message. The same philosophyholds for later test messages, including spam addressed to the spammer'sown dropbox: if you deliver them the spammer is fooled. (In the past somespammers have sent their standard test message simultaneously with a spamrun to discover if the relay remains open. That's trivially easyto handle.) If the spammer sends spam to his own address then he probablywill use that same address multiple times. The honeypotcounter-countermeasure may be to deposit all spammed addresses in a centraldatabase, shared by a consortium of honeypot operators. If the spammeruses a test address with any frequency that address will receiveproportionately more spam than do the ordinary addresses. Once a testaddress is identified honeypots still working can simply deliver any spamthat comes for that address, fooling the spammer.

Lower-level honeypots can also be run - ones that only trap relaytests. These, too, will be detectable by the spammer. If the operatorbelieves his honeypot has been detected by the spammer and if he hasanother IP number available he can simply change the IP number andcontinue. Eventually the spammer will discover that IP, and so on. Thegoal would be for the spammer to abandon consideration of the entirenetwork segment as having potential abusable open relays. Note that ifthe spammer regards acceptance of a test without subsequent delivery of thetest as evidence of a honeypot it is very beneficial for operators of freeemail services used by the spammers for their dropboxes to divert emailmessages to those dropboxes but to leave the accounts active. This looksthe same to the spammer as does a failure of the tested IP to deliver. Theresult could be that the spammer will conclude that an actual open relay isa honeypot because the test message was accepted but the spammer neverreceived it (just as for the honeypot that accepts but doesn'tdeliver.) The goal, always, is to disrupt the spammers' ability to testaccurately.

-o-

Where do honeypots fit in the draft taxonomy(<http://www1.ietf.org/mail-archive/working-groups/asrg/current/msg01794.html>)?


In 1. a), as v) Intercept spam

In 1. a), as vi)  Destroy ability to find open relays (open proxies)

In 1. b), under ii, Tracking.

In 2. as c)  If it's trapped by a honeypot and isn't a relay test: it's spam.

In 3) h) Feedback. Honeypots may discover large numbers of open proxy IPS,may discover spammer IPs.

In 3) k) Teergrubing is possible with a honeypot: Jackpot implementsit. Some hackback potential exists: the honeypot may be the spammer'sfirst contact point in the spam chain. The ethics of causing an open proxycrash by hackback could be debated - there's some justification for thataction.

In 3), as l) Raucous laughter. You can become very amused at a spammersending copious amounts of spam into a black hole. It's even more amusingif the spammer uses some address "trick," to exploit an address-formvulnerability.

Drat, I forgot. Part of simulating an open relay is simulating a bumblingoperator. A honeypot might simulate rejection of standard email addressesbut simulate vulnerability to one of the other forms of address spammerssometimes use. Similarly, a honeypot can have periods of down time andperiods in which it responds with a "disk full" message. I've sometimestotally blocked a spam-source IP: a real bumbling operator might dothat. The spammer need only change IP to get back in. Golly, he got meagain. :-)

Where do honeypots fit in the list of requirements(<http://www1.ietf.org/mail-archive/working-groups/asrg/current/msg01721.html>)?

1. They do reduce the level of unwanted messages. This would more or lessbe a linear function of the number of active and successful honeypots.2. They have nearly zero effect on all valid messages (only extremelyunlikely situations lead to any disruption at all of valid emailmessages.). The honeypot might contend for bandwidth with a localserver. That should be the full extent of the interference, unless somehotshot rogue blocklist somehow learns the honeypot IP and does an expandedlisting around it. No such blocklist operator is known to exist.3. The honeypot can be easy to use. For Jackpot you download Jackpot,download a JVM, and run Jackpot. You can configure Jackpot permanently, inthe jackpot.properties file or you can make configuration changes usingJackpot's built-in web interface. The web interface also allowsexamination of trapped tests and spam.4. The honeypot may be easy to deploy. Jackpot is easy, as described. Asendmail honeypot requires some special configuration. That's easy, insendmail terms. Other honeypots could be implemented to beeasy. Honeypots could be distributed as an optional package to use withanti-virus software or as an optional package with hardware or softwarefirewalls. Honeypots could be made part of the standard distribution ofoperating systems (e.g., Linux.)5. Honeypots do not depend on universal deployment to be effective - manyof their features exist and have power at the single-implementationlevel. To disable the ability to find open relays they will have to bewidely deployed. It will probably help tremendously if there are manyversions and flavors of honeypot. Much of the power of honeypots dependson spammers continuing to search for open relays. If spammers decide tostop searching for open relays and use just the ones they know then the warwill become one of attrition, as existing open relays are secured (withthe best method of securing often being conversion to a honeypot.)6. Privacy. The operator of a honeypot that is successfully trapping spamlearns the email addresses of the intended victims.7. Administration and implementation overhead. More than one level ofhoneypot can be deployed. Simple honeypots have minimal requirements. (Ifhoneypots become plentiful then a honeypot that simply accepted relay emailmessages and discarded them would be effective. The spammers have no wayof knowing which of such systems are he ones that lead to complaints to thespammers' ISPs. They have to worry just as much about such a low-functionrelay test message trap as they do about a honeypot run by the most activeanti-spammer. ) Activates based on data collected by the honeypot may taketime. Jackpot honeypots can report their activity to a central web server,sendmail honeypots could be very simple installations, not honeypots atall, and simply smarthost all email to a central master honeypot. (Thatmay carry a somewhat greater risk of spammer discovery but is possiblle.)8. Bandwidth is whatever the spammer consumes, as limited by the honeypotor by other means. Computational overhead is negligible - it's about like amail server.9. Robustness. Some forms of honeypot are detectable by easy means. Theresponse is to stop testing that IP for being an open relay.10 Legal issues. I'd probably die laughing if a spammer sued me for notletting him steal my service to send his spam. If used at all to triggerlaw enforcement there might be an entrapment defense. Just hooking asystem to the internet is hard to see as being entrapment.


Who can use these ideas to fight spam?

Anyone who has contact with a resource the spammer uses or abuses in hisactivity. The spammer's ISP can detect testing for open relays and takeappropriate measures. The ISP of the systems being tested can takeappropriate measures. The operator of tested systems can take appropriatemeasures. The operator of a freemail service used by the spammer for hisdropbox can take appropriate measures.

Conclusion: This could be a better and more formal proposal but as it isit should serve to provide material for consideration and discussion -that's the main goal. There is a long history of people finding fault withhoneypots simultaneously with real honeypots operating and havingsignificant effect. The proposal is more about fighting spam by fightingthe abuse committed by spammers to send spam. The spammers are vulnerableto such a defense, such a defense, if pressed vigorously, should greatlyaffect the ability of spammers to continue operating. That's thegoal. Honeypots are already working - their number could be increased atany time to increase their overall impact (there's no significant delayconnected with their adoption - no RFC process that need befollowed.) Spam is a huge problem today, honeypots work today. They needto receive careful consideration as a major component of the battle againstspam.




_______________________________________________
Asrg mailing list
Asrg(_at_)ietf(_dot_)org
https://www1.ietf.org/mailman/listinfo/asrg