Re: [Asrg] Re: 3b. SMTP Verification - Reputation Systems and their Prob

Anne P. Mitchell, Esq. wrote:

Oops...1000 apologies - resent with proper subject line:


NP :)

why would any proposed
reputation or accrediation systems of the future be any different?
We're getting a great deal of positive response to our IADB (ISIPPAccreditation Database) even though we haven't publicly announced it yet(if you are not familiar with it you can read about it athttp://www.isipp.com/iadb.php). What are the problems you feel areinnate to DNS blocklists which you feel are also ported to DNSWLs orother non-block DNS lists?

Allow me to explain. There are multiple problems with reputation andaccreditation systems, virtually all of which are non-technical innature. There are a few technical problems as well: suspectibility toDDOS attacks unless the database is distributed and not centralized,whether automatic testing for things like open relays is foolproof,problems with protocols, etc. However I want to concentrate on thenon-technical issues.

There are multiple types of reputation and accreditation systems (pleasefeel free to correct anything). At the most basic, a reputation systemprovides information about a given organization, MTA, IP, ISP, etc. suchas how long has this system been an MTA, average amount of outgoingemail per day, etc. This information is not biased towards or away fromblacklisting, but rather is provided as simply a source of statisticaldata to be used. This is similar to the statistical data provided by acredit agency on a credit report in the US about the accounts a personhas, amount of credit, average balance, etc. The closest example of thistoday is SenderBase.

Then we have the various blacklist reputation systems which providenegative information such as this IP is an open relay, this IP is aspammer, this ISP hosts spammers, etc. This is the most prevalent typetoday, examples abound (SPEWS, MAPS, etc.). A real world example of thiswould be a credir agency reporting negative data about a person such asbankruptcy, bad credit, collections, etc. Human or community-basedsystems such as SpamCop or CloudMark would fall into this category as well.

Now we also have whitelist systems which are not reputation systems perse, but rather are accreditation systems. They allow parties withotherwise bad or unknown reputation such as new providers, IP blocksreassigned from spammers, email that would otherwise be caught by afilter, etc. to be accredited as good guys. Examples of this areBondedSender and your IADB. An example of this in the real world wouldbe a person with bad or unknown credit coming to get a loan, andbringing along a letter of reference or a letter vouching for him from atrusted third party.

I am not saying that they are used specifically by bad guys or peoplewith bad prior reputation, but the main purpose of accreditation systemsis whitelisting email that might not get through otherwise.

There are also differences how different types of systems are managed -some are based on automated testing of open relays and such (SpamHaus'sXBL, MAPS RSS and OPS, etc.), others are based on spam reports frompeople or community (SpamCop, Cloudmark, etc.), yet many are also basedspecifically on human reports and decisions by operators (SpamHausROKSO, SPEWS, etc.). The SMTP VERIFY subgroup has also discussed otherposibilities such as reputation based on "web of trust".

The major problem with all of these is management rather than thetechnical details. Let me go through these:

1. Statistical systems - for these there are several problems. Firstproblem is accuracy - the data collected by the system must be accurateand in order to assure that the collection process needs to be done in astatistically sound matter. In particular, such system needs to covera statistically significant percentage of email traffic to be accurate.Second, problem with statistical systems is lack of openness - thecollection process and internal management practices must be open forpublic scrutiny and preferablly audited on a regular basis by anindependent third party. This will preclude the operators from skewingthe data based on their own motives. Third problem, is the "tyranny bythe mob" - a statistical system must be able to account for thepossibility of multiple reporting systems ganging up together andreporting false data in order to destroy someone's reputation. Thefourth problem is legal, which I am not sure if it would apply. Anne, asa lawyer is more qualified to answer whether such statistical systemsare likely to get sued. The fifth problem is whether such systems raisethe bar for new MTAs on the Internet.

2. Blacklist reputation services - there are multiple problems whichvary depending on how the list is managed. The main problem common toall of these is lack of open procedures about how things get added tothe list, how long they stay on and how they can be taken off. Anotherbig problem is lack of review or any kind of appeal process unlike thecredit agencies in the US which was my real world example. There arealso subtle differences between management types - human and communitybased systems are suspectible to the "tyranny by the mob" problemdescribed above, automated systems need to deal with dynamic IPs andclarify the length of time spend on list and re-testing intervals (AOLretests once a day, not all BLs do), some BL operators tend to listothers on the same block or ISP causing collateral damage, etc. Legalissues very much apply here as we saw from the various lawsuitssurrounding blacklists. Also, in many cases blacklist operators aremaking a decision based on their beliefs which might not the same as thebeliefs of their users. Many ISPs would rather make the decisionthemselves if they had access to the raw data that the BL operators had.That raw data in many cases is not exposed, or cannot be provided in anautomated form. Lack of any appeal process, especially with SPEWS is abig problem as well. Lack of openness in BL operations is an issue. Nothird party audit or review to let us know that the BL operators arefollowing their guidelines.

3. Whitelist and Accreditation services - less problems than others andmore subtle. The main issue is one of a gatekeeper - if a small set ofwhitelists or one is choosen by many large ISPs, that whitelisteffectively becomes the gatekeeper for the email system - "powercorrupts, absolute power corrups absolutely". In order to counteractthat, whitelists need to be very open and auditable by third parties. Ofcourse, currently since we do not have too many of those, it's prettyearlier to say that, but the issue is important to keep in mind. DDOSissues come to mind as well.

In practice it is impossible for an average spam filter to utilize datafrom too many reputation systems, and therefore it is expected that onlya few will be choosen. That would mean we must be careful with whitelistservices to make sure that they do not raise the barrier of entry to theInternet community.


Yakov




_______________________________________________
Asrg mailing list
Asrg(_at_)ietf(_dot_)org
https://www1.ietf.org/mailman/listinfo/asrg

Re: [Asrg] Re: 3b. SMTP Verification - Reputation Systems and their Problems (Modified by Anne P. Mitchell, Esq.)