Re: Authentication/Reputation, TBR (was Everyone Greylists...)



On Dec 12, 2007, at 11:42 AM, David MacQuigg wrote:

Hiding valid email addresses is a fundamentally insolubleproblem. If legitimate senders can find out when an address isinvalid (which they must if we are to preserve reliability), thenspammers can do the same. Perhaps we can sacrifice somereliability for a little more security, e.g. by sending "nosuchrecipient" rejects only when the sender is authenticated andreputable. The rest can be tempfailed until they give up.
This is solvable. TBR can both limit discovery of valid email-addresses AND greatly improve delivery integrity.
Section 3.2 of the TBR draft says '''
  If no valid RCPT TO address is
  supplied, the TBR command will simply fail.  If at least one valid
  RCPT TO address is supplied, then the TBR eXAM-URI argument will be
  accepted.'''
Doesn't this give the sender the same information as a "nosuchrecipient" SMTP reject?

It is fairly common (although fairly problematic) for inbound MTAs touse different "valid recipient lists" as messages are forwarded totheir destination. With TBR, an MTA may accept any local-part for adomain, where subsequent MTAs are then able to silently eliminaterecipients subsequently considered invalid. Expunging recipient TBRreferences does not mandate DSNs to maintain delivery integrity, sincefetching a message represents acceptance of an obligation to ensuredelivery or report failure.

Providers of domain names, IP addresses, or certificates have aconflict of interest, and are unable to prevent access to spammers.
Unwilling, not unable.
Unwilling, and likely unwise. There remains an inherent conflictof interest. Using registration to limit access may also lead toforms of extortion. What type of non-repudiation protects innocentdomains? Must all messages be signed by a CA? What limits thenumber of certificates issued? Should all certificates be aged 30days before being accepted?
I'm no expert here, but it seems that all of these would beeffective if we had a good reputation system in place.

A reputation system is reactive. Abuse tactics are able cycle withinminutes where reputation would be much after this activity.

Then we wouldn't need high security, just enough to raise the costof abusable IDs above the profit from one short spam run. I'mguessing that profit is somewhere around $100.

Should Gmail charge $100 for their email service? Why expect $100charges against a stolen credit cards will limit abuse?

A corporate registration would do it, or maybe a valid credit cardwith no reported theft in 30 days, or even just a Paypal account.

Domain registrars are not willing to hold requests for even 24 hours.Any credit card or online account can be compromised. As it is now,bad actors are able to publish their domain 24 hours before evencooperative registries indicate which domains are new. The domainregistry process demonstrates the inherent conflict of interest withproviding access and preventing abuse.

We can "piggyback" on the ID checking done by the financial-servicesindustry, which has a lot more to lose than a domain-name vouchingservice.

The many many millions of entities wanting a domain reside in almostevery country using different currencies and communication systems. Aconsortium of postal services might be able to handle a centralizedregistration process. Perhaps by correlating domain names with aphysical postal addresses. This would likely create a cottageindustry aimed at generating domain names used by abusers.

All of this applies only to those who need to "jump start" a newlegitimate ID. A good reputation can also be built by simplysending lots of good mail and very little spam to many receiversover a long time.

There remains an inherent conflict of interest, in addition tocompromised systems infiltrating otherwise reputable domains, and ofcourse more time is needed to accrue meaningful reputations. The TBRmechanism affords perhaps 20 minutes needed to resolve reputationsfrom questionable sources.

Messages can be submitted as legitimate users, but likely originatefrom compromised systems. Correlating source patterns and overallvolume determines the trend.
I think I understand what you are saying. Zombies are sending spamnot directly, but through their ISP's outgoing mail servers. TheISP needs better authentication, rate-limits, etc. Most large ISPs(aol.com, yahoo.com, etc.) have this type of abuse under control.

Some are better than others. Bad actors seek out networks imposingfewer limitations and may see dial-ups or port 25 blocking as alimitation, for example.

Blocking port 25 is not a complete solution. Compromised systemscan bypass such blocks. In the case of compromised systems, abuseis widely diffused such that rate limiting and outbound contentfiltering offers diminished relief.
I'm seeing very little spam from the authorized transmitters ofaol.com, yahoo.com, and most other legitimate senders. Eliminatingoutgoing spam is not that difficult.

With an exponential trend, detection failures may be manageable now,but are not likely to remain that way. Reliance upon contentfiltering appears to be creating increasingly difficult to detect spamand malware.

Agreed, but then this also runs into conflicts of interest. Anemail-provider often does not want credit. TBR demands anidentity, a matching MailFrom, and an MX record. In addition, TBRbetter identifies a problem source, and not just the last MTA"holding the bag" so to speak.
An email provider that "does not want credit", or more accurately,one that "is unwilling to assume responsibility" for what is sent inits name, gets the reputation it deserves.

When the domains are large, blocking just at the domain will cause asignificant amount of collateral damage. When domain level blockingis used to triage limited resources, head of queue blocking willbecome increasing predominate problems.

The last MTA on the sending side (the "transmitter" in myterminology) is more than a "bag holder". Of all the partiesinvolved in handling email, the operator of that transmitter is inthe best position to stop spam. Google.com cannot deny that itstransmitters are sending spam.
Google's problem is likely a conflict of interest. It wants newaccounts so much that it is making it too easy for spammers to signup for these accounts. The fix is not costly, however. They justneed to segregate the new accounts, and until they earn some trust,use a different HELO name, and apply more rigorous rate-limits andfiltering.

A widespread abuse problem has been seen by most large emailproviders, where even Google has improved their process. Eliminatingorganizations who attract large numbers of new users seems counterproductive, as these domains also represent desired sources of email.

When spam must comply with grey-listing, transactions willdramatically increase. The vast majority of email is spam where apercentage can not be detected based upon content.
You may be right. I'm seeing a still small but growing percentagegetting past SpamAssassin.
I can see this heading to either of two endpoints. Either thespammers will win, overwhelming the statistical filters and forcingeveryone to sign up with a large ESP with the resources to work outindividual arrangements with other large ESPs, or we find a way tohold senders responsible, and as you say "shift the burden to thetransmitter", where it is 1000X easier to deal with.
If the spammers win, the anti-spam industry and a few large ESPs canlook forward to a long and prosperous future.

There is no profit allowing spammers to win. Spam represents agrowing waste of resources. Instead of one receiving server accessedover a single path, email now often travels over two or more pathsthrough an array of servers needed to meet a resource demand. When asmall and growing percentage of spam slips past filters, and there isa rapidly growing volume and diversity of sources, customers willperceive a problem with the service.

I hope TBR succeeds, but I suspect that the cost to both senders andreceivers will outweigh the benefit of prompt delivery.

Deployment of TBR would be able to better ensure prompt delivery. Thecost to the transmitter needs to be increased, while also reducingcosts for the receiver.

The cost of publishing CSV records was very little, and it wentnowhere. Nothing wrong technically, just a lack of sender motivation.

I do not agree as CSV can offer path registration as well. This pathwould be by name instead of a large IP address list. CSV can insurean upper limit of a couple of transactions instead of 10 x Ntransactions needed to construct possibly sizeable IP address lists.CSV could have provided an authorization scheme using a sha1 base32label published withn the authorizing domain. This approach scales toany number of servers associated with a domain without increasing thetransactions performed by receivers. Too many wanted fairly dangerousscripts risking DNS instead. : (

I think to get widespread adoption, the cost and risk must be notmuch more than publishing an SPF record ending in ?all, and lessthan the cost of publishing CSV records.

Ensuring messages are handled in a timely fashion will likely justifyrather extensive changes. As the problem grows, moving nearly theentire burden to the transmitter will not be a difficult sell. Itwill likely become perhaps the only choice.

I think senders may need a kick in the pants. I'm thinking of anSMTP reject, with a message like "Sorry, your authentication recordsare insufficient, and we cannot assess the reputation of <domain>.Your message will be sent to our quarantine, but we cannot guaranteedelivery. If it is important, call your recipient to make sure theygot it."

There will not be any new notifications. No one wants to explainanything to users. Its too damn expensive. Things will just becomeincreasingly broken.


-Doug