Re: [ietf-dkim] Re: An overview of cryptographic protocols to preventspa


On Sep 26, 2005, at 1:33 PM, Amir Herzberg wrote:

Douglas Otis wrote:
Any advertisement, labeled or not, representing one of perhaps aninordinate number of such messages should _still_ be viewed asspam. Such messages are primarily in the sender's interest, andprimarily at the expense of the recipient.
If properly labeled (e.g. with ADV:) then filtering them becomestrivial. And if you are willing to accept ADV: (or other label)from specific senders, use appropriate authentication mechanisms(e.g. DKIM) to allow these senders (only) and block ADV: from others.

An abuser can easily implement DKIM or any other authentication andauthorization scheme. Authentication by itself provides littlebenefit. Mailbox-domain authorizations alone provide even less, asthis weak form of authorization is easily exploited. However,authentication plus reputation would be able to curtail a majorportion of spam/virus/phishing threats.

A labeling definition that categorizes unsolicited messages as notspam is _not_ useful. The primary consideration regarding whether amessage is abusive is whether permissions were granted by therecipient. This consideration is especially important for bulkmessages offering disproportionately greater value for the senderthan recipients.

A "verified" label plays _no_ role in deciding whether a message isabusive. A message that falsifies a header or label may beconsidered fraudulent, but current email practices allow unseenheaders to be considered that of the sender, and do not define howprior headers are assessed.

A school using a T1 line for Internet access may be unable toaccommodate ADV+DKIM messages and still achieve reasonable Internetaccess. There is still cost associated with ADV+DKIM that you ignorewhen you claim there is a means to "filter" messages. This"filtering" after the exchange still costs the recipient that has nodesire to receive the messages.

If 'ADV:' on the subject line were to mean "this is not spam,"then of course every spammer would use this label.
Not when ADV: also means `this is an ad` and 99.9% of users blockit (at least from all but very few selected senders)... And thisis, in reality, already the case. Spammers rarely put ADV:... evenwhen required by law. Hence the need for `Internet enforcement`.

Laws allow any number of cases where labels may be bypassed.Fortunately, recipients may establish their own criteria ofacceptable behavior. For simple enforcement, labels MUST NOT be acriteria for bypassing a more general requirement of not sendingunsolicited bulk email.

If it were used conscientiously to genuinely indicate anadvertisement to an individual requesting such information, thenof course such a message should not be filtered. As there mustbe a mechanism based upon reputation to determine the integrityof the sender anyway, such labeling would be of extremely littlevalue.
Here I disagree. Current mechanisms use _implicit_ labels of `noads, no virus, ...` - and if you'll read my text, I indeed saidthat `no label` equals a default of `not falling into one of therequired labels`. The labeling mechanism allows senders to send anad to customers who want it, and allow me to send a virus to ananti-virus researcher. I think this is important for free speechand some legitimate usage scenarios.

I can not imagine a label scheme that would safely disable anti-virusprotections. With respect to ads, either messages are desired andtherefore are not a problem with or without a label, or they shouldnot be sent. Freedom goes both ways. The recipient is equally freeto say "No Abusers." The recipient is free to depend upon acommunity assessment of abusive sources. A label scheme attemptingto offer a "consent variance" increases enforcement costs andtherefore would be of _NO VALUE_.

Assume responsible senders would cease sending advertisement whenrequested, and that such responsible senders also predicatesending based upon a request or granted permissions.
Yes? And how would a recipient know that a sender is `responsible`?Based on unproven assertions by blacklists etc? Having signedlabels allows one to _prove_ that somebody cheated.

What do you mean unproven assertions of a black-hole list? Theproof is whether messages were granted or unsubscribed. A label orlack of a label has no bearing with respect to the nature of abuse.

In any case, may I suggest you respond to me privately or in anappropriate forum e.g. asrg, since I think content labels arenot part of DKIM (of course DKIM can be applied to sign suchlabels).
DKIM itself could be viewed as a type of label that can be verified.
DKIM signatures do not contain description of the content (i.e.content label). I was not discussing any `label` just `contentlabels` .

A DKIM signature says this content has not changed and was"permitted" by the signing-domain. DKIM offers a content label thatactually has value, unlike the labeling you describe.

The domain of a compromised router does not need to be within therecipient's domain, as you indicate.
Didn't understand.

You have dismissed a risk by assuming the victim controls acompromised router. DKIM in conjunction with DNSSEC offers asignificant advantage for inhibiting such attacks.

There is a difference between a black-list and a black-hole list,where black-hole list would be the preferred terminologydescribing an IP address qualification mechanism.
Why do you think this is a better term? How do you define thedifference?

A black-hole list is terminology for black-holing (creating a dead-end) for specific addresses emitting abuse. One form of this list inBGP is called black-hole routes. From a legal perspective, black-listing describes specific actions unrelated to black-holing. Seekthe advice of a lawyer for further clarification.

The path based registration schemes are very prone to intra-server attacks, in addition to man-in-the-middle attacks.
What is an intra-server attack? You mean attack on DNS server??

Unlike DKIM, path registration is really just server authorizationprovided by a mailbox-domain. A domain owner using a provider thatdoes not properly assert mailbox-domain constraints, are exposed to"intra-server" abuses by any other client of that provider. Afterall, path registrations are public. You mentioned the problemknowing which mailbox-address is even being checked. This creates asimilar problem at the outbound server as well.

Many MTAs are shared by multiple domains. There should be somemention that mailbox-domain authorization schemes attempt to baseauthorization upon visible headers, where this then violatesnormal conventions.

You have two examples. The PRA in Sender-ID and the SSP inconjunction with DKIM. The PRA locates an authorized list ofaddresses; the SSP locates an authorized list of signing-domains.For various reasons, neither scheme follows SMTP conventions.

This is also true for SSP. The solution often used for caseswhere authorization would inadvertently cause a message to belost, is to use open-ended authorization. Open-endedauthorization may invite exploitations and may cause messages toplaced into "junk" folders, rather than rejected with anindication of a delivery failure.
Sorry - didn't understand this paragraph. Please clarify.

As all mailbox-domain authorization schemes will fail in variouscases, the mitigation involves leaving authorization open-ended,meaning the authorization never totally fails, but may result in themessage being placed into suspense, a separate queue or folder.

Your "ALL" chart lists '+', where this could be seen as the"politically incorrect" mode.
What do you mean by PC here?

While your explanation of the meanings could be derived from thedrafts, this ignores risks associated with unfair reputations accruedagainst the mailbox-domain by various email plug-ins beingannounced. By not PC, this '+' symbol would be the "middle-finger"defiance mode. : ) For SSP, there is the 'y' or the '~' mode insteadof '~' and '?'.

The normal approach would be to use the open-ended '?' mode.Characterization of path based registration as being simple toimplement or alluding to lower CPU overhead is misleading. Thesepath based schemes may require an inordinate level of DNS lookupsconsuming limited I/O resources, whereas CPU resources used forcryptography are generally available and otherwise unused.
I meant path based is less `expensive` than content-based; and onthe CPU vs. I/O, I think the story is not clear, esp. since mostcrypto proposals also involve DNS queries. I'll try to clarify.

Consider that crypto proposals (ignoring SSP) are more deterministicwith respect to the I/O overhead. Your statement then appears toassume there would be a type of white-listing that bypasses contentfiltering. In the few cases where such bypassing might be safelyavoided, there is not enough volume to justify the risk for makingexceptions.

You also question the value of utilizing reputation based uponthe domain. The domain does carry more information than just theIP address. IPv6 addressing may create a similar situation. Aname offers the age and the registrar of the domain.
But my concern is that registring names is cheap (and of coursecould be done in advance). Any solution to this concern?

Although a database for domain names may be large and can growwithout bounds, names provide a history and should cause lesscollateral blocking. IPv6 offers the same potential increase in thesize of the database.

From a cost standpoint, collateral blocking perhaps accounts for themajority of complaints related to listing services. There is alsothe problem of Zombie systems using automated access to provider'sservers where messages are sent an inch deep and a mile wide. Anopaque-identifier in conjunction with DKIM would be a powerful toolto combat this emerging threat. It would also help distribute black-hole listing, as larger problematic domains could publish their ownlists or perhaps delegate the zone to a specialized service provider.

When considering the number of shared MTAs, the use of pathregistration remains dubious, whereas being able to verify thename offers greater value. Even with DKIM, the mailbox-domainmay not be assured and not be verifiableThis is also true for the path registration techniques.Authorizing a mail service _does not_ indicate that mailbox-domain originated the message. There is far greater risksassociated with "poisoning" reputations based upon mailbox-domainauthorizations, which are mechanisms being currently proposed.Basing reputation upon DKIM signatures should eliminate mostpoisoning concerns. This would assume that a replay mechanism isin place.
I think we agree here so I'm not sure if this is a comment/criticism/suggestion on my write-up, if it is, where do you think Ishould clear it up?

I was reacting to the oversight of the risks associated with mailbox-domain authorization and the rather quick dismissal of valueverifying the domain. As example, in the HELO section, HELOverification offers significant value for DKIM or any other domainbased authentication scheme from a resource protection standpoint.The alternative would fail-over to the remote IP address. This ofcourse assumes a domain based reputation scheme is available, inaddition to the IP address black-hole list.


-Doug

_______________________________________________
ietf-dkim mailing list
http://dkim.org

Re: [ietf-dkim] Re: An overview of cryptographic protocols to preventspam