Re: Differences between CSV and Sender-ID


Hi Dave,

I'm not exactly pleased to see my post carved up, with a number of 1-3 lineresponses added for every 1-3 lines I wrote. I feel it is a style thatencourages fighting, bickering, and disagreement rather than mutualunderstanding and consensus-building. However, I don't believe at all thatyou meant to be rude or anything. Perhaps you interpreted my message as anattack against CSV and reacted defensively?

Anyway, you raise some good points here, so I will attempt to reply asconcisely as I can. It's going to appear disjointed but I will try not tobreak things up too much further than they already are.


--Dave Crocker <dhc(_at_)dcrocker(_dot_)net> wrote:


Greg,


GC> A HELO check can catch some really obvious bad cases (like spam or
viruses GC> using the receiver's own name) and some obvious good cases
(like people we GC> want to whitelist).

With respect to CSV, I think this needs to be phrased quite
differently, even though the semantics will probably seem similar, or
at least not contradictory. The reason for this need is that I think it
leads to a very different perspective on the implications of a CSV
mechanism versus an SPF-like mechanism.

HELO makes an assertion about the operation and accountability of the
MTA. There is quite a bit of history and current use of services that
vet sending SMTP clients and their network operators.

A HELO mechanism check can be used to produce a domain-name based
codification of such checking, rather than requiring that white/black
lists be maintained in terms of IP Addresses. The benefit of having a
domain name base involves all of the reasons we all like domain names
better than IP Addresses, for use by humans.

Without commenting on mechanisms, I totally agree with your explanation ofHELO and its significance. I was attempting to keep it a bit simple forthe readers. The two paragraphs above are a good explanation as to whyHELO is significant, and why checking it (by whatever mechanism) isdesirable. All is well.

GC> CSV also uses HELO to tie a reputation to the sending MTA.

The concern about accreditation (of which "reputation" is a subset) is
rather interesting, here. What folks seem to be missing is that ALL
mechanisms that involve acceptance or rejection based on a name or
address has an accreditation component. Accreditation is, in fact, the
acceptance or rejection policy engine.

CSV merely specifies two external standards for such a mechanism.

So, yes, a mechanism that only seeks to detect forgeries does not have
an accreditation mechanism.  But forgive me if I am wrong:  I thought
folks were interested in detecting and preventing spam, and spam is
very, very much about accreditation, not forgery.

Forgery is a current symptom, rather than a core aspect of spam and
virus sending.  Eliminate forgery and there will still be masses of
spam and virus sending.

I had rather hoped we were trying to get at core issues nof fighting
spam, what with the scale of the problem and the cost and delay
inherent in any standards effort.

My statement wasn't meant as a negative. I actually agree with what yousaid here. CSV hangs reputation/accreditation on HELO, SPF chooses to hangit on another identity. I certainly didn't mean to imply thataccreditation is not important.

GC>  This seems to
GC> be based on the assumption that good mail comes from good MTAs, and
bad GC> mail comes from bad MTAs, which some have suggested is not
well-supported.

"Some have suggested" is language that applies to any interesting
topic, for every possible point of view on the topic. Hence, it does
not carry any useful information. (Yeah, I am saying that a bit
sharply. It is a pet-peeve of mine about so-called news reporting and
I really hope the content-free utterance does not seep into serious
technical discussions.)

To move this particular point into something that might be productive,
please refer to the thread "who are we accrediting?" and note John
Levine's posting.  I'll post a response to it.

In the part you quoted, I was trying to point out one area of disagreement,without actually taking sides (I explained my own opinion after the "Myopinion" tag), so my apologies for the vagueness.

So, to be clear, I am suggesting that this assumption is not valid oruseful. I think the reputation of the MTA is often interesting, butcertainly not enough in itself to judge the quality of the mail. But,after reading your message I am starting to think that you don't believethis assumption either, that there are only "good" and "bad" MTAs. In thatcase, this disagreement is not directed at you, but at other CSV supporters(Matthew and Doug) who have been suggesting that checking HELO against areputation is pretty much all you need and other proposals that check otheridentities are worthless, doomed to failure, or both.

GC> I think checking the HELO *alone* is not an adequate solution to the
GC> problem set.

I agree.  RFC2822 author/sender based accreditation is also going to
be needed.  The nature and form of that accreditation is a different
question.



Right...  that's what I was trying to say too :)

GC> If the main thing we want out of MARID is to stop people forging mail

I do not have access to the working group charter as I write this, but
I sure hope that forgery is not the primary concern of the working
group.

Otherwise, there is a rather large community of email users and
providers who are going to rather upset that we spent all this time
and did nothing that is intended to reduce spam.

On the other hand, I could see how "DNS-based MTA authentication" could
cause one to think that forgery is the focus.

Wow, we're interrupting mid-sentence now, I see :) I won't spend much timeon this one other than to say:1. I want to stop spam too, and I think stopping forgery is a necessarybut sufficient step..2. I honestly believed that stopping forgery was the point of the WG andthat stuff that stops spam by other means than stopping forgery would beruled out of scope, and3. It looks like you agreed with the important part of my sentence anyway:)

GC> apparently-from and bouncing-to our own domains, a MAILFROM/PRA check
is GC> going to be required.

Some sort of rfc2822 author/sender accredition is going to be
required... in some cases.



Agreed :)

GC> Mechanically, CSV and SPF are both capable of checking HELO.

Mechanically, CSV and SPF are both fruit. But let me tell you, you do
not want to think about or use durian the same way you think about and
use oranges.

However, your statement highlights a deeper problem in most of the
efforts to discuss CSV and SPF differences:  Such efforts are almost
entirely tied to mechanical and syntactic issues and do not focus on
underlying concepts.

Right... That actuall IS what I mean here -- I mean to separate themechanics of each proposal from the underlying concepts. The assertion Iwas trying to test is whether the mechanism SPF uses to test PRA, MAIL FROMand HELO is capable of doing the same things the CSV mechanism does.

If it cannot for strictly *mechanical* reasons, I would like to understandwhat they are. So far the answer whenever I ask this is "Well, you COULDuse SPF TXT records instead of SRV records, but why would you want to?" Ifit's possible to present an end user with one tool that has twoapplications, that might be a worthwhile goal. Speaking only of the*mechanism* I don't see that SRV records have an inherent advantage overTXT records, or that the underlying concepts of CSV depend on SRV records.

CSV and SPF are fundamentally different pardigms.

    CSV vets an MTA's traffic.

    SPF vets an RFC2822 author/sender's message.

They are orthogonal informational-theoretic areas of consideration.

Where the confusion comes in, of course, is that SPF involves the MTA,
albeit through an indirection.

Let's try for some concise descriptions of the two paradigms:


SPF:

    Per-message MTA path validation, based on Author/Sender
    authorization and accreditations.

CSV:

    MTA traffic validation, based on MTA operator authorization and
    accreditation.


SPF vets an MTA's sending a single message.  It accredits the MTA
based on the RFC2822 author/sender.  While introducing a
path-dependency into the mechanism, it simply defers the hard
question, namely accrediting the author/sender.

CSV vets an entire MTA session.  It accredits the MTA based on the
operator of that MTA.

I don't really agree that CSV and SPF are fundamentally differentparadigms. They are different, but I don't think fundamentally so, and Idon't think either of them represents a "paradigm" really.

I think of SPF not as a great idea, but as a collection of great ideas.Some of these are:

- A mechanism that maps (domain name, IP) onto (pass, fail, unknown)

- Application of this mechanism to MAIL FROM, to vet a message path (orpartial path, when forwarders use SRS)- Application of this mechanism to PRA, to vet a message path (or partialpath, when forwarders use recommended headers)- Accreditation can be applied to the domain of any ID that returns passresult.- An ID that returns fail result should be treated as highly suspect andprobably rejects. An ID that returns unknown result should not be used tojudge a mail as good or bad and the receiver should fall back to othermethods.


CSV is also made of multiple great ideas, such as:

- A mechanism that maps (domain name, IP) onto (allowed, disallowed,no_info)

- Application of this mechanism to HELO, to vet an MTA

- Accreditation can be applied to the MTA based on its name, if result isallowed.- An IP address specifically disallowed from using the name claimed inHELO should be treated as MTA-not-grata- An MTA that has no info CSV may check should be rated on other means(e.g. IP) or not at all.

The point of this exercise is to separate the "mechanism" carrying themessage from the content and meaning of the message itself. The SRV recordmechanism is clever, but I got the feeling from reading CSV documents andspeaking to you and other CSV supporters that it is not the main importantthing that CSV does.

By suggesting that the mechanisms *could be* compatible, I don't mean toimply that the two types of checks already mean the same thing. Theydon't. SPF has a couple of modes where it checks HELO, but it lacks anexplanation as to why one might want to do that, what the informationmeans, and how to interpret it and act on it.

Why am I so keen to show that one mechanism could be used for both checks?Well, one of the first things that this WG worked on was deciding whichidentities to check. My understand was, at the time, that there was apretty strong consensus that we should work on both 2821 and 2822identities, and I *thought* we had also decided that if we tackle oneidentity first, we would do so in such a way that the other identitiescould be checked with the same or similar mechanism.

Current whitelist and blacklist services focus on the MTA network, ie,
the operator of the MTA.  So CSV provides a standardizing mechanism
for existing practise.

The limitations of that practise are demonstrated every day, but so
are the benefits.



That is an excellent point, and I agree.

GC>  - If the MTA name is also used as a HELO name for one of the MTAs
GC>       - In most cases the existing SPF record should be sufficient,
since GC> it probably includes that MTA.

My guess is that you are talking about the narrow case in which the
RFC2822 author/sender has the same domain name as the MTA HELO.  While
a popular scenario, it is a long way from being the ONLY popular
scenario.  And that's the problem. SPF is problematic for a number of
other such popular scenarios.



You are correct, that should have been "sender domain name" not MTA name.

I agree, this is definitely not the majority case. I mention it herebecause it is really the only case where the SAME name may be used by bothemail addresses and HELO. If the same name might be used by an MTA and bythe RHS of an email address, I think chances are very good that the allowedIPs for both cases will be the same.

Again, this hearkens back to the discussion of which identities we want tobe able to validate. At the time, we identified HELO, MAIL FROM andFrom:/Sender:, and along with the idea that perhaps all three meritchecking, we brought up the cases where the same domain name might be usedin different contexts. Each context might have wildly different meaningand usage, but where the NAME is exactly the same, the set of authorizedIPs would usually be the same or a blend of the two usage sets would besuitable. If I remember correctly, not everyone was convinced at the timethat a single set of IPs would always work, so there was some discussion ofa "scope tag" of sorts, but I think most of the group agreed at the timethat the need for this would be rare.

GC> Semantically, there is some difference in the understanding between
what GC> the CSV check means, and what the SPF+HELO check means.

It is rather more than "some".

Agreed. But if the implication is that they are different enough to*require* different mechanisms, I would not agree with that.


Let me say this again because I think it is important:
THE IDEA OF USING ONE MECHANISM TO VALIDATE DIFFERENT IDENTITIES IS NOT NEW.

As I continue to suggest that CSV *could* be implemented using SPF TXTrecords, people continue to look at me as if I'm speaking heresy. All Ican say is, please review the archives. This same WG agreed that multipleidentities are worthy of checking, and if possible they should be checkedin the same or similar ways. Did I misunderstand, or have we changeddirections on this, or has everyone just forgotten what we talked about forthe first month or more?

GC> It would be better to use ?include:comcast.net or ?ptr:comcast.net.
That GC> way the mail from those domains is still allowed, but not
"guaranteed" to GC> be from you.

This begs for an obvious question: What is the benefit to the
anti-spam world of something which offers no guarantees? Is that not
the same as saying "I enforce no anti-spam policies, since anyone can
claim to be part of my domain"? No accountability is a rather serious
deficiency.

To quote Douglas Adams, "We demand rigidly defined areas of doubt anduncertainty!" :)

Seriously though, the "unknown" state was put in there for a reason. In anideal world, all my users would phone home and submit with SMTP AUTH andall our mail would go out the pre-defined block of IPs. But, some domainowners might want partial coverage, and might need some usage cases to besupported in "legacy mode" for a while. If a domain owner is not 100% surehe has rounded up all the roaming users, he may choose to write ?all at theend - in which case forgeries would not be stopped, but the +entries in thelist can still be used to invoke reputation and whitelisting. If all theroaming users happen to be on comcast.net, a record with ?ptr:comcast.net-all is much better than ?all -- possibly enough to make spammers/forgersmove on to the next target.

In other words, the "unknown" state is a feature, not a bug. If you don'tagree, fine, don't use the feature. Your characterization of this mode asa "deficiency" is uncharitable and seems to contain a high FUD to factratio.

If you had not taken the sentence out of context, my original intent wouldbe a bit clearer -- I was actually responding to some other FUD based on awrong understanding of SPF (or intentional misreading or other straw man).The example given by Doug and Matt both was "Well what about a domain thatpublishes include:comcast.net? That means anyone on comcast.net could HELOas my own name!" Yes, and this would be a mistake on the part of thedomain owner; they are in effect saying "We trust comcast.net to not forgemail from us or otherwise use our name improperly."

GC> If the result comes back unknown, you can't attach reputation
GC> or whitelisting to that transaction, you just have to proceed in
"legacy" GC> mode.

And the value-add of SPF, in this scenario is what, exactly?

What does the administrator of the domain and/or the operator of the
receiving SMTP client get for their effort?



See above regarding FUD.

I will note also that despite disparagement pointed at the "unknown" modeof SPF, CSV also has a de-facto "unknown" mode - you can just choose not topublish any records at all for that particular name. I would assume thatreputation would not attach in this case either.

Is it clear to you that CSV has definite security advantages
over SPF/Sender-ID?



GC> There is general agreement that the smaller problem
GC> of HELO checking

"smaller problem"?

I hope you do not mean that identifying spam spigots is a small
problem or that doing it will be a small benefit.

That is one of the things CSV is useful for, that SPF is not.  Entire
networks of compromised machines can be blocked with a single
accreditation entry, no matter what the domain names they use for their
rfc2822 author/sender.

Actually I was not referring to my own opinion as "general agreement" -- Iwas referring to the decisions of this WG as to which identities should bechecked. I believe it was agreed that 2821.MAIL FROM was most important,followed by 2822.From/Sender, and 2821.HELO was the least important of thethree.

I DO agree though; identifying spam spigots is a big problem, and doing itis a big benefit. You have made a good case for HELO checking. I'm notsuggesting that HELO checking shouldn't be done, and you're not suggestingthat it's a total solution in itself that trumps others. We may not be onthe same page but I think it's in the same book :)

GC> CSV may have a better security story, but I believe this is a direct
result GC> of deciding to include fewer features and less flexibility.

Methinks there is a lesson in protocol design, here.


GC> Regarding DDOS concerns, I think they can be solved by placing some
limits GC> on the amount of recursion possible and the total number of
queries needed GC> per mail message, and that should satisfy most
concerns.

Offhand, I am not sure what you mean and I am certain I do not
understand how it pertains to protection against DDOS attacks.

Sorry, I didn't mean "[all] DDOS attacks" -- this was a specific reply to aspecific concern in SPF.

I don't know why Matt and Doug chose to say over and over again how nastyand yucky and vulnerable SPF is, citing this as a reason why CSV is cooland wanted and necessary. I don't think SPF and CSV are mutually exclusiveand I don't think the "air of competition" serves any of us well. I thinkCSV contains great ideas and so does SenderID.

GC> I think there is enough consensus in the group that we need to
protect PRA GC> and/or MAIL FROM,

There is agreement that we need a mechanism that identifies and
accredits rfc2822 author/sender IN SOME CASES.



No comment at this time, Senator. :)

GC> and that HELO is of secondary importance.

I'm not sure whether you noticed, but there is a rather different tone
in the comments about HELO checking now than there was a month or so
ago.

I noticed. CSV has done a lot to bring HELO into the spotlight. This is agood thing. As long as CSV isn't trying to elbow other proposals out ofthe way, I don't have a problem with it.

In fact, Unified SPF is based pretty strongly on my efforts to get HELOplaced more prominently on SPF's radar screen. I have gone from not takingHELO seriously to actively preaching the gospel of HELO to spf-discuss and#spf on IRC.

I don't think HELO has eclipsed PRA/MAIL FROM/SUBMITTER in importance orutility.

GC>   Not everyone
GC> agrees with this, but I think a majority of folks think that HELO
checking

I am confused.  Were the chairs asking each of us to perform a rough
consensus assessment of the working group?

I was referring again to the rough consensus reached at our first-phasemilestone. I believe it was agreed that 2821.MAIL FROM was most important,followed by 2822.From/Sender, and 2821.HELO was the least important of thethree. Perhaps I misremember, but that's what I thought we said.

GC>   So, if we are going forward with PRA/SenderID or
GC> something like it, it should be easy enough to adapt it to HELO
checking as GC> well.

I'm sure we all look forward to the specification that satisfies your
expectation.

Being worked on. I think you will be pleased. As I said before, ifUnified SPF borrows some ideas from CSV, please consider that a form offlattery :)


--
Greg Connor <gconnor(_at_)nekodojo(_dot_)org>