spf-discuss

Re: Use of New Mask Mechanism

2005-03-26 17:21:47
David MacQuigg wrote:
At 04:06 PM 3/26/2005 -0500, Radu wrote:

David MacQuigg wrote:

At 02:45 PM 3/26/2005 -0500, Radu wrote:

David MacQuigg wrote:

At 01:33 PM 3/26/2005 -0500, Radu wrote:

David MacQuigg wrote:

At 11:53 AM 3/26/2005 -0500, Radu wrote:

Once again, the mask would not work as a mechanism (unless it was in an include, like Frank mentioned), because each mechanism can return a match on its own. The mask modifiers can return a match only after *all of them* have been checked against the IP. Think of a mask like m=65/6 m=214/6. For senders in the 214 net, your proposed -!ip4 mechanism would wrongly declare the 214 sources as "FAIL".
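
For illustration, a minimal Python sketch (not from the thread; the /6 shorthands are normalised to valid network addresses) contrasting the wrong per-mask behaviour with the correct set-wise check:

import ipaddress

# The two masks from the example; the /6 shorthands are normalised to
# their network addresses (65/6 -> 64.0.0.0/6, 214/6 -> 212.0.0.0/6).
masks = [ipaddress.ip_network("65.0.0.0/6", strict=False),
         ipaddress.ip_network("214.0.0.0/6", strict=False)]
sender = ipaddress.ip_address("214.1.2.3")

# Wrong: treating each mask like a negated ip4 mechanism fails on the
# first non-matching block, even though a later mask covers the sender.
for net in masks:
    if sender not in net:
        print("FAIL")   # fires on the 65/6 block -- wrong for a 214.x sender
        break

# Right: the set of m= modifiers is judged only after all are checked.
print("match" if any(sender in net for net in masks) else "no match")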


Oops. I thought I understood these masks, but I missed it. OK, so what this "mask" mechanism really says is: the IP address must match one or another mask range AND match at least one of the subsequent mechanisms.


Almost :)

Please, let's never call it a mechanism again, to avoid confusion. It really is a *modifier*! Actually, it's a *set of modifiers* that are only meaningful together. Individually, each mask modifier doesn't mean anything, because it doesn't tell the checker enough to stop evaluating.


Good point.  It's a modifier, not a mechanism.

And what it really says is: somewhere in the included/redirected records, there are more IP mechanisms that match some of the IPs in the mask range. It's a "summary" of the remaining records, if you wish. The summary includes more IPs than the records themselves, but it serves well to tell authoritatively which IPs *aren't* in the subsequent includes. It also tells you what the 'all' at the very end of the record chain says to do with non-matching IPs ("fail", "softfail", etc.), so that you don't need to scan the whole chain to find out what the domain owner wants you to do.

Somehow I/we need to find a description of this that is very clear, so that implementers of SPF checkers know what to do. It is clearly a description/language problem, because you're not alone in struggling to grasp its meaning.


How about
   record           = version [mask] terms *SP
   version          = "v=spf1"
   mask             = 1*( 1*SP "m=" ipblock )
After processing the mask, if there is any match, proceed with the terms as usual. If there is no match, skip processing any terms except 'all', and don't call for any remainder of a truncated record.
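
A rough Python sketch of that rule, under simplifying assumptions (only m=, ip4: and 'all' are handled, and the m= blocks are written as full network addresses; real records have more mechanism types):

import ipaddress

QUAL = {"+": "pass", "-": "fail", "~": "softfail", "?": "neutral"}

def parse_record(record):
    """Split a simplified SPF record into mask networks and terms."""
    masks, terms = [], []
    for tok in record.split()[1:]:                    # skip "v=spf1"
        if tok.startswith("m="):
            masks.append(ipaddress.ip_network(tok[2:], strict=False))
        else:
            terms.append(tok)
    return masks, terms

def evaluate(terms, addr):
    for t in terms:
        qual, mech = (t[0], t[1:]) if t[0] in QUAL else ("+", t)
        if mech == "all":
            return QUAL[qual]
        if mech.startswith("ip4:") and addr in ipaddress.ip_network(mech[4:], strict=False):
            return QUAL[qual]
    return "neutral"

def check(record, client_ip):
    masks, terms = parse_record(record)
    addr = ipaddress.ip_address(client_ip)
    if masks and not any(addr in net for net in masks):
        # No mask matched: skip every term except a trailing 'all',
        # and never fetch the remainder of a truncated record.
        terms = [t for t in terms if t.lstrip("+-~?") == "all"]
    return evaluate(terms, addr)

print(check("v=spf1 m=65.0.0.0/6 m=24.0.0.0/8 ip4:65.1.2.0/24 -all", "10.0.0.1"))  # fail
print(check("v=spf1 m=65.0.0.0/6 m=24.0.0.0/8 ip4:65.1.2.0/24 -all", "65.1.2.9"))  # pass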


That's good, except that the earliest the mask can be evaluated is after the A/MX/exists mechanisms. For instance, an exists:{i}.domain.com cannot be covered by a mask. The only reason a compiler might leave a/mx/exists mechanisms in the compiled record is if they contain macros that cannot be expanded at compile time (such as i, l, and possibly s).

I wasn't planning on worrying about truncated records. I was going to assume that the checker cannot receive a truncated record, because the resolver library takes care of using TCP as necessary. In any case, for compiled records this should only happen if the compiler screwed up and created a record that cannot fit in a UDP packet. I would treat that as a bug that must be fixed, as opposed to a condition that must be handled by the checkers.

So, since I'm assuming that the checker receives the entire record, the masks could be evaluated when the first include/redirect mechanism is encountered. This would mean that the mask does not need to cover IP addresses in the top record, as they are free to check.


Now I'm confused. If the reason for masks is *not* to avoid sending multiple packets, and *only* to avoid processing mechanisms that require another lookup, why do we need these lookups on the client side? Why can't the compiler do whatever lookups the client would do, and make the client's job as simple as possible?


Sorry for creating confusion.

Say that you have a policy that compiles to 1500 bytes.

The compiler will split it into 4 records, about 400 bytes each or so.

example.com     IN TXT \
     "v=spf1 exists:{i}.{d} ip4:... redirect=_s1.{d2} m=-65/8 m=24/8"
_s1.example.com IN TXT "v=spf1 ip4:.... .... ....  redirect=_s2.{d2}"
_s2.example.com IN TXT "v=spf1 ip4:.... .... ....  redirect=_s3.{d2}"
_s3.example.com IN TXT "v=spf1 ip4:.... .... ....  -all"

We want the mask to be applied after the exists:{i}.{d}. Since that mechanism was in the initial query and cannot be expanded to a list of IPs, the mask cannot possibly apply to it.
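
For illustration, a small Python sketch of how a checker could collect the terms across such a redirect chain (dns_txt is a stand-in for a real DNS TXT lookup; the hop limit is an assumed guard, not part of the example above):

def fetch_policy(domain, dns_txt, max_hops=10):
    """Walk redirect= links and return the concatenated terms."""
    terms = []
    for _ in range(max_hops):                # guard against runaway redirects
        tokens = dns_txt(domain).split()[1:] # drop "v=spf1"
        redirect = None
        for tok in tokens:
            if tok.startswith("redirect="):
                redirect = tok[len("redirect="):]
            else:
                terms.append(tok)
        if redirect is None:
            return terms
        domain = redirect                    # follow the chain: _s1, _s2, ...
    raise RuntimeError("redirect chain too long")

# Toy zone standing in for the example records (documentation IPs):
zone = {
    "example.com":     "v=spf1 ip4:192.0.2.0/24 redirect=_s1.example.com",
    "_s1.example.com": "v=spf1 ip4:198.51.100.0/24 -all",
}
print(fetch_policy("example.com", zone.__getitem__))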


I think what you are saying is that the compiler can't get this down to a simple list of IPs, because we need redirects containing macros that depend on information only the client has. So if we are to put the burden of complex SPF evaluations on the server side, where it belongs, it seems we have to pass all the necessary information to the server in the initial query. We already pass the domain name. Adding the IP address should not be a big burden, and it would have some other benefits we discussed.

If you can find a way to do that and still keep the query cacheable, let me know. If it is compatible with the way DNS works currently, I'll even listen and pay attention. ;)

That one UDP packet might not seem like a lot. But currently it is cacheable, and most of the time it is not even seen on the internet. Making it uncacheable would multiply the bandwidth burden several times over. That's exactly why caching and the TTL mechanism were invented, and now you suggest we give that up?
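
(For a rough sense of scale, with numbers picked purely for illustration: a receiver seeing 100,000 messages a day from one domain needs at most 24 lookups a day per resolver when the record is cacheable with a one-hour TTL, but up to 100,000 queries a day if every check has to go back to the domain's servers.)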

Maybe I'm just not seeing the necessity of setups like the above example.com. I'm sure someone could come up with a scenario where it would be real nice if all SPF checkers could run a Perl script embedded in an SPF record, but we have to ask, is that really necessary to verify a domain name?

The "..." stand for a list of ip4: mechanisms about 400 bytes long. That's why the chaining is necessary. ebay.com has something like that. hotmail.com uses something similar too. When you have lots of outgoing servers, you need more space to list them, no?

If we simply can't sell SPF without all these whiz-bang features, I would say put it *all* on the server side. All the client should have to do is ask - "Hey <domain> is this <ip> OK?" We dropped that idea because it doesn't allow caching on the client side, but with a simple PASS/FAIL response, the cost of no caching is only one UDP round trip per email. This seems like small change compared to worries about runaway redirects, malicious macros, etc.

I'll humour you:

This server-side processing would not be happening on a caching server, correct? That would not save anything. I hope you agree.

So the only place where it might make a difference is if the evaluation was run on the authoritative server for the domain.

The problem with that is that authoritative servers are designed with performance and reliability in mind (as opposed to caching servers, which care more about cost savings). As such, the auth servers *do not* do recursive queries, which an SPF record evaluation might require. They also do not do any caching. They respond to every query with a piece of data they already have in memory or on disk. If they don't have that piece of information, they return an empty response or "it doesn't exist" (NXDOMAIN). They never look for it anywhere else. That's why they are authoritative. If they don't know about it, it doesn't exist.

Now, the spfcompiler only makes sense if it is running on a master server. The master for a zone is itself authoritative. The above authoritative servers are slaves: they take the information from the master server and disseminate it as if it were their own. It is the administrator of the master zone server that allows them to do so. No other server can respond authoritatively to queries for the zone in question.

So the only place the spf compiler makes sense is on the master server, because ultimately it is the only one that really knows the facts. When the facts change, the master informs the slaves, which do a zone transfer in order to update their databases. The truth thus propagates nearly instantly from the master to the slaves, and as such the slaves can be seen as clones of the master, identical in every way except for the 3-second delay it takes them to transfer the zone files.

You cannot run the compiler on the slaves, because they might each evaluate the record differently, as they cope with different network conditions (lost packets, etc.). They would then each tell a different "truth" from each other and from the master server, and they would no longer be authoritative.

Now, having the master zone server respond to queries that require it to do calculations of any kind is an absolute no-no. That is because no matter how big the zone is (yahoo, rr, whatever), there is only one master. Ok, there may be a couple, but their job is not to respond to queries, but to 'hold the authority'. The slaves are for responding to queries.

So doing what you propose would require the DNS system to be turned upside down. The justification of SPF is just not good enough.

How about this: All SPF records SHOULD be compiled down to a list of IPs. If you need more than that, then do as much as you like, but give the client a simple PASS or FAIL. Most domains will then say "Here is our list of IPs. Don't ask again for X hours." Only a few will say "Our policy is so complex, you can't possibly understand it. Send us every IP you want checked."

That's exactly what the exists:{i}.domain does. It tells the domain about every IP it wants checked, and the server checks it. Unfortunately, it is extremely expensive because it's AGAU.
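
To see why, here is a toy expansion of that mechanism in Python (SPF spells the macro %{i}; the domain is just the example from the thread):

# exists:%{i}.example.com makes the checker ask the domain's DNS
# one question per sender IP:
client_ip = "192.0.2.10"
query = "%{i}.example.com".replace("%{i}", client_ip)
print(query)   # 192.0.2.10.example.com
# Every distinct sender IP yields a distinct name, so each check is a
# fresh, effectively uncacheable lookup against the domain's servers.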

I need to get back to designing ICs. :>)

Nah... you've got some great ideas and I value your contribution and feedback.

Just that this one isn't one of the good ideas. That is only my opinion, of course. But if you can prove that it's good, I'm listening.

Radu.

