
Re: Need for Complexity in SPF Records

2005-03-27 19:06:26
David MacQuigg wrote:
Radu, I wrote this response yesterday, then today decided it doesn't sound quite right. I'm really not as sure of what I'm saying as it sounds. Show me I'm wrong, and I'll re-double my efforts to find solutions that don't abandon what is already in SPF, solutions like your mask modifier. Examples are the best way to do that. Your example.com below is almost there, but it still doesn't tell me why we really need exists and redirect.

Ok, we'll have a look at all the ideas on the table. That's what the table is for, right? :)

I won't cut anything out of your message, so that the progression of the explanation is easily seen and reflected upon if necessary.

At 07:21 PM 3/26/2005 -0500, Radu wrote:

David MacQuigg wrote:

At 04:06 PM 3/26/2005 -0500, Radu wrote:

David MacQuigg wrote:


Now I'm confused. If the reason for masks is *not* to avoid sending multiple packets, and *only* to avoid processing mechanisms that require another lookup, why do we need these lookups on the client side? Why can't the compiler do whatever lookups the client would do, and make the client's job as simple as possible?


Sorry for creating confusion.

Say that you have a policy that compiles to 1500 bytes.

The compiler will split it into 4 records, about 400 bytes each or so.

example.com     IN TXT \
     "v=spf1 exists:{i}.{d} ip4:... redirect=_s1.{d2} m=-65/8 m=24/8"
_s1.example.com IN TXT "v=spf1 ip4:.... .... ....  redirect=_s2.{d2}"
_s2.example.com IN TXT "v=spf1 ip4:.... .... ....  redirect=_s3.{d2}"
_s3.example.com IN TXT "v=spf1 ip4:.... .... ....  -all"

We want the mask to be applied after the exists:{i}.{d}. Since that mechanism was in the initial query and cannot be expanded to a list of IPs, the mask cannot possibly apply to it.
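(As a rough illustration of the splitting step, here is a minimal sketch in Python of how a compiler might chunk a flattened list of ip4: mechanisms into daisy-chained TXT records of roughly 400 bytes each. The helper name, the 400-byte budget and the _s labels are assumptions for illustration, not the actual spfcompiler.)

# Hypothetical sketch: split a flattened policy into chained TXT records.
MAX_RECORD_LEN = 400   # assumed per-record budget

def chain_records(domain, mechanisms, prefix="_s"):
    """Return {owner_name: txt_value} for the daisy chain rooted at domain."""
    records, current, chunk_no = {}, [], 0

    def owner(n):
        return domain if n == 0 else "%s%d.%s" % (prefix, n, domain)

    def flush(last):
        nonlocal chunk_no
        tail = " -all" if last else " redirect=%s%d.%s" % (prefix, chunk_no + 1, domain)
        records[owner(chunk_no)] = "v=spf1 " + " ".join(current) + tail
        chunk_no += 1
        current.clear()

    for mech in mechanisms:
        # Start a new record when this mechanism would overflow the budget,
        # leaving ~40 bytes of room for the redirect= tail.
        if current and len("v=spf1 " + " ".join(current + [mech])) > MAX_RECORD_LEN - 40:
            flush(last=False)
        current.append(mech)
    flush(last=True)
    return records

# Example: a long list of outgoing servers compiled into a few chained records.
policy = ["ip4:198.51.100.%d" % i for i in range(60)]
for name, txt in chain_records("example.com", policy).items():
    print(name, 'IN TXT "%s"' % txt)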


I think what you are saying is that the compiler can't get this down to a simple list of IPs, because we need redirects containing macros that depend on information only the client has. So if we are to put the burden of complex SPF evaluations on the server side, where it belongs, it seems like we have to pass all the necessary information to the server in the initial query. We already pass the domain name. Adding the IP address should not be a big burden, and it would have some other benefits we discussed.


If you can find a way to do that and still keep the query cacheable, let me know. If it is compatible with the way DNS works currently, I'll even listen and pay attention. ;)

That 1 UDP packet might not seem like a lot. But currently it is cacheable and most of the time is not even seen on the Internet. Making it uncacheable would be a many-fold burden on bandwidth. That's exactly why caching and the TTL mechanism were invented, and now you suggest we give it up?


No, I see your point. If we truly need %{i} macros, and we evaluate them on the server side, that would produce a different response record for every IP address, and it might not make sense to cache such records. Responses for SPF records with no %{i} macros would cache as always. The %{d} macros would not impair caching. Even the %{i} responses might be worth caching for a few minutes, if you are getting hammered by one IP.

Actually, all records should have the longest possible TTL (within the constraints of the network design). This avoids caching name servers everywhere asking the same queries too often.

Responses to %{i} queries are no different. Since there are 2^32 possible questions, you want each one to come up as infrequently as possible. If you have a pest or even regular traffic every hour, but your %{i} TTL is 59 minutes, then the cache efficiency is 0%. But if you could make it 1 hour and 1 minute, the cache efficiency would be 50%. On the other hand, for steady traffic the cache efficiency would be really high, so even a lower TTL would not make much difference, as the savings are huge compared to the cost. It's a little counterintuitive that the "uncacheable" records should have long TTLs. Anyway, this is somewhat philosophical, because you can't cache 2^32 * {number of forged domains that publish %{i}}.
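(To put numbers on the 59-minute versus 61-minute point, a toy model with assumed periodic arrivals:)

# Toy model: queries for the same %{i} name arrive every `interval` minutes
# and the answer stays cached for `ttl` minutes.  Efficiency = fraction of
# arrivals served from the cache.

def cache_efficiency(interval_min, ttl_min, arrivals=1000):
    hits, expiry, t = 0, None, 0
    for _ in range(arrivals):
        if expiry is not None and t < expiry:
            hits += 1                  # answered from the cache
        else:
            expiry = t + ttl_min       # miss: fetch again and re-cache
        t += interval_min
    return hits / arrivals

print(cache_efficiency(60, 59))   # TTL just under the arrival interval -> 0.0
print(cache_efficiency(60, 61))   # TTL just over the interval -> 0.5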

As an example, let's pretend that yahoo publishes a record with %{i} and a TTL of 10 minutes. Potentially it will receive the same 2^32 questions from all the caching servers of the world, every 10 minutes. I know for sure that ohmi will be asking every 10 minutes, because I get lots of forgeries as yahoo.com (say 1 every 11 minutes). So will all the other little servers. So doubling that TTL means I'll only ask every 20 minutes. This is where the damage is: little servers asking for the information every 10 minutes, but never using it more than once.

But when yahoo users send 300M messages a day to their hotmail friends, hotmail will ask yahoo for the information 144 times, and use its cache the other 299,... million times. So the cost of %{i} as seen by yahoo is not coming from hotmail querying it, but from the swarm of little servers everywhere.

Whether the loss of caching on a few records is too high a price depends on the severity of the threatened abuse. Should we tolerate a small increase in DNS load for the normal flow of email, to limit the worst-case abuse of the %{i} macro? I don't know.

Well, the %{i} is not a small increase. It is even far more expensive than PTR. Let's say that you have a spewing spambox that uses forgery techniques. (let's say it's at 1.1.1.1)

Let's say that all domains used one %{i} mechanism.

The spambox sends ohmi N forgeries from different domains.

If every domain listed a PTR mechanism, I would query the 1.1.1.1.in-addr.arpa address once, and for the remaining N-1 queries I would find it in the local cache. So my cost of the PTR is 1 query per mail source.

But if everyone uses an %{i}, I now have to ask the following questions:

1.1.1.1._spf.domain1.com
1.1.1.1._spf.domain2.com
1.1.1.1._spf.domain3.com
1.1.1.1._spf.domain4.com
...
1.1.1.1._spf.domainN.com

These are distinct queries, and I only ask each question exactly once, so even though the local DNS cache does cache the answers, I will never ask for them again. All that traffic will go over my DSL connection to the ISP, to the root servers, and so on. Actually, as Tod pointed out, every time my caching server is asked about a new domain, it generates multiple recursive queries: the 1st to the root servers, the 2nd to the authority NS servers, the 3rd to the subdomains, and so on. I hadn't thought about this, or I would have presented a much gloomier SPF-doom scenario.

So every one of those queries costs 3 queries on my DSL line. 3*N. Compared to the PTR mechanism that only costs 1 query across DSL. I have the caching server on my side of the DSL modem, I don't use the ISP's. I also get charged for excess bandwidth consumed.

If I used the ISP's caching server, I would ask N questions even for the PTR case. The further the caching server is, the more expensive it is to use it. Also the benefit is lost, as the further it is, the higher the response latency gets. (Assume my DSL connection had a 200ms latency. Asking N questions would take N*200ms, while asking the same N questions from a cache on my side of the modem would be 200ms for the 1st question, and 0.1ms for every subsequent one.) And I'd be paying dollars for the N*200ms performance.
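(The arithmetic behind that parenthesis, with the same assumed numbers:)

# Illustrative only: N lookups through a remote (ISP-side) cache at ~200 ms
# round trip, versus a cache on this side of the DSL modem (~0.1 ms once warm).
N = 1000
remote_ms = N * 200                  # every query crosses the DSL line
local_ms  = 200 + (N - 1) * 0.1      # one cold fetch, then local hits

print("remote cache: %.0f s (about 5 domains/s)" % (remote_ms / 1000.0))
print("local cache:  %.1f s" % (local_ms / 1000.0))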

What I *would* do is discourage the widespread use of macros, redirects, and includes, and state in the standard that processing of records with these features SHOULD be lower priority than processing simple records. That may help to implement a defense mode if these features are abused.

Absolutely, I'm with you on this. I already suggested that the expensive macros be limited to 1 per record. The %{d} and %{o} macros are not expensive, as they expand the same no matter what the source of the connection is or what the claimed mail-from is.

I would not introduce the concept of 'priority' though.

After all, no one is forcing a postmaster to do 10 queries, or N queries. Even my sendmail implementation of SPF has configuration options for how expensive the check is allowed to get. You can say that checks with %{i} are never done, and in that case the policy does not result in an answer; you can also configure the max number of DNS mechanisms to an arbitrarily low number. If it is lower than the spec, and the checker sees more than that in the record, it doesn't try to expand even one, and returns with "record too expensive". In both of those cases, no Received-SPF header is added.
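(A minimal sketch of that kind of knob; the option names and thresholds are made up for illustration, not the actual sendmail implementation:)

# Hypothetical cost limiter: count the DNS-consuming terms in a record
# before expanding any of them, and refuse records that are too expensive
# (in which case no Received-SPF header would be added).

MAX_DNS_MECHS = 4        # local policy, may be stricter than the spec allows
ALLOW_I_MACRO = False    # refuse per-IP %{i} expansions entirely

DNS_TERMS = {"include", "a", "mx", "ptr", "exists", "redirect"}

def term_name(term):
    return term.lstrip("+-~?").split(":", 1)[0].split("/", 1)[0].split("=", 1)[0]

def affordable(record):
    if not ALLOW_I_MACRO and "%{i}" in record:
        return False
    terms = record.split()[1:]                       # skip the v=spf1 tag
    return sum(term_name(t) in DNS_TERMS for t in terms) <= MAX_DNS_MECHS

record = "v=spf1 exists:%{i}.example.com include:isp1.com include:isp2.com -all"
if not affordable(record):
    print("record too expensive: check skipped, no Received-SPF header added")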

Maybe I'm just not seeing the necessity of setups like the above example.com. I'm sure someone could come up with a scenario where it would be real nice if all SPF checkers could run a Perl script embedded in an SPF record, but we have to ask, is that really necessary to verify a domain name?


The "..." imply a list of ip4: mechanism that is 400-bytes long. That's why the chaining is necessary. ebay.com has something like that. hotmail.com uses something similar too. When you have lots of outgoing servers, you need more space to list them, no?


Why can't they make each group of servers a sub-domain with its own simple DNS records, as rr.com has done with its subdomains? _s3.example.com can have as many servers as can be listed in a 400 byte SPF record, and that includes some racks with hundreds of servers listed in one 20 byte piece of the 400 byte record. With normal clustering of addresses, I would think you could list thousands of servers in each subdomain, with nothing but ip4's in the SPF record.

It may already be that way. If I had that longer list of domains that publish SPF, I could run the spfcompiler on them and find out very quickly what the average, min and max compiled record lengths would be.

One reason I can see why mail servers can't be clustered too tightly is in an application like ebay's. Their business depends on being able to send "last chance" emails, so they have to have mail servers sprinkled all over for redundancy (and load sharing too).

As I understand it, users sending mail from _s3.example.com will still see 'example.com' in their headers, but the envelope address will be the real one _s3.example.com. That's the one that needs to authenticate, and the one that will inherit its reputation from example.com.

I'm afraid you misunderstood. The _s3-like names are generated by the compiler, but nothing in the configuration of the SMTP server is changed to reflect it. So if the next version of the compiler changes to using _p3, there is zero effect on the mail users. Because the _s records are daisy-chained, it's only the root of the chain that can be used as a start of policy. That root is at domain.com.

Also, as the network changes, the contents of _s3 changes too. Maybe the whole daisy chain gets shorter or longer. That will not affect the envelope address used on mail. Evaluation must always start at domain.com (the top of the daisy chain).

Seems to me this is using DNS exactly the way it was intended, distributing the data out to the lowest levels, and avoiding the need to construct hierarchies within the SPF records. Sure, it can be done, but what is the advantage over just putting simple records at the lowest levels, and letting DNS take care of the hierarchy? Why does ebay.com need four levels of hierarchy in its SPF records?

Currently just for convenience, as they're not using any compiler. In the future, the compiler will flatten the hierarchy. It may be a while till then, so in the meanwhile we need a transition plan.

If we simply can't sell SPF without all these whiz-bang features, I would say put it *all* on the server side. All the client should have to do is ask - "Hey <domain> is this <ip> OK?" We dropped that idea because it doesn't allow caching on the client side, but with a simple PASS/FAIL response, the cost of no caching is only one UDP round trip per email. This seems like small change compared to worries about runaway redirects, malicious macros, etc.


I'll humour you:

This server-side processing would not be happening on a caching server, correct? That would not save anything. I hope you agree.


If the caching server were in the domain which created the expensive SPF record, then it would save traffic to and from the client, at the expense of traffic within the domain that deserves it. If example.com needs 100 queries within their network to answer my simple query "Is this <ip> OK?", then they need to think about how to better organize their records. All I need is a simple PASS/FAIL, or preferably a list of IP blocks that I can cache to avoid future queries. (This should be the server's choice.)

I see where the misunderstanding started. Let me attempt to clear it up:

Caching servers are rarely/never deployed close to the authoritative servers. Caching servers really only make sense if they are close to where the queries are generated. I showed this above with my 200ms DSL connection example. It was a little exaggerated, but it serves the purpose of explanation well.

Caches generally are most beneficial when they are closer to the consumer. The principle applies equally to processor L1 caches, disk caches, HTTP page/GIF caches.

The processor caches offer a great example:

The L1 cache runs at the same speed as the core, so assuming a processor speed of 1GHz, every read and write which is a cache hit costs 1ns. If the data is not found, the request goes to the L2 cache, which is bigger, but much slower. So now every request that ends up at the L2 cache takes maybe 5ns. So if the CPU is running on L2 data, it is waiting 80% of the time. If the data is not in L2, the request goes to memory. It now takes at least 100ns to do a cache-line fill from memory, so the CPU is waiting 99% of the time when reading from RAM. The next level is the disk-based swap space. It's the next best thing to re-reading a file, especially if it is a file on a network drive. If it needs to run off swap space, we all know that it's just not worth running at that point, that's how slow it is.

In the CPU world, slow is expensive. It's like having a 3GHz machine running on swap data. The MIPS/dollar proposition is pitiful.

The same thing applies to networks. In the example I gave above - 200ms DSL - waiting N*200ms instead of 200ms + (N-1)*0.1ms is slow, and therefore expensive, because now I cannot check 1000 domains per second, but only 5.


What I *don't* want in answer to my simple query, is a complex script to run every time I have a similar query. That seems to be the fundamental source of our problem. SPF needs to focus on its core mission, authenticating domain names, and doing just that as efficiently and securely as possible. All these complex features seem to be motivated by a desire to provide functionality beyond the core mission - constructing DNS block lists, etc. Now we are finding that the complex features are not only slowing us down, but have opened up some unnecessary vulnerabilities to abuse.

Unfortunately, Java, JavaScript, Flash, and the others found the model of scripts running on the client rather than the server to be much better than scripts running on the server.

But we should make the distinction between expensive scripts and cheap scripts. All those web-enabling technologies are scripts that get downloaded in one shot, and then run continuously without needing to communicate with the server again. That makes them cheap.

Analogously, cheap SPF scripts (IP lists) are much better than expensive scripts (DNS mechanisms), where the entire work is in transferring tidbits of data across the net.

The flash format would have failed if it needed to request each polygon from the server individually, and serially.

So the only place where it might make a difference is if the evaluation was run on the authoritative server for the domain.

The problem with that is that authoritative servers are designed with performance and reliability in mind (as opposed to caching servers, which care more about cost savings). As such, the auth servers *do not* do recursive queries, as an SPF record evaluation might require. They also do not do any caching. They respond to every query with a piece of data they already have in memory or on disk. If they don't have that piece of information, they return an empty response or "it doesn't exist" (NXDOMAIN). They never look for it anywhere else. That's why they are authoritative. If they don't know about it, it doesn't exist.

Now, the spfcompiler only makes sense if it is running on a master server. The master for a zone is itself authoritative. The above authoritative servers are slaves. They take the information from the master server and disseminate it as if it was their own. It is the administrator of the master zone server that allows them to do so. No other server can respond authoritatively to queries for the zone in question.

So, the only place the spf compiler makes sense is on the master server, because ultimately, it is the only one who really knows the facts. When the facts change, the master informs the slaves, which do a zone transfer in order to update their databases. So the truth propagates nearly instantly from the master to the slaves, and as such the slaves can be seen as clones of the master, identical in every way, except for the 3 second delay it takes them to transfer the zone files.
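(For reference, the master/slave arrangement described here is the ordinary zone-transfer setup; a minimal BIND-style fragment, with placeholder zone name, file paths and addresses, looks roughly like this:)

// On the master (the only place the compiler would write):
zone "example.com" {
    type master;
    file "zones/example.com.db";          // compiler rewrites this, bumps the SOA serial
    also-notify { 192.0.2.53; 198.51.100.53; };
    allow-transfer { 192.0.2.53; 198.51.100.53; };
};

// On each slave:
zone "example.com" {
    type slave;
    masters { 203.0.113.53; };            // pull the zone from the master on NOTIFY/refresh
    file "slaves/example.com.db";
};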

You cannot run the compiler on the slaves, because they might each evaluate the record differently, as they are coping with different network conditions (such as lost packets, etc). In that case, each would tell a different "truth" from the others and from the master server, and they would no longer be authoritative.

Now, having the master zone server respond to queries that require it to do calculations of any kind is an absolute no-no. That is because no matter how big the zone is (yahoo, rr, whatever), there is only one master. Ok, there may be a couple, but their job is not to respond to queries, but to 'hold the authority'. The slaves are for responding to queries.


I would also say the slaves are the right machines on which to do whatever complex lookups are needed to answer a query. The owners of those machines are the only ones who will make the tradeoff of cost vs desired complexity.

I actually said that you cannot run the compiler (i.e., the complex evaluator program), so I will disagree here, but I will explain in more detail.

It is a common best practice for a domain name to employ slave authoritative servers that are well spread around the world. This is so if one trunk gets cut somewhere, the domain name does not suffer, as it is able to serve queries from its redundant servers. (when a resolv call fails, it tries the next authoritative server on the list of authoritative servers for a domain).

As such, the slaves for a domain are separated by great geographical distances, and this makes the whole system more reliable.

But since they are separated, if you ask them all to resolve the same SPF record independent of each other, they will come up with different answers. This is because different queries time out for each one, and they are asking different other servers for the answers to the questions.

For instance, say that an ISP has 2 name servers: ns1.isp.com and ns2.isp.com. A customer of that ISP, whether a vanity domain or a large company, uses an include:smtplist.isp.com in its own SPF record.

The customer uses 5 slave name servers, which are different than the ISP's name servers: ns1.dnsRus.com, ns2.dnsRus.com, ns1.weknowdns.com ns2.weknowdns.com, ns3.weknowdns.com.

If the slaves compile the include:smtplist.isp.com mechanism, they might come up with different results, because they would do the compile at different times. Indeed, if the ISP needs to change the TXT at smtplist.isp.com, it might take a minute or two for the change to propagate to ns1.isp.com and 5-6 minutes to propagate to ns2.isp.com. That may depend on how busy each of the servers is, and their configurations, which may not be the same, etc.

So in the case of the large company's compiled SPF record, some of the 5 name servers ask ns1.isp.com and some ask ns2.isp.com for the TXT record at smtplist.isp.com. Oops!! The slaves have now compiled different SPF records for the big company. And slaves don't ask each other for confirmation. They are authoritative, so they all *know* that their info is correct.

That's the problem, right there... authoritative servers presenting different information as correct.

When they designed the DNS system they avoided this problem by design. That's why the slave servers are called slaves, because they're supposed to do nothing but what the master tells them. Then the chain of command is intact. So a thinking slave is an oxymoron by design in this case. You cannot ask the slave to do any thinking/compiling, as that would break the assumptions that the whole DNS system is based on.

So, the only place that compiling can be done by a DNS server is at the master servers. They are absolutely authoritative, and there's no risk of disagreeing with other servers.

In fact, the master servers for a zone use the same source of information (database, zone file, etc). When that file changes, they read it in and inform the slaves. Then, after the necessary propagation delays, all the slaves are updated and respond with the exact same information.

Sometimes the source of information is a database, as in the MyDNS server, which uses MySQL. MyDNS is purely a master server. It neither does recursive queries (which would make it a caching server) nor accepts incoming zone transfers (which would make it capable of being a slave).

When there are multiple master DNS servers, using the same database, they have to use database replication, so that all masters use the same information. In that case, only one of the databases is writeable, and the rest read-only. Even there, on the back end, there is this concept of a master database, and multiple replicated copies. The compiler will run on the master-of-masters, and update the master database. The update will then replicate to all the slave databases, which are used by the other master DNS servers, which will update the slave DNS servers, and everyone is on the same page.

So if the compiler, running on the master-of-masters, queries ns2.isp.com (which is a couple of minutes late to update), there is no problem, as the master-of-masters does the recompilation every TTL seconds (isp.com's setting), and the ISP does not expect its records to propagate to the world in less time than the TTL it specifies.
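(A rough sketch of that recompilation loop, assuming hypothetical compile_policy and update_zone hooks running next to the master-of-masters:)

import time

def recompile_forever(compile_policy, update_zone, floor_seconds=300):
    # compile_policy(): re-expand the foreign data (e.g. the TXT at
    #   smtplist.isp.com) and return (flattened_records, lowest_foreign_ttl).
    # update_zone(): rewrite the zone, bump the SOA serial, notify the slaves.
    while True:
        flattened, foreign_ttl = compile_policy()
        update_zone(flattened)
        # No point recompiling faster than the foreign data is allowed to change.
        time.sleep(max(foreign_ttl, floor_seconds))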

Of course if the TTL of the big company's records is less than that of the ISP, the SPF record will have that shorter TTL, and will be compiled more often.



So doing what you propose would require the DNS system to be turned upside down. The justification of SPF is just not good enough.


I don't see how this turns anything upside down. DNS is supposed to be decentralized. If complex lookups are necessary, having a bunch of slave servers do the work on behalf of a master server is consistent with decentralization.

Well, compare what you are suggesting with the way I understand the DNS world to work. From my perspective, your proposal is a departure from the way things work. If I'm wrong, I hope that someone more knowledgeable will give me a well-placed kick. (I promise to take it like a man... gimme!) :)

Let's estimate the worst-case load on DNS if we say "no lookups, one packet only in any response". I'm guessing 90% of domains will provide a simple, compiled, cacheable list of IP blocks. This is as good as it gets, with the possible exception of a fallback to TCP if the packet is too long. The 10% with really complex policies may have a big burden from queries and computations within their own network, but what goes across the Internet is a simple UDP packet with a PASS or FAIL.

Oh, but the critical detail is that a lot of firewalls block port 53 TCP, whether by design or configuration. Since this is the state of the world, DNS queries over TCP are inherently unreliable.

I doubt if the 10% of domains with long compiled SPF records will accept that unreliability as a fact of life. They will stick to UDP, which is more or less guaranteed, in the sense that even if a packet is lost once, the next time it will probably make it. The DNS system deals gracefully with temporary problems like this, so not a problem.

But when your record depends on TCP, and some firewall somewhere blocks it, there's no amount of retrying that will get that connection through.

And because of this, we're stuck with daisy-chaining the longest records. In the end, it's done for the sake of reliability, at the expense of some extra traffic.

That response is not cacheable, but let's compare the added load to some other things that happen with each email. Setting up a TCP connection is a minimum of three packets. SMTP takes two packets for the HELO and response. MAIL FROM is another two. Then we need two for the authentication. At that point we can send a reject (one packet) and terminate the connection (4 packets).

Looks to me like the additional load on DNS is insignificant for normal mail, and only a few percent of the minimum traffic per email in a DoS storm. Also, the additional load is primarily on the domain with the expensive SPF records, where it should be.

This is not always the case. Consider a case like:

"v=spf1 ip4:1.1.1.1/28 mx:t-online.de include:isp1.com include:isp2.com include:isp3.com -all"

Say that the 3 ISP's don't even publish SPF yet, but the includes are there just in case they ever do.

This record is very cheap on the publisher's DNS (only 1 TXT query goes to the publisher's DNS). But for every bandwidth penny spent by the publisher, the 3 ISPs have to spend 1 penny each. Poor t-online.de has to spend 10 pennies for each penny that the publisher spends.

And the sad thing is, while the ISPs can minimize the cost by publishing cheap SPF records, there's nothing t-online can do to lower its damage.

What's even worse, is that t-online can't even find out why it sees increased bandwidth levels. It's extremely complicated to track an MX or A query back to an email address.

Even worse than that, the default max-ncache-ttl in BIND is 3 hours. That means that even if the publisher's TXT record has a TTL of 24H, the ISPs will be hit with a query every 3 hours, while the publisher only every 24H.

So no, the cost is not necessarily on the publisher.

Taking into account the TTLs above, and the 1H TTL of t-online's records, the score would be

1:8:240

So the publisher's record costs t-online.de 2.40 euro for every penny it costs the publisher.

The ISPs pay 8 pennies each.
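(Where those numbers come from, using the TTLs above and the roughly 10 lookups per mx: mechanism mentioned earlier; all figures are per caching resolver, per day:)

# Queries per day generated at each party by one caching resolver, for
#   "v=spf1 ip4:... mx:t-online.de include:isp1 include:isp2 include:isp3 -all"
# Assumptions from the discussion: publisher TXT TTL = 24h; the ISPs' missing
# records are negative-cached for 3h (BIND's default max-ncache-ttl);
# t-online's records have a 1h TTL and the mx: mechanism costs ~10 lookups.

publisher = 24 // 24            # 1
per_isp   = 24 // 3             # 8
t_online  = (24 // 1) * 10      # 240

print(publisher, per_isp, t_online)   # -> 1 8 240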

Even if this were a spammer domain, and they weren't *really* doing any internal lookups, the load on their DNS server is two packets for every additional two-packet load on the victims. No amplification factor here.

Add that the spammer is actually likely both to use t-online.de's resources and to be stupid enough not to realize that mail doesn't go through the MX exchange. Suddenly, the amplification factor becomes a certainty.

How about this: All SPF records SHOULD be compiled down to a list of IPs. If you need more than that, then do as much as you like, but give the client a simple PASS or FAIL. Most domains will then say "Here is our list of IPs. Don't ask again for X hours." Only a few will say "Our policy is so complex, you can't possibly understand it. Send us every IP you want checked."


That's exactly what the exists:{i}.domain does. It tells the domain about every IP it wants checked, and the server checks it. Unfortunately, it is extremely expensive because it's AGAU.


If I were writing an SPF-doom virus, this is where I would start.

I need to get back to designing ICs. :>)


Nah... you've got some great ideas and I value your contribution and feedback.


And I appreciate your time in getting me up to speed on these problems. I hope one day I can return the favor.

It's a pleasure to be of service. SPF is a good cause, and I think it deserves to be saved.

Incidentally, I got curious and did some tests, and it appears that yahoo does not do any DNS queries on incoming mail. Hotmail does two, but either doesn't respect TTLs or does queries on a spot-check basis, because even though I have a low TTL, they did not refresh.

It could be that already, even without checking SPF, these two figured out that DNS is more expensive than storing spam. Fascinating!

This wasn't a scientific test as I would normally do, but a quick check-your-fears check.

So at least for now, I think I know that yahoo and hotmail will not do any spf checks any time soon, based on this little test and a lot of extrapolation. ;)

Radu.

