Re: Re: DNS load research



Andy Bakun wrote:

Would anyone be complaining "additional email transactions are bad for
the DNS! No one will want to send greater amounts of email once they
find out the load it puts on their DNS servers"?  What if MX queries for
SPF checking are considered exactly as expensive as MX queries for
sending email?  This is why I weighted mx low initially, because if MX
isn't cheap, then sending email isn't cheap to begin with.  But I could
see it going significantly higher.

Spam traffic is growing much faster than real email traffic. The realemail traffic grows roughly proportionally with the number of newsubscribers to the Internet, while (attempted) spam grows proportionallywith the effectiveness of SPAM filters. As long as the profitability ofspam remains even marginal, the levels of overhead handled by the MTAsand DNS will grow (much faster than the levels of email between friends,partners, mailing lists).

My weights were not solely based on some raw "number of queries" count,
but on how that mechanism fits into the whole system, just not DNS
(otherwise, the weights are nothing more than a query count, and we
should just count queries -- I'm trying to think about things
differently here).

I like the idea of weights, but it it is a purely academic exercise,because at run-time it is difficult or impossible to calculate the realexpensiveness of a record. The checker can try estimating it, but itwill probably not be nearly accurate enough to be useful. This isbecause of DNS caching.

In a large installation, the DNS cache and the incoming MTAs may bedifferent physical machines. The traffic between them is not veryrelevant because it's on the ISP's network, so it doesn't cost muchmoney, once they have a gigabit ethernet installed.

However, cache requests that result in traffic on the ISP's connectivityto the backbone do cost money.

It is therefore difficult if not impossible for the checking coderunning on the MTA machines to estimate if the query it is about to dotoo result in a packet sent to the backbone of if it will be served fromthe cache.

The average cost of the backboe traffic is proportional to thecomplexity we allow SPF to have. It can be estimated using weights forthe different queries, but this is a design-time estimate, as atrun-time it cannot be done reliably.

I still think the macros can cause far more backbone traffic than themechanisms themselves, because macros are much less likely to be cacheable.

I don't think anyone disagrees that individual senders need to determine
for themselves (perhaps, if you're an ISP who supports vanity domains,
with input from your customers who might use include) if they need to
make the robustness/load tradeoff in their records.

Vanity domain owners, who are two or three steps removed from theworries of running a DNS+MTA combo, will have no incentive to make theirrecord cheaper.

An SPF publisher can only lighten his DNS load a _little_ by making hisDNS simpler, because the bulk of his DNS load is still queries he has todo due to other people's SPF records.

Another thing that really bothers me is the potential for malicious'punishment':

Say that I really don't like domain X. My own email is forged likethere's no tomorrow, so lots of MTA out there execute my SPF record.

If I make my record "+ip4:my_ip -a:a.X -a:b.X -a:c.X ... and so on tothe allowed limit, my record will cost X dearly. I will never beblacklisted for doing this, because my denying X to send on my behalfappears legitimate. also, I'm not requesting a number of queries withinthe limit, so I'm not DOS's any recieving MTA.

What's even worse, is that poor domain X will have to answer all thosemillions of queries and not even know why he's queried so heavily.replace X with microsoft.com, and you will see that a LOT of peoplewould like to implement this strategy. But poor little ohmi.org will beput out of business very quickly if this happened to it.

What's even worse than that is that I can do something like "-a:%{i}.X"and "-a:%{i}.%{l}.X" (IP and username), and ensure that the queries arenot cacheable, so the victim is hit much, much harder.

You'll no doubt say that X will then implement some kind of blacklistingon their DNS server, and refuse to respond to queries from some IPs.


Very well, so now all those MTA's trying to resolve my record will:

A. time out 10 times while waiting for a response from X that will nevercome.B. will never be able to find out any NS info from X. So they can't findout what X's web server is, what it's MX record is, and so on. It willlook to the victim like X does not exist any more.


I want to be very very careful while standardizing something this dangerous.

Greetings,
Radu.