Re: Re: DNS load research

Andy Bakun wrote:

On Wed, 2005-03-23 at 17:57 -0500, Radu Hociung wrote:
That is not what Radu said!  He made no claim that the virus would stop,
just the DDOS attack.  In fact, he did not say the DDOS would stop, just
that the internet would be back on its feet.  Maybe not running, but
standing anyway.
Thank you Guy, that is exactly what I meant. In fact, removing the TXT records would only take the amplification factors away. At that point this virus would become as tame as any other.
Okay, I see what you're saying. Agreed.


Cool :)

I was thinking some more about the scenario, and I think that the following conclusion did not come across very well:

The severity of the DDOS is a quadratic function of the query limit implemented by the SPF checkers.

Each of the two multiplication factors is a linear function of the query limit. Since they get multiplied, the overall result is proportional to the square of the query limit.

Indeed, if ohmi.org were limited to 10 queries, the numbers would become (L=limit):


Traffic magnification: (2 * L * 100) / 60 = 33.33
Time magnification: (200ms * L) / 50ms = 40
Total amplification: 33.33 * 40 = 1,333
Aggregate DNS traffic: 1,333*1Mbit/s = 1.33Gbps

Here's a quick table I put together that shows the growth, for the assumed network conditions I specified (60-byte attack packet, 100-byte query packets, 50ms connection time, 200ms DNS query round trip):


Limit        Traffic amplif. Time ampl. Total amplif.
1            3.33            4          13.33
2            6.67            8          53.33
3            10.00           12         120.00
4            13.33           16         213.33
5            16.67           20         333.33
6            20.00           24         480.00
7            23.33           28         653.33
8            26.67           32         853.33
9            30.00           36         1080.00
10           33.33           40         1333.33
11           36.67           44         1613.33
12           40.00           48         1920.00
13           43.33           52         2253.33
14           46.67           56         2613.33
15           50.00           60         3000.00
16           53.33           64         3413.33
17           56.67           68         3853.33
18           60.00           72         4320.00
19           63.33           76         4813.33
20           66.67           80         5333.33
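
The table can be reproduced with a few lines of Python. This is only a sketch of the arithmetic above; the function and parameter names are made up for illustration, and the factor of 2 is taken straight from the traffic formula (presumably one query packet and one reply packet per lookup).

```python
def amplification(limit, attack_bytes=60, query_bytes=100,
                  connect_ms=50, dns_rtt_ms=200):
    """Return (traffic, time, total) amplification for a given query limit.

    Defaults are the network conditions assumed in the post.
    """
    # Factor 2 comes from the formula in the post: (2 * L * 100) / 60.
    traffic = 2 * limit * query_bytes / attack_bytes
    # Lookups are assumed to run one after another, one round trip each.
    time = dns_rtt_ms * limit / connect_ms
    # Both factors are linear in the limit, so the product is quadratic.
    return traffic, time, traffic * time


if __name__ == "__main__":
    print(f"{'Limit':>5} {'Traffic':>8} {'Time':>6} {'Total':>12}")
    for limit in (1, 5, 10, 20, 111):
        traffic, time, total = amplification(limit)
        print(f"{limit:>5} {traffic:>8.2f} {time:>6.0f} {total:>12.2f}")
```

Doubling the limit quadruples the total, which is the quadratic growth the table shows.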

The existing records that max out due to loops which are not easily detectable, combined with the existing checkers based on the current version of the draft (111 query limit), make a dangerous combination, in my view. Fortunately they are relatively few.

I think that in selecting a limit for the SPF draft we need to keep in mind how a potential DDOS would scale.

If DNS servers included a record compilation function, then it would be no problem to have a very low limit for SPF records that are given by authoritative servers in response to queries, and an arbitrary, much higher limit for the records that the compiler uses as input.

In that case, the spec might say something like "receivers must check up to X queries; if a record is hosted on a compiling server, it must compile to an equivalent record of X or fewer queries; otherwise, it must contain no more than X queries". Note that when a record gets compiled, it is the number of resulting bytes that ultimately matters. If you list a single A mechanism that has 1000 IP addresses, then that would require more than X linked records after compilation. On the other hand, if you have a very convenient record that contains lots of redundancies, it may reduce to a very small record.

I'm beginning to think that mechanisms with {ip_addr} and {sender_username} macros should be limited to 1 per domain, because they're not cacheable. {d} and {o} macros can be allowed anywhere, as they have no consequence on caching, but this may need more careful consideration.

Thus, the record configured by the zone admin and the record served by the server software would be completely different, but equivalent. The former would be heavy with convenient A, MX, INCLUDE mechanisms, while the latter would be a list of IPs ending in a redirect.
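
To make the idea concrete, here is a rough Python sketch of what such a compile step could look like. It is not the libspf2 code: the function names and the lookup tables standing in for DNS are hypothetical, it only handles unqualified `a`, `mx`, `ip4` and `include` terms, and it treats `include` as plain inlining of the included IPs, which is a simplification.

```python
# Hypothetical zone data standing in for real DNS lookups.
A_RECORDS = {
    "example.org": ["192.0.2.1"],
    "mail.example.org": ["192.0.2.10"],
    "other.example": ["198.51.100.5"],
}
MX_RECORDS = {"example.org": ["mail.example.org"]}
SPF_RECORDS = {
    "example.org": "v=spf1 a mx include:other.example -all",
    "other.example": "v=spf1 a ip4:192.0.2.1 -all",
}


def _expand(domain, a_records, mx_records, spf_records, seen):
    """Return (ips, tail) for the domain's record; tail holds untouched terms."""
    if domain in seen:                      # guard against include loops
        return [], []
    seen.add(domain)
    ips, tail = [], []
    record = spf_records.get(domain, "v=spf1")
    for term in record.split()[1:]:         # skip the "v=spf1" version tag
        name, _, arg = term.partition(":")
        if name == "ip4":
            ips.append(arg)
        elif name == "a":
            ips.extend(a_records.get(arg or domain, []))
        elif name == "mx":
            for host in mx_records.get(arg or domain, []):
                ips.extend(a_records.get(host, []))
        elif name == "include":
            sub_ips, _ = _expand(arg, a_records, mx_records, spf_records, seen)
            ips.extend(sub_ips)             # the included record's tail is dropped
        else:
            tail.append(term)               # e.g. "-all" or "redirect=..." kept as-is
    return ips, tail


def compiled_record(domain, a_records, mx_records, spf_records):
    """Build an equivalent record that needs no further lookups for its IPs."""
    ips, tail = _expand(domain, a_records, mx_records, spf_records, set())
    unique = list(dict.fromkeys(ips))       # drop redundancies, keep order
    return " ".join(["v=spf1"] + [f"ip4:{ip}" for ip in unique] + tail)


if __name__ == "__main__":
    print(compiled_record("example.org", A_RECORDS, MX_RECORDS, SPF_RECORDS))
```

A real compiler would also have to split the output across several linked records once it outgrows a single TXT record, which this sketch ignores.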

I think the compiling DNS server idea is brilliant, and it is just the solution to the DDOS problem described, but only if it is coupled with a low limit on the checker processing. Otherwise, it's only marginally useful.

The version of libspf2 that I am working on will include compile functionality, and I will also demonstrate a patch to MyDNS that implements a built-in compiler. I suspect it will be a very simple patch as the compile function is in the library. Then perhaps others can follow the example and create patches for other name servers.

If standardization of SPF is some time away (1 year or more), there will be plenty of transition time for name servers to be updated. Note that not all domains will need to update their server; they can simply use spfcompile and $INCLUDE its output, or just publish records that comply with the draft. Public NS services that allow users to publish arbitrary TXT records will likely want to consider updating, in order to offer maximum convenience to their customers.

This way, the DNS servers are not _required_ to be upgraded. If not upgraded, the records published through them will have to be cheap. If upgraded, the records published through them can be convenient. I think this will be a good incentive for those who want to publish convenient records to upgrade, without forcing anyone else to upgrade.

Some discussion happened before on how the record is compiled, and the TTLs. This will be very important to get right. For instance, the compiled record should have a TTL equal to the shortest TTL among the expanded mechanisms (if you list a 3-hour A record, the compiled TXT will have a 3H TTL; but if all your listed mechanisms have a TTL of 2 days, the compiled record will also have a TTL of 2 days). Thus, the compiled record will be TTL-equivalent to the input SPF record. Also, the SPF record doesn't need to be compiled more often than its resulting TTL. It will also be compiled once when the zone is reloaded or the server started, and the compiled version will never be stored and reloaded into an active server (i.e., it will be a dynamic compiled record, not a static compiled record).
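
The TTL rule is simple enough to write down. The following is a minimal Python sketch with hypothetical function names, not the libspf2 interface; TTLs and timestamps are plain seconds.

```python
HOUR = 3600
DAY = 24 * HOUR


def compiled_ttl(mechanism_ttls):
    """TTL of the compiled TXT record: the shortest TTL among its inputs."""
    if not mechanism_ttls:
        raise ValueError("at least one TTL is required")
    return min(mechanism_ttls)


def needs_recompile(compiled_at, ttl, now):
    """The record never needs recompiling more often than its own TTL."""
    return now - compiled_at >= ttl


if __name__ == "__main__":
    # One 3-hour A record among 2-day records pulls the result down to 3 hours.
    print(compiled_ttl([3 * HOUR, 2 * DAY, 2 * DAY]))
    # If every input has a 2-day TTL, the compiled record keeps 2 days.
    print(compiled_ttl([2 * DAY, 2 * DAY]))
```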

The libspf2 library I am working on does all the things that I describe.

It has been mentioned that the %{i} macro could be included in the query, and then the server could reply with PASS/FAIL. I think this is a bad idea, because all those queries are uncacheable, so this truly circumvents the benefits that were designed into DNS. When the DDOS attempt does happen, caching can really help lower the impact. It may be that I didn't understand the proposal well enough.
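
The caching point can be illustrated with a toy model in Python. Everything here is an assumption made for illustration: the cache is a plain set with no TTL expiry, and the query-name layout for the per-IP case is invented, not taken from any proposal.

```python
def count_upstream_queries(client_ips, query_name_for):
    """Count queries that miss a resolver cache and reach the authoritative server."""
    cache = set()                 # toy cache: names seen so far, no TTL expiry
    upstream = 0
    for ip in client_ips:
        name = query_name_for(ip)
        if name not in cache:     # cache miss: the query goes upstream
            cache.add(name)
            upstream += 1
    return upstream


def per_ip_name(ip):
    """Hypothetical layout where the client IP is embedded in the query name."""
    return f"{ip}._spf.example.org"


def static_name(ip):
    """The query name does not depend on the client IP."""
    return "example.org"


if __name__ == "__main__":
    ips = [f"10.0.{i // 256}.{i % 256}" for i in range(1000)]
    print(count_upstream_queries(ips, per_ip_name))   # every distinct IP misses
    print(count_upstream_queries(ips, static_name))   # only the first lookup misses
```

With the client IP in the query name, each distinct sender produces a new name, so the cache absorbs nothing.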


Regards,
Radu.