Re: Last Call: 'Key Change Strategies for TCP-MD5' to Informational RFC

On 28-sep-2006, at 22:54, The IESG wrote:

The IESG has received a request from an individual submitter toconsider
the following document:

- 'Key Change Strategies for TCP-MD5 '
   <draft-bellovin-keyroll2385-03.txt> as an Informational RFC

The IESG plans to make a decision in the next few weeks, and solicits
final comments on this action.


Since my earlier comments to the author were apparently ignored:

Implementation of this draft allows for misconfiguration andoperational problems that can't happen without implementation of thedraft. As such, it should be considered harmful and the draftshouldn't be published, regardless of its intended informational status.

When two routers run BGP with the TCP MD5 option, and one implementsthis draft, key rollover will indeed be easier, if the upgraded sideis first configured with the new key (which it won't use at thispoint), then the non-upgraded side is configured with the new key andimmediately starts using it, which is detected by the upgraded side.This completes the key change immediately after the non-upgraded sideconfigures the new key.

The problems start when BOTH sides implement the new mechanism. Inthat case, new keys will remain unused for some time, and then becomeactive at some hard-to-determine time in the future. (Neither sideknows for sure when the other side will switch to the new key.) Thismeans that there will be a problem in the case where the new keyisn't present on both sides, for instance because one side wasn'tconfigured with the new key in a timely fashion, despite out-of-bandagreement to do so, the keys configured on both sides don't match.

In this case, one side will start using its new key at some point intime. If the other side doesn't have the same key, it can't validatethe TCP segment so the segment is dropped. In theory, it's possibleto recover from this condition by adding logic that observes the TCPstate, but I don't see how this can be made fully reliable,especially given the wide variety of TCP implementations and otherenvironmental factors such as BGP (in)activity, packet loss andreduced response times because of high CPU loads.

So in a good number of cases, TCP segments remain unvalidated for toolong and the BGP session breaks. The really bad part is that thishappens at some unpredictable interval AFTER operator action, sooperator error doesn't create any usable feedback. Today, feedback isimmediate and conclusive. So the new situation is vastly inferiorfrom an operational robustness perspective.

This problem can easily be fixed by adding a BGP capability code anda new BGP message. The capability code would indicate support for thenew message, and the new message would be used by each BGP speaker tocommunicate the availability of a new key, along with a hash over thekey so the BGP speakers know at which point the other side has thenew key available, and that the new key is indeed the same as thelocally configured one. These types of additions to the BGP protocolare well-understood and shouldn't lead to significant additionalimplementation difficulty.

(What follows isn't specific to the draft under consideration andshouldn't be taken as input on how to change this particular draft.)

As long as I'm taking up bandwidth, let me address a more fundamentalproblem with this draft and several others addressing the same orsimilar issues. (It would be nice by the way to have a single venuewhere all of this is discussed, in Montreal the discussion moved fromworking group to working group and was therefore extremely hard tofollow for everyone who didn't make an express effort to do so.)

The real problem is agreeing to a key with people from another AS.It's not uncommon for network operations staff for two ASes to residein different timezones, to speak different languages and to havewildly dissimilar operational mores. This makes seemingly trivialtasks such as finding a person who can agree to a key and finding asecure channel to communicate the key very hard. The particular issuethat this draft addresses, which is agreeing on a time when the keysare changed, is indeed also an issue but in my experience, it's notthe most problematic one in practice. The reason for this is that inpractice, keys are rarely changed after they've been set upinitially. I estimate that I've done some 200 inter-AS-session-yearsworth of BGP operational management, and I can't remember ever havingbeen asked to change an existing BGP TCP MD5 password. The assumptionthat these keys are so sensitive that they must be changed regularlysimply doesn't hold in practice.

But suppose that the keys must indeed be changed often. The problemsunrelated to the actual time of the change remain unaddressed here.This is also true of the other proposals that I'm aware of, whichaddress other problems such as the weakness of the MD5 hash and theway in which it's used here. In order for network operations to beable to change the actual session keys often, it's necessary to basethe actual session keys (and preferably, the keying informationconfigured on a router) without the need to agree to any specifickeying information out-of-band. This probably involves some kind ofpublic key encryption, where a session is not configured with anactual secret key, but with a fingerprint for a certificate held bythe remote router, or, better yet, the remote AS.


_______________________________________________
Ietf mailing list
Ietf(_at_)ietf(_dot_)org
https://www1.ietf.org/mailman/listinfo/ietf

Re: Last Call: 'Key Change Strategies for TCP-MD5' to Informational RFC (draft-bellovin-keyroll2385)