Re: [openpgp] 4880bis: Update S2K

In summary, I would be more concerned with users getting wrongimpression that somehow their low-entropy password will get "upgraded"by a new S2K scheme. There is a cost of change, and I think if theOpenPGP does change the S2K, the choice should bring concrete additionalbenefits.


Here is why:

1. One should keep in mind what value the iterative methods like S2K orPDKDF actually provide. Assuming that the number of iterations is c andthe password space has 2^n entries, the work factor for legitimate usersis c*2^n, as opposed to 2^n without iterated calculations. Doubling thec slows the computation time in half for the attacker as well as for thethe password holder; this is equivalent to adding 1 bit of extra entropyto password space. This offers one of the worst cost-benefits tolegitimate users. Just compare this with any other computationalsecurity schemes, such as public key crypto (exponential v.s. polynomialcomplexity).


2. The Iterated S2K is essentially a

   M = M1 || M2 || M2 || M2 || ... || M2, where M1 includes the salt.
   S2K = Hash( M )

(assuming that hash output is no smaller than the number of bitsrequired for the key).

In many respects, this construction is easier to analyze. There is noattempt to "fix" the hash function as in some other schemes. Imagine asponge construction like SHA3 Keccak or many other hash functions withsufficient internal state (probably including SHA-256), which should befine to handle the above task.

3. The max number of iterations that the S2K counter encodes is c =16+15^(15) + 6 > 2^58

For SHA1 S2K this is over 2^53 invocations and about the same forSHA-256. SHA-256 can only hash 2^64 bytes. Likewise, AES-256, as any16-byte block cipher, is considered insecure after about the same numberof invocations due to the birthday bounds limitation.


This is plenty of iterations for the foreseeable future.

4. One can argue that the S2K construction is one of the strongest inactive use today. If we assume that it's possible to recover M given theS2K as defined the above, i.e. to invert a hash function (even assumingsomehow that S2K value is known, which it is not) then many existingschemes will be broken. For example, an attacker that is able to get Mfrom S2K can use the same method to recover the authentication key Kfrom an HMAC MAC value, leading to the forgery of MAC.

The proof is easy: use the "oracle" that returns M from S2K = Hash( M )to recover the HMAC key as follows: get M' = K ^ opad | H( K ^ ipad | m), which reveals the K as M' ^ opad with appropriate truncation.

The other problem one might try to solve with S2K construction is howthe scheme behaves with non-uniform hash functions. ( Imagine that ahypotherical Hash() often returns all-zero output for random M).However, we don't intend to use these hash functions, will quicklyreplace them if we discover that we do.

It's hard to see that S2K and PBKDF are materially different. ( In lessformal language: PBKDF2 should have the same resulting weakness, butlong before this the collision resistance will likely be broken ).

Both schemes have the same problems. For example, both have anunfortunate property that if the two invocations of PBKDF (or S2K) usethe same salt and password but different counters c1, c2, the S2K orPBKDF can be calculated from the previous values with just |c2-c1|iterations. I.e. it seems like an oversight that c was not hashed. Aswas noted by others, it may be desirable to increase memory footprintrequirements.

HKDF is not designed to handle S2K. (It's main target are things likederivation of an AES key from a DH shared secret). At least it needs thestretching step defined.


_______________________________________________
openpgp mailing list
openpgp(_at_)ietf(_dot_)org
https://www.ietf.org/mailman/listinfo/openpgp