Re: [Asrg] About that e-postage draft [POSTAGE]

Peter J. Holzer wrote, On 2/22/09 6:53 AM:

On 2009-02-21 19:07:09 -0500, Bill Cole wrote:

Steve Atkins wrote, On 2/20/09 7:26 PM:
On Feb 20, 2009, at 3:25 PM, Bill Cole wrote:
[Quoting John Leslie]
 The bottom line is, redeeming a million tokens per second is practical
with processing delay not much greater than network latency. (This was
not true ten years ago...)
I think I'd quibble with details on that, but they really are not allthat
important.
I guess maybe one detail is...
If the processing delay for every redemption attempt is of the same orderof magnitude as irreducible network latencies, i.e. tens to hundreds ofmilliseconds, handling a million one-use token redemption attempts persecond is absolutely hopeless.


I think John meant "processing delay" as seen from the client. So this
includes a) the time to send the request (tens to hundreds of
milliseconds), b) the time it takes the server to process the request
(sub-milliseconds?) and c) the time for the response to get back to the

client (again tens to hundreds of milliseconds).

The number of requests a server can process per second is mostly
determined by b). The communication delays only increase the number of
open but idle connections (which may also be a problem).


Just in case anyone misconstrues your use of the term "connections":

Using one TCP connection per transaction for a system that needs to completea million transactions per second across the Internet would be aridiculous design flaw.

I therefore assume that you mean "connection" in a more generic sense, andindeed that was the core of my argument: if the server has to retain stateon transactions for typical Internet RTT's for a common class oftransaction, concurrency is likely to become a problem in and of itself.

So if checking a single token takes 1 millisecond (rather conservative,
IMHO), one server can check roughly 1000 tokens per second, so a server
farm of 1000 servers can check 1 million tokens per second. That doesn't
seem "absolutely hopeless" to me. It's certainly technically possible,
although I'm sceptical whether it's economically feasible.

Your hand waves so elegantly when forming the phrase "server farm of 1000servers."

I believe that dividing the workload between multiple front-end machineswill either create unworkable latencies in the back end to assure that allof the front ends see a coherent state database for tokens or else willcreate vulnerabilities that will allow any significant botnet operator toeffectively eliminate chunks of the system at will. Hand-waving a 1000-nodeserver farm doesn't persuade me that I'm wrong.

I'm pretty sure that I'm not the best systems analyst/designer on thislist.I certainly hope I'm not the best one to have thought about e-postage.I'dbe happy to learn from a master how it is in fact possible to make anideally simplified minimal system like this work as a starting pointfor how to assemble a more complex system that has more elements ofreality in it. I think (but may be wrong!) that it isn't possible todesign a system that will be theoretically capable of correctlyhandling a million redemption requests per second of which ~90% arethe result of someone working to break the system.
It's fun to consider, though.
Using your numbers - one million redemption requests a second of whichat least 90% are invalid, leads to 100,000 outstanding valid requestsper second, which would give around 250 billion outstanding stamps atany one time, if we expire them after a month (expiring would likelyinvolve voiding the unused stamps and issuing new ones in the sameamount, but that's a business issue, and doesn't affect the redemptionrequirements).
I expect most tokens would be redeemed pretty soon, so you won't have
to keep them around for a month, at least not for real-time checking.

Someone who thinks e-postage is workable should make a real specificationthat defines "pretty soon" in concrete terms that would support an actualprotocol design that expires tokens. That "someone" won't be me, and Iexpect that it won't be anyone else who has been asking for such aspecification. E-postage has been at the hand-waving idea stage for over adecade, and it's becoming a bit comical.

To me it feels like the hard bit of this is handling a million packetsin and out per second reliably, along with the overhead of providingrobustness and redundancy, rather than the redemption itself.
That was my point, because it seems to me that a redemption cannot be donewith just one packet in and one out, but really needs two in and one out.A legitimate stamp needs to have 3 possible states in the server's map:redeemed, unredeemed, and pending acknowledgment of redemption. If theserver only has two states for a stamp, then it would end up with one oftwo flaws by design:
1. If the stamp is marked as redeemed when a successful redemption attemptcompletes on the server, it is possible that the success will not besuccessfully communicated to the client. If the client then retries theredemption, it will fail.
Yes, but is this a problem which needs to be avoided at all cost?

I think so, but then I think the whole e-postage concept is a misguidedfantasy in multiple aspects. If you think it could actually survive havingrandom stamps made worthless and indicative of fraud by dropped packets, Iurge you to implement it and demonstrate that people will tolerate such asystem.


> If the

redemption fails, the client can simply buy another stamp and send the
message again. Delivery of that particular message was now twice as
expensive as that of an average message, but if that only happens
infrequently it doesn't matter. If it does happen frequently for a
particular bank, the affected client(s) will probably be reconfigured to

prefer tokens from other banks.

There's not any strong reason for such a problem to be bank-specific. Itcould just as easily be tied to where the redemption attempt is coming from.

So if your 50k clients try to redeem the same token at the same time,
one of these 50k requests will be the first to be processed by the
server. The server marks the token as redeemed and sends a positive
reply to the client. The other 49999 requests will be denied. If
(because of network congestion caused by the 50k nearly simultaneous
requests) the reply never reaches the "lucky" client, the token will be

lost.

Sending 50k requests with the same token is garantueed to yield at most
one positive reply in this case. It isn't viable for someone who wants
to send many messages, but it may be a viable DoS attack.

Botnet spammers have a history of attacking anti-spam systems. Any mailingtactic capable of crippling an e-postage system, even temporarily, will betried.

2. If the stamp is left as unredeemed while waiting for the client ack ofsuccess, stamp "reuse" becomes a question of how many redemption decisionscan be made per client RTT.
The server may have to defer many thousands of clients for scores ofmilliseconds while waiting for one to send an ack.
Yes, but it only needs to defer those clients which sent a request with
the same token. It can process lots of other tokens in the meantime.


If designed correctly, that might be the case. What is that design again?

Spammers will work to break any e-postage system. They will be the indirectsources of most of the redemption requests, so whatever inconvenient casesare possible can be expected to make up almost all of the request traffic.

Also the server doesn't have to keep the connection open. If the token
it wants to check is currently in the "pending" state, it can reply with
a temporary error and leave it to the client to retry at a later time.

Then the system is NOT handling that redemption attempt. The million-TPSscaling is based on the number of redemptions that have to be completed, nottried. If trying a redemption once doesn't dispose of that token, you needto scale up to handle the retries.

This is a problem that is familiar to many MTA admins: if you defer clients,you increase how many client connections you get.

I think the ways to handle that all include dividing the front end
between multiple machines, but that creates a tougher problem keeping
the back end recordkeeping fast and coherent from the viewpoints all
of the front ends.


I think it is possible to solve the coherency problem by using the token
to choose the server. Then there is always exactly one server
responsible for a specific token and no coherency problem does occur.
The draft doesn't specify the protocol used to buy or redeem tokens, so
this could be done in the client: A token might consist of two parts,
the first one identifies the server to connect to, the second is unique
within that server.

So, you have something embedded in the token that tells the MTA trying toredeem it which one of 1000 servers to contact for redemption, each one ofwhich is capable of handling 1000 redemptions per second? With spammerscontrolling botnets of 100k machines? Is there a punch line?

If e-postage is ever going to be more than hand-waving about a concept,someone who is able to put in disciplined, serious system design work isgoing to need to be convinced that there's enough potential in e-postage toput together a redemption protocol that doesn't generate derisive giggling.

I suspect that the "solution" that will be chosen if anyone tries to createa real e-postage system instead of hand-waving about it will be to open itto lost packet damage as the cost of scalability.
Yes, that's what I'd probably do.


Why not actually do it?

Network latency with clients becomes irrelevant if the server assumes
that its redemption messages are always delivered. That allows for a
lot of optimization. Once in a while a stamp that should work will
fail to do so, and if such a system ever gets into the real world I'm
sure its users will be shocked at how much higher the real-world
failures are than in their tests...


If you are still cheaper than the competition, that doesn't matter. If
you aren't, but already have sufficient market share (because you were
one of the first), it might not matter, either. Otherwise you're out of
business.

Demonstrate that *anyone* will accept a new sort of currency designed torandomly be worthless based on the vagaries of Internet packet receptionbefore making up stories about competitive providers of such magic scrip.

_______________________________________________
Asrg mailing list
Asrg(_at_)irtf(_dot_)org
http://www.irtf.org/mailman/listinfo/asrg