Re: [ietf-smtp] Reducing minimum recipient limit?

On 12/16/19 3:29 PM, Brandon Long wrote:

On Mon, Dec 16, 2019 at 5:45 AM Keith Moore <moore(_at_)network-heretics(_dot_)com> wrote:
    On 12/16/19 8:01 AM, Tony Finch wrote:
    this is an area where practice appears to have diverged from the standard.
    I am not keen to see the limit reduced, but perhaps advice to clients to
    default further below 100 than they may be doing already may be warranted.
    I can only speak to Postfix, which defaults to sending up to 50 and
    accepting 1000.
    I would like a requirement that servers must not reject the entire message
    if there are up to 100 RCPTs even if the server returns 452 for some of
    them. Senders can then pipeline safely.
    Part of me wonders why there's a minimum number of RCPTs at all
    if, practically speaking, the client has to be able to deal with a
    server-imposed limit no matter where it is.

    Seen in this light, the minimum requirement seems like more of a
    guide for optimization by clients than a requirement for servers.

    Then again, if existing clients are currently assuming 100 or some
    substantial fraction thereof, I would not want to break them by
    telling servers that they can now impose whatever limit they like.

    I am tempted to suggest that there be a MAXRCPTS EHLO response to
    let the server advertise its explicit limit.  But again, this
    would give license to servers to change existing limits.

    Other than the rare server that really cannot store 100
    recipients, is there some operational reason to permit fewer?  
    Did someone empirically determine, or arbitrarily decide, that
    spambots would be more likely to send 100 recipients than
    legitimate senders?
The limit that Gmail imposes is not based on a specific number of recipients. Instead, it's based on available resources and the in-memory footprint of the rules necessary for handling each recipient. GSuite allows for creating rules that can be made complicated by admins who do things like try to implement their own anti-spam system, or write tools to convert from their previous system to ours in less than ideal conversions, as well as having different rules per recipient... and then claim there is no way to reduce their rules, regardless of how un-useful they are (see the other thread about ip-literals in EHLO except writ large). Couple that with some less than ideal implementation for the rules system itself (probably fine for small sets of rules, but very painful for large sets) and you end up with some worst case scenarios in the 50MB per recipient range.
There's an open bug to remove those particular resource-based temp failures as we've evolved the system to reduce the problem, including properly 4xx'ing recipients that have different rules (before there were complicated attempts to merge the results or accept messages and generate bounce messages later if there were mixed results), so we may go back to accepting the RFC standard minimums most of the time next year.
That said, multiple addresses per transaction has been a pretty rare feature. I'm sure our workload is biased because of how rarely we send multiple transaction mail ourselves (see also enterprise rules engine on sending with different rules per recipient), but last I checked the average per transaction was 1.1. As we've had to move more computation to be at smtp time, some of which is per-recipient, including too many recipients in a transaction increases the number of operations that have to complete successfully, some of which may be cross oceanic (you think these two addresses are in the same domain, but they actually work for different parts of an international enterprise which has different data geoloc requirements for the recipients in different offices), increasing the chances of a failure which will temp fail the entire transaction. Keeping the number of operations small and contained leads to more predictable performance and better fault isolation.
I.e., it's cheap to say "send this to all these people", but much harder to know whether every one of those people can accept it.
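
The point about more per-recipient operations raising the chance of a whole-transaction temp failure can be illustrated with a small calculation. This is only a sketch under a simplifying assumption that is not from the message above: each operation fails independently with the same probability. The function name and the sample values are hypothetical and are not Gmail figures.

```python
def transaction_failure_probability(p_op: float, n_ops: int) -> float:
    """Chance that at least one of n_ops operations fails.

    Assumes (for illustration only) that every operation fails
    independently with the same probability p_op.
    """
    if not 0.0 <= p_op <= 1.0:
        raise ValueError("p_op must be between 0 and 1")
    if n_ops < 0:
        raise ValueError("n_ops must be non-negative")
    # The transaction succeeds only if all n_ops operations succeed.
    return 1.0 - (1.0 - p_op) ** n_ops


if __name__ == "__main__":
    # Hypothetical per-operation failure rate, chosen only for the demo.
    p = 0.001
    for n in (1, 10, 100):
        print(n, round(transaction_failure_probability(p, n), 4))
```

Under that assumption the failure chance grows roughly linearly with the number of operations while it is small, which is consistent with the preference for small, contained transactions.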
Brandon

Reading this, it appears to me that this is basically a tradeoff between reporting per-recipient failures during the SMTP session versus reporting (some of) them via one or more NDNs. Of course NDNs have various costs, not only to generate and transmit them, but also in accepting a lower probability of the errors actually being reported to some party that can address the issue that caused the delivery failure.

Regarding what the standards prescribe, my sense is that the standards should give implementors a lot of leeway, but should be constructed so as to /maximize the reliability of successful delivery of (legitimate) email/. (Granted that "legitimate" is hard to define but I think it has to do with two broad factors - one is whether the content of the message is appropriate, and the other is whether the message is correctly representing its sender and recipients and signal path; and NOT anything to do with things unrelated to message content, say, how many recipients were in the envelope or whether an IP address literal was used in EHLO.)

So does a hard requirement on how many recipients a server should be able to accept, improve or degrade the reliability of email delivery? I'm not sure, but I immediately see two consequences of not having a minimum requirement:

1. having a minimum recipient requirement appears to shift some of the error reporting to NDNs, with lower reliability for such reporting.

2. not having a minimum requirement may shift some implementation burden (and potential for errors) from servers to clients (by requiring clients to deal more flexibly with SMTP error codes), perhaps resulting in more errors occurring earlier in the signal path with consequently less probability of messages being delivered. Alternatively, the client can behave like qmail and only relay one recipient per envelope. This makes the client code simpler and less error-prone, but it also consumes more network resources, client resources, and server resources, which might also have an adverse effect on reliability if resources are limited at any point on the signal path between client and server.
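
To make "deal more flexibly with SMTP error codes" concrete, here is a minimal sketch of a client that carries recipients deferred with a 4xx reply (such as 452) over into a later transaction instead of failing the whole message. It is not taken from any real MTA. The `send_rcpts` callback, `fake_server`, and its limit of 2 are hypothetical stand-ins for a real SMTP session, and a real client would also issue MAIL FROM and DATA and schedule retries from its queue.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical callback type: takes the recipients offered in one
# transaction and returns the server's reply code for each RCPT.
SendRcpts = Callable[[List[str]], Dict[str, int]]


def classify_rcpt_replies(
    replies: Dict[str, int]
) -> Tuple[List[str], List[str], List[str]]:
    """Split recipients into accepted (2xx), deferred (4xx), rejected (other)."""
    accepted, deferred, rejected = [], [], []
    for rcpt, code in replies.items():
        if 200 <= code < 300:
            accepted.append(rcpt)
        elif 400 <= code < 500:
            deferred.append(rcpt)  # e.g. 452: try again in a later transaction
        else:
            rejected.append(rcpt)  # e.g. 550: permanent failure
    return accepted, deferred, rejected


def deliver_in_batches(
    recipients: List[str], send_rcpts: SendRcpts, max_rounds: int = 10
) -> Tuple[List[str], List[str], List[str]]:
    """Offer recipients repeatedly, carrying deferred ones into the next round.

    Returns (delivered, failed, still_pending).
    """
    pending = list(recipients)
    delivered: List[str] = []
    failed: List[str] = []
    for _ in range(max_rounds):
        if not pending:
            break
        accepted, deferred, rejected = classify_rcpt_replies(send_rcpts(pending))
        delivered.extend(accepted)
        failed.extend(rejected)
        pending = deferred
        if not accepted:
            # No progress this round; leave the rest for a later queue run.
            break
    return delivered, failed, pending


def fake_server(batch: List[str], limit: int = 2) -> Dict[str, int]:
    """Toy server for the demo: accepts `limit` RCPTs, then answers 452."""
    replies: Dict[str, int] = {}
    ok = 0
    for rcpt in batch:
        if rcpt.startswith("bad"):
            replies[rcpt] = 550
        elif ok < limit:
            replies[rcpt] = 250
            ok += 1
        else:
            replies[rcpt] = 452
    return replies


if __name__ == "__main__":
    rcpts = ["a@x.example", "b@x.example", "bad@x.example", "c@x.example"]
    print(deliver_in_batches(rcpts, fake_server))
```

Setting the client to offer a single recipient per transaction corresponds to the qmail-style behaviour mentioned above: the loop becomes trivial, at the cost of more transactions.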

I suspect that all of these considerations pale in comparison with the adverse effect on reliability caused by bogus spam filters (which seems to be part of the reason for not supporting a minimum number of recipients per envelope). So I might be inclined to suggest that anything that encourages more message filtering via dubious criteria should be discouraged by future revisions of the standards and/or by any operational recommendations we might someday publish.


Keith
