Re: Unified SPF: block versus factored records for HELO and MTAMark scopes



On Jun 25, 2004, at 12:30 AM, Hallam-Baker, Phillip wrote:

Most Unix MTAs including sendmail, exim, qmail and (I believe) postfix
fork off a process per SMTP session which exits at the end of the
session.  In this case, any effort beyond that to validate the single
IP used in the session is wasted, and the local DNS cache is where the
info will be remembered.  My impression is that there are a lot of
large sites whose MTAs work this way.


I do not know what the situation is where dedicated edge servers
such as spam filtering systems are concerned. I strongly suspect
that these are based on very different architectures. Perhaps someone
could go and look at what some of these systems do.

Our Edge appliance (and Ecelerity MTA software) does not use a per-process model. It actually doesn't use a per-thread model either. It is a hybrid model where non-blocking ops are put into a small group of event engine threads and blocking ops are put into a larger group of worker threads.
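
To illustrate the general shape of such a hybrid, here is a minimal Python sketch. It is not Ecelerity's code: it uses a single asyncio event loop where we use a small group of event engine threads, and the names `nonblocking_dns_check`, `blocking_virus_scan`, `handle_session` and `run_sessions` are hypothetical stand-ins that only simulate work with sleeps.

```python
import asyncio
import time
from concurrent.futures import ThreadPoolExecutor


def blocking_virus_scan(message: bytes) -> bool:
    """Hypothetical blocking op: sleeps to simulate a scanner call."""
    time.sleep(0.01)
    return b"EICAR" not in message


async def nonblocking_dns_check(ip: str) -> bool:
    """Hypothetical non-blocking op: yields to the event loop while 'waiting'."""
    await asyncio.sleep(0.01)
    # Assumption for the demo: treat 192.0.2.0/24 (documentation range) as listed.
    return not ip.startswith("192.0.2.")


async def handle_session(ip: str, message: bytes, workers: ThreadPoolExecutor) -> bool:
    loop = asyncio.get_running_loop()
    # Non-blocking work stays on the event loop thread.
    dns_ok = await nonblocking_dns_check(ip)
    # Blocking work is handed to the worker pool so the loop never stalls.
    clean = await loop.run_in_executor(workers, blocking_virus_scan, message)
    return dns_ok and clean


async def run_sessions(sessions, worker_count: int = 8):
    with ThreadPoolExecutor(max_workers=worker_count) as workers:
        tasks = (handle_session(ip, msg, workers) for ip, msg in sessions)
        return await asyncio.gather(*tasks)


def demo():
    sessions = [
        ("198.51.100.7", b"hello"),
        ("192.0.2.1", b"hello"),
        ("198.51.100.8", b"EICAR test"),
    ]
    return asyncio.run(run_sessions(sessions))
```

The number of sessions in flight is bounded by memory per session rather than by the number of threads; only the blocking calls ever occupy a worker.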

As DNS queries have no good reason to block (there are a handful of free nonblocking DNS resolving libraries out there), we can realize high concurrency and throughput with DNSBL, SPF, DomainKeys, etc. all performed inline: 100,000 concurrent SMTP sessions all performing DNS-based lookups, MySQL lookups, LDAP lookups, virus checks and a slew of other things in real time (during the SMTP session).

Even on a UNIX box there are good reasons to prefer a per thread model
to a per process model; process creation and teardown in UNIX is only
lightweight compared to other operating systems.

Threads have costs too: smaller than processes in most cases, and almost negligible if they are user-space threads. However, each has its pros and cons, and you have to carry around that thread stack, so one thread per connection can be a big, big waste.
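
A rough back-of-the-envelope calculation shows why the stack matters at this scale. The stack sizes below are assumptions for illustration (8 MiB is a common default stack limit on Linux, 64 KiB a deliberately small tuned value), and the result is address space reserved, not necessarily resident memory.

```python
def stack_reservation_gib(connections: int, stack_bytes: int) -> float:
    """Address space reserved for stacks with one thread per connection, in GiB."""
    return connections * stack_bytes / 2**30


# Assumed stack sizes; adjust for your platform and thread library.
default_stack = 8 * 2**20   # 8 MiB
small_stack = 64 * 2**10    # 64 KiB

print(stack_reservation_gib(100_000, default_stack))  # 781.25
print(stack_reservation_gib(100_000, small_stack))    # 6.103515625
```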

There are also other ways that a UNIX system may be organized. For
example if I was operating a large cluster of mail filters I would
hive off the task of fetching MARID data and resolving all forms
of reputation and accreditation data in a separate system that
performed caching.


Yes... perhaps not "separate", but certainly cached.

I would not want to store state in the mail server itself for the
reason you mention and also because that would load up my external
connections (which I pay for) rather than my internal 1Gb ethernet
connections which are essentially free.

You can use a replicated data store: a shared, consistent, replicated cache across the mail servers. That is much better and more reliable than a "backend store", which usually has to be over-engineered to avoid a single point of failure.
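
As a toy illustration of the idea (not how any particular product does it), the Python sketch below keeps a full copy of the cache on every node and pushes each write to all peers, so reads are always local and no single node is a point of failure. The class `ReplicatedCache` and its methods are hypothetical; replication here is a direct in-process call, where a real deployment would need a network transport, comparable clocks, and conflict handling.

```python
import time


class ReplicatedCache:
    """Hypothetical in-process model of a cache replicated to every mail server."""

    def __init__(self, name, clock=time.monotonic):
        self.name = name
        self.peers = []
        self._store = {}      # key -> (value, expiry time)
        self._clock = clock

    def join(self, *peers):
        """Register the other nodes that should receive our writes."""
        self.peers.extend(peers)

    def put(self, key, value, ttl):
        """Store locally, then push the same entry to every peer."""
        expires = self._clock() + ttl
        self._apply(key, value, expires)
        for peer in self.peers:
            peer._apply(key, value, expires)

    def _apply(self, key, value, expires):
        self._store[key] = (value, expires)

    def get(self, key):
        """Purely local read; returns None on a miss or an expired entry."""
        entry = self._store.get(key)
        if entry is None:
            return None
        value, expires = entry
        if self._clock() >= expires:
            del self._store[key]
            return None
        return value


# Usage: a record cached via mx1 is immediately readable on mx2.
mx1, mx2 = ReplicatedCache("mx1"), ReplicatedCache("mx2")
mx1.join(mx2)
mx2.join(mx1)
mx1.put("spf:example.org", "v=spf1 -all", ttl=300)
print(mx2.get("spf:example.org"))  # v=spf1 -all
```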


// Theo Schlossnagle
// Principal Engineer -- http://www.omniti.com/~jesus/
// OmniTI Computer Consulting, Inc. -- http://www.omniti.com/
// Ecelerity: fastest MTA on Earth
