Re: [Asrg] Some data on the validity of MAIL FROM addresses

At 03:34 AM 5/18/2003 -0400, Kee Hinckley wrote:

Vernon has regularly made the claim that a significant proportion of
spam messages have valid MAIL FROM's.  That means that bounces will
go the the spammer.  This has significant ramifications for C/R
systems (especially auto-respond ones) since it means that should
they have to, spammers could respond to challenges.

To test this theory, I took a day's worth of bounce logs from
somewhere.com (2003-05-15).  These should be fairly normal logs.
There's been a bit of an upswing from a recent virus attack, but
otherwise these are pretty normal bounce logs for somewhere.com.

I ran a program which took each MAIL FROM address, parsed out the
domain portion, looked up the MX record, and then connected to the
SMTP port of the lowest numbered MX server.  I did a
        HELO somewhere.com
        MAIL FROM 
<postmaster+AntiSpamAddressVerification(_at_)somewhere(_dot_)com>
        RCPT TO <appropriate-address>
        QUIT
Note that a few sites bounced me at the HELO prompt (didn't like that
I was on DSL, or that my name was somewhere.com).  A few bounced at
the MAIL FROM (didn't like somewhere.com--and one claimed that +
wasn't a legal email character).  But the number of either of those
was pretty low (less than half a dozen).  I'll do a better job of
recording those separately in the future.

[.....]
In general though, it appears that Vernon is correct.  If my sample
is representative, a large percentage of spam is coming from real
email addresses.

I see a problem with this testing strategy - an SMTP server is does notnecessarily produce an error when receiving an RCPT TO command. See RFC2821, section 3.3:


----[snip]----

"However, in practice, some servers do not perform recipient verificationuntil after the message text is received. These servers SHOULD treat afailure for one or more recipients as a "subsequent failure" and return amail message as discussed in section 6. Using a "550 mailbox not found"(or equivalent) reply code after the data are accepted makes it difficultor impossible for the client to determine which recipients failed."

----[snip]----

And RFC 2821, Section 6.1.:

----[snip]----

"If there is a delivery failure after acceptance of a message, thereceiver-SMTP MUST formulate and mail a notification message."

----[snip]----

Therefore, it is not possible to determine with certainty whether theseaccounts actually existed. A better testing strategy would actually sendemail to these accounts with the DATA command and watch for bouncemessages. However, spammers can always choose to use a real email addressas the return address and sending email to valid accounts in itself may beconsidered spam by the recipients.


Yakov

---------------------------------------------------------------------------------------------------
Yakov Shafranovich / <research(_at_)solidmatrix(_dot_)com>
SolidMatrix Research, a division of SolidMatrix Technologies, Inc.
---------------------------------------------------------------------------------------------------
"One who watches the wind will never sow, and one who keeps his eyes on
the clouds will never reap" (Ecclesiastes 11:4)

---------------------------------------------------------------------------------------------------

_______________________________________________
Asrg mailing list
Asrg(_at_)ietf(_dot_)org
https://www1.ietf.org/mailman/listinfo/asrg