Re: [Asrg] definition of spam (was Re: consent expression)

At 8:58 AM -0500 3/6/03, Keith Moore wrote:

I also don't believe in treating one-to-many email differently than one-to-one
email, because it's the content of the email that the recipient cares about
rather than the number of recipients.


I disagree, and I have several examples.

I received a very nice letter (complete with a .sig of a sleeping cat) from someone wishing to subscribe to "your mailing list". I run several mailing lists. I came very close to replying, but the sender had pushed the limits just a little too much. She included a very nice (and perfectly innocent) photo in her mail--that's not a normal .sig. Investigation showed that the URL in her .sig was for a soft-porn site. Other than the picture (and some people *do* send email with their picture in the .sig, believe it or not), the only thing that distinguished this email from email that I normally get was the fact that it was sent to lots of people, not just me. A fact that could not be determined by any information provided in the message.

I received an abuse report. It was a complaint that someone was spamming using my domain, and advertising a particular site. Also something that I get not infrequently. The content was perfectly normal. The key was that I received the same report at two completely unrelated email addresses. It was in fact spam for the site listed in the abuse report.

Finally, one that came up at the MIT Spam Conference, and I'm sure many of us have seen. You receive a request to participate in a private conference. Sounds like a great opportunity to meet with peers working on similar things. But if you realized that the same invitation was sent to 100,000 other people, your opinion of the idea very quickly changes.



I believe there are three components to identifying spam.

1. Routing information*
2. Content analysis
3. Bulk identification

Neither 1 nor 2 is completely sufficient without 3. The fact that a message was sent to many people can significantly change its interpretation.
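To make the bulk component concrete, here is a minimal sketch of one way it could be approximated: hash a normalized copy of the message body and count how many distinct recipients have received it. This is an illustration only, not a description of any deployed filter. The names fingerprint and BulkDetector and the default threshold of 2 are hypothetical, chosen for the example, and an exact-match hash like this would be easy for a spammer to evade by varying the text.

```python
import hashlib
import re
from collections import defaultdict


def fingerprint(body: str) -> str:
    """Hash the body after lowercasing and collapsing whitespace."""
    normalized = re.sub(r"\s+", " ", body.lower()).strip()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


class BulkDetector:
    """Hypothetical helper: tracks which recipients saw each body."""

    def __init__(self, threshold: int = 2):
        # Assumption: two distinct recipients is enough to call it bulk.
        self.threshold = threshold
        self._recipients = defaultdict(set)

    def observe(self, recipient: str, body: str) -> bool:
        """Record a delivery.

        Returns True once the same normalized body has reached at
        least `threshold` distinct recipients.
        """
        seen = self._recipients[fingerprint(body)]
        seen.add(recipient.lower())
        return len(seen) >= self.threshold
```

Feeding it the abuse-report case above (the same text arriving at two unrelated addresses) would flag the second delivery, even though nothing in either copy looks unusual on its own.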

My personal belief is that #2 is a bad idea. Content filtering is good when your goal is to filter content. However it's a lousy way to identify spam, because you basically spend all your time trying to figure out what the spammers are selling, and trying to distinguish how they sell it, from how legit mailers sell it. Right now this isn't as hard as it might be, because the two groups tend to sell different things. But as the two converge, and as spammers continue to actively evade countermeasures, it will get harder and harder.

I prefer routing information because it's based on the assumption that spammers either a) have a safe-haven and can be blocked, or b) are trying to hide where they are coming from. Detect the lies, and you know it's spam. When you are looking for lies in the routing information you spend most of your time dealing with poorly configured legitimate mail servers. It's much easier to deal with problems caused by mistakes than trying to deal with people who are actively trying to fool you.

* I don't mean a simplistic examination of the Received: headers here. But this isn't the place to go into the details.

--
Kee Hinckley
http://www.puremessaging.com/        Junk-Free Email Filtering
http://commons.somewhere.com/buzz/   Writings on Technology and Society

I'm not sure which upsets me more: that people are so unwilling to accept
responsibility for their own actions, or that they are so eager to regulate
everyone else's.