Re: bouncing emails without real names

At 21:14 2003-03-01 -0800, Ben Crowell did say:

Does anyone have a good recipe for bouncing e-mails from people who don'tgive their real name in the From: line?

_real_ name ? Like how do you suggest anyone determine what is real ornot? For that matter, often the email address itself is their name, sothere's no need to have a text version of it (for instance, for personalcorrespondance, I often use firstname(_dot_)lastname(_at_)domain(_dot_)tld).

I'm a teacher, and I'm tired of nagging my students not to send
me e-mails from addresses like ppkaka(_at_)aol(_dot_)com (no, I'm not
making that one up!) with no real name listed, and
expect me to know who they are.

Some people _sign_ their emails. You could simply elect to post a policyof refusing to respond to messages which fail to provide any form ofpersonal identification (be it in the From:, or in the body).

You should try to keep in mind that many people don't have control of howtheir email provider generates the From: - either because they use afreemail account which doesn't bear their own name, or have a sharedaccount at home, or their corporate mail system lacks this configuration atthe user level.

Whatever the reason, I think it is rather presumptuous to declare thatpeople can't send you mail unless their name appears in some syntax _you_recognize - you're likely to hammer some hapless user who _does_ have theirname in the From:, but just in some format you overlooked.

It is far easier, and much less prone to errors, to simply have thestudents _register_ their email address(es) with their names with you whentaking the course, and you rewrite the From: line to include their studentname (and possibly, other identifying info - a student ID number, orwhaever is necessary for you to better categorize them - esp. if this mightbe a course where students are submitting homework assignments viaemail). This extended id mechanism could be used to file awaycorrespondance in folders (or an SQL database), and/or update entries in adatabase indicating that the user submitted something on time or late orwhatever as per the due time for an assignment (though verifying that themessage actually _contained_ the assignment material is a different matter- however, at least you'd have a student reference value to key off of).

When I was trying to test my code, I made up a yahoo.com address so Icould send e-mail to myself. (Didn't want to test by sending mail from mynormal address to my normal address, since that could create a loop.)

Suggestion: use a sandbox. You don't even need to _send_ yourself messagesto test the operation, and you don't _actually_ send the autoreplies - theyrun through a "sendmail" redefinition and get dumped to a file, allowingyou to review what _would_have_ been sent to sendmail.

As for loops, if you use X-Loop along with logging, you could manage toensure that you don't loop on yourself. If you're going to sendautoreplies, you'd better use X-Loop in any event.


studentaddrfile is simply "email_address<space>(parenthesized name)", like so:

bcrowell99(_at_)yahoo(_dot_)com (Ben Crowell)
pooftah(_at_)sexfreak(_dot_)org (Some Inconsiderate Lout)

Students register their email address with you for the class (possiblythrough a webpage which generates the above file, if it is too much for youto transcribe it once a semester), and you might also whitelist theiraddress (so it doesn't get thrashed as junkmail, say because you have spamfilters as well), AND you have a list of the addresses along with studentnames (as above).


Off the top of my head (i.e. not necessarily as optimal as it could be):

:0
* some condition for "addressed to teacher"
{

        # extract the From: field AS AN ADDRESS COMPONENT ONLY
        FROM=`formail -IReply-To: -rtzxTo:`

        # Now, grab the address from our registered address file
        # anchored to the beginning of line, AND with a space trailing
        # the address, so it should be expected to match a _complete_ address
        # component, and not be tricked by a substring of one
        # ("user(_at_)domain(_dot_)com" being a substring of
        # "someuser(_at_)domain(_dot_)company(_dot_)com" for instance)
        NEWFROM=`grep ^"${FROM} " studentaddrfile`

        # if NEWFROM isn't empty, rewrite the From: line.
        # This rewrites the From: line to your new version (including the
        # student name) but retains the original From: as Old-From:
        :0f
        * ! NEWFROM ?? ^^^^
        | formail -i "From: ${NEWFROM}"

        # optional, do something if the address ISN'T in our DB.
        # (this doesn't mean that their From: doesn't contain desired info,
        # just that they're not registered with the From: addr).
        #:0E
        #| do_something_actions
}

If you have no such "to teacher" condition (say, a plussed address, or aseparate address within a virtual domain), remove the outer condition andbracing.

One would want to structure things so that this kind of bounce happens
only to e-mails we're pretty sure are not spam.

Whitelisting the _registered_ student addresses would be a good way to dothat - however, in doing so, you deftly eliminate the problem since you canrewrite the From: as demonstrated above. Thus, no need to bounce anything,no risk of bouncing messages to people who really shouldn't be assaultedbecause you don't like the addressing of their messages, which they may notbe able to directly control.


BTW, _YOUR_ yahoo message arrived with an _unquoted_ name:

        Ben Crowell <bcrowell99(_at_)yahoo(_dot_)com>
vs.
        "Ben Crowell" <bcrowell99(_at_)yahoo(_dot_)com>

---
 Sean B. Straw / Professional Software Engineering

 Procmail disclaimer: <http://www.professional.org/procmail/disclaimer.html>
 Please DO NOT carbon me on list replies.  I'll get my copy from the list.


_______________________________________________
procmail mailing list
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail