Consent to Receive Mechanism

Phillip M Hallam-Baker

VeriSign Inc.

Abstract

We propose a mechanism Consent To Receive that allows an email recipient to provide verifiable ‘opt-in’ consent to receive email. Consent to Receive may be used to automatically whitelist email from mailing lists and email forwarded from another email server.

Consent to Receive is designed to require minimal state maintenance by both the email sender and the recipient and to be deployable with minimal impact on existing email infrastructure.

Requirements

The Consent to Receive mechanism provides a verifiable and revocable proof of consent to receive email from a specified source.

Subscription Consent Problem

The SMTP protocol does not provide a means to allow a mail server to determine whether a mailing list message was solicited or not.

Reputation Attacks

An SMTP mail server may be falsely accused of having sent messages to a non-subscriber.

As the situation currently stands there is no way to determine the truth or falsehood of such allegations. A party may even sign up for a newsletter with opposing views so as to be able to complain about the messages sent and hope to cause the mailing list to be put on a blacklist.

Events of this type have occurred in connection with mailing lists of every political persuasion. When complaints are made about a blacklisting they are typically met with further hearsay accusations that the accused was ‘notorious’.

Unsubscribe Problem

Mailing list users often find it difficult to unsubscribe. In many cases requests to be unsubscribed are sent to the entire list.

The un-subscription problem can be a serious problem when an intermediary such as a mailing list gateway is involved.

Abandoned Subscription Problem

Users often fail to unsubscribe from mailing lists, causing huge volumes of mail to accumulate unread in an unused account or an unread mail folder.

Often a mail user will subscribe to a mailing list on a speculative basis and find that they do not read the list often.

Abandoned List Problem

Mailing lists are often created and then abandoned after a period of time after falling into disuse. This can create serious problems if a spammer after discovers an abandoned list without an administrator.

Maintenance Problem

The management of a high volume mailing list requires a considerable amount of effort.

Deployment Constraints

SMTP Deployment Limitations

Many SMTP servers are poorly configured and perform arbitrary and in many cases capricious modifications to messages as they are transmitted.

SMTP Client Limitations

Adding features to SMTP clients is a slow process.

Separate Servers for Incoming and Outgoing Mail

Enterprises are increasingly using separate mail servers for incoming and outgoing mail. Even if the end user interacts with a single server it is likely that incoming mail will be pre-processed by some form of mail proxy, particularly if anti-spam filtering is being performed.

Network Protocol Access Limitations

Many ISPs limit or block completely use of certain Internet protocols. These blocks may be imposed for a variety of reasons that include preventing spam, service differentiation and caprice.

Alternative Approaches

The proposals in this paper are designed to communicate consent within a ‘push’ distribution protocol in which the transfer of the message to the recipient is initiated by the originator of the message. Several authors have suggested the replacement of the push mechanism of SMTP with a ‘pull’ protocol in which the subscriber initiates message transfer as a means of mitigating the spam problems.

A pull mode protocol provides the subscriber with direct control over their subscription. The client only presents messages to the user from sources that the user has subscribed to. The chief disadvantage of a pull mode protocol is that it requires the client to continuously poll the service to check for updates.

Network News Transfer Protocol

Network News Transfer Protocol (NNTP) was originally designed as a means of improving mailing list management.

Individual users pull articles from a local news server that subscribes to one or more of the newsgroups in the Usenet hierarchy or one of a large number of private hierarchies. Each NNTP news server is a peer and articles are exchanged between news servers by means of a simplistic flood fill protocol.

The chief disadvantage of presenting NNTP as a spam solution is its vulnerability to spam when operated in peer-to-peer mode. The term spam was originally applied to a junk posting on Usenet which appeared many years before email spam became common, let alone a significant problem.

Although the NNTP protocol does not mandate the use of peer-to-peer exchanges custom and practice strongly discourages use as a direct publication mechanism. While this mode of use is very efficient for high traffic newsgroups with a wide audience it is exceptionally inefficient for newsgroups of limited interest.

Really Simple Syndication

Really Simple Syndication (RSS) is an XML data structure that allows news sites and Web-log ‘blog’ sites to provide notification of updates to the site.

RSS allows publishers to support rich user interfaces with formatted text, sound and video.

The chief disadvantage of RSS is that it is an emerging protocol and RSS capable clients are currently very rare. While RSS may play a significant role in the future of Internet based subscription publishing it is unlikely to be effective as a near term spam solution.

Modifications to Email

The chief deficiency in the email protocols with respect to the requirements is that an incoming mail server has no means of determining whether a recipient did legitimately consent to opt-in.

Existing Features Insufficiently Observed

Many of the problems with mailing lists could be addressed by simply deploying the existing proposals in [RFC2369]. These provide a mechanism that allows mailing lists to use URIs to specify how users can perform functions unsubscribe from a list.

A header of particular interest in this specification is the Mailing-List header that uniquely identifies a mailing list.

Communicating Consent to an Email Server

There are a variety of means by which consent may be communicated to an email server. For example the end user could digitally sign a consent form that would accompany each message sent. Such a mechanism would incur a major overhead since every message sent would significantly increase in size. Alternatively, the consent form could be uploaded to the incoming mail server. This would require the mail server to store a considerable amount of state.

We propose a mechanism that minimizes the need for state maintenance at each stage in the mail transfer protocol, avoiding the need for additional per user records to be stored at either the incoming or outgoing servers.

Initialization

The incoming mail server establishes a shared secret k. In the case that there are multiple incoming servers the shared secret may be established manually or by means of some form of key agreement protocol using public key based credentials.

Once established, the shared secret is maintained for an extended time period such as a year. Incoming mail servers must keep a record of all shared secrets that have ever been used and the validity intervals in which they were used.

Confirmation Code

During the subscription process we establish a confirmation code that is cryptographically bound to the mailing list name, the recipient email address and the date of subscription by means of the shared secret:

Code = MAC (mailing-list, recipient, date)

Where

list
The string the mailing list will use in the Mailing-List header

recipient
The email address of the recipient

date
The day on which the subscription took place.

Each message sent from the mailing list contains a Confirmation-Code header as follows:

Confirmation-Code: <date> <confirmation>

The mailing list already needs to maintain a per-subscriber record of mailing addresses. The additional state required to support the confirmation code protocol is negligible.

We require the subscription date to be stored explicitly for two reasons, first it requires the mailing list administrator to take notice of the age of subscriptions, secondly it allows the incoming mail server to reject mail with a stale confirmation code that is many years old. This allows a mail server to reject mail sent to a mailing list that a previous holder of an account name subscribed to.

The incoming mail server can authenticate a confirmation code (but not the attached message) by means of the shared secret. Note that it is not necessary for the MAC algorithm to be standardized, it is sufficient for the sender and receiver to use the same one.

Subscription Process

The confirmation code is generated during the subscription process and communicated to the mailing list server.

We assume that any subscription process involves a confirmation process by means of an email callback loop challenge. The incoming mail server detects a mailing list request for subscription confirmation and causes it to be redirected so that the appropriate confirmation code can be added.

The callback loop challenge contains a new header constructed as follows:

Challenge-Code: <mailing-list> <recipient> <opaque>

Where

mailing-list
The string the mailing list will use in the Mailing-List header

recipient
The email address of the recipient

opaque
An opaque code defined by the mailing list confirmation server to be used for confirmation purposes.

If the user responds to the confirmation request the appropriate challenge code is generated and forwarded to the indicated address along with the opaque code to establish that the user did intend to subscribe.

Legacy Subscriptions

The confirmation code protocol must support gradual introduction. It must be possible for a mailing list to deploy the protocol without having to re-subscribe existing users. It would be advantageous if this can be achieved in such a way that allows a mailing list administrator to be able to deploy the protocol incrementally and still be able to establish that unsolicited messages are never sent.

When a mailing list server sends a message to a legacy subscriber the confirmation-code header is still present but only the date field is filled, the confirmation field is absent. This alerts the incoming server to the fact that the message is purported to be a legacy subscription.

If the incoming server supports the confirmation code protocol it may query the user to determine whether the subscription is actually authorized and if so provide the relevant confirmation code to the mailing list server to avoid the need for subsequent authorizations.

A similar procedure may be employed when the confirmation code protocol is deployed on an existing incoming mail server. In this case it is recommended that the service provide some mechanism to allow the user to send confirmation codes to a group of mailing lists at the same time.

Mailing list administrators may defend themselves against malicious allegations by having a copy of the mailing list signed by a digital notary at the time the protocol is deployed. The signature format may be chosen in such a way that validity of each entry in the list may be determined independently without revealing any information about any other list member. Various schemes suggest themselves including use of Merkle hash trees or the XML Signature manifest object. It is recommended that any mechanism chosen use a random mask value within each entry to prevent attackers from finding out that a specified party has subscribed to a list.

This information could be made available through some form of protocol, however it is likely that requiring existing users to re-subscribe will prove more attractive.

Revocation of Confirmation Codes

The mechanism described only provides for the authenticity of subscription requests to be established. No assurance is provided for un-subscription requests, nor is the protocol easily modified to achieve this directly.

The confirmation code is cryptographically bound to the mailing list identifier. An incoming mail server or spam filtering application can filter incoming mail on the basis of the mailing list identifier. Although this requires the service to maintain state the overhead is minimal and saves considerable resources in the long run. A mailing list may choose not to observe requests to unsubscribe but there is no incentive to do so if the messages are unlikely to be read.

Security Considerations

Replay Attack

The consent to receive mechanism provides proof that the recipient of an email message has consented to receive messages from a specified source. It does not provide definitive proof that a particular message originates from the source specified.

An attacker that obtains a consent token for a particular recipient/sender combination can generate an arbitrary volume of messages containing the token.

This vulnerability is unlikely to represent a significant risk in the context of spam mitigation. It is highly unlikely that a spammer could obtain a sufficiently large number of consent tokens to make bulk distribution of spam through this mechanism feasible.

The vulnerability may be eliminated through use of the consent token mechanism in combination with an authentication mechanism.

References

[RFC2369] http://www.ietf.org/rfc/rfc2369.txt

[ListSpec] http://www.nisto.com/listspec/

[RSS] http://blogs.law.harvard.edu/tech/rss