Re: X-* header fields (Was: Getting 2822 to Draft)


Pete Resnick wrote:

On 1/4/04 at 2:54 AM -0500, Bruce Lilly wrote:
Pete Resnick wrote:
The only difference then between 822 and 2822 in this respect is that 822 gives publication guidelines for extensions where 2822 does not. This has absolutely *no* effect on implementations of the protocol.
There is a minor effect. A user-defined field (per 822 definition) can be recognized as such by examining the first two octets of the field name, which is quite efficient.
But *why* would an implementation care about the publication guidelines for the field? That's the *only* thing "X-" tells you.

An implementation *might* be written so as to hand off parsing of X- fields to a separate procedure, or ignore them entirely. And the ability to efficiently bypass checking against the 100+ standardized field names *might* be considered important.
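As a rough illustration of that bypass, here is a minimal sketch in Python. The `STANDARD_FIELDS` set and the `classify` function are hypothetical names made up for this example; the set is a tiny stand-in for a real table of standardized field names, not an actual registry.

```python
# Hypothetical stand-in for the table of 100+ standardized field names.
STANDARD_FIELDS = frozenset({"from", "to", "subject", "date", "list-id"})


def classify(field_name: str) -> str:
    """Classify a header field name as user-defined, standard, or unrecognized."""
    # RFC 822 user-defined fields start with "X-". Field names are
    # case-insensitive, so compare the first two characters in lower case.
    if field_name[:2].lower() == "x-":
        return "user-defined"  # early return: STANDARD_FIELDS is never consulted
    if field_name.lower() in STANDARD_FIELDS:
        return "standard"
    return "unrecognized"
```

An implementation that wants to handle selected X- fields itself would need its own check before the early return, which gives up part of the shortcut.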

[my implementation does hand off most X- fields (one notable exception being X-Accept-Language) and unrecognized fields (in this case anything not standardized in an RFC or an Internet Draft that seems likely to become an RFC) to optional user-supplied functions. Separate functions may be specified for unrecognized X- (user-defined) fields and for unrecognized extension fields (or the application can point to a single function). Were it not for handling X-Accept-Language, I might have elected to bypass lookup for standard field names (via gperf) in the case of user-defined fields.]

"X-" serves to differentiate user-defined fields from non-defined fields (i.e. those field names for which there is no IETF published definition or which the implementation does not recognize).
Again, what use is there in making that distinction? Do you think the fact that "X-Priority" and "X-Face" start with "X-" means that you shouldn't support them in your implementation or they don't have well-defined syntax? What about "X-Sender"?

I don't support them because I haven't found a specification for them in any RFC.

Is "List-ID" a non-defined field if your implementation doesn't recognize it? What useful purpose is there in differentiating fields which start with "X-"?

I do recognize and support List-ID (RFC 2919). Flexibility is one reason to differentiate; because RFC 822 made the distinction, it is conceivable that some message-processing application may wish to treat user-defined and non-standard extension fields differently. By providing for separate function pointers for the two cases, an application using the mparse library can either treat them differently (using distinct function pointers) or treat them identically (by providing two pointers to a single function).
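The two-callback arrangement can be sketched roughly as follows. This is not the mparse API; `dispatch`, `known`, `on_user_defined`, and `on_extension` are hypothetical names chosen for illustration, with Python callables standing in for C function pointers.

```python
from typing import Callable, Dict, Optional

# A handler receives the field name and the unparsed field body.
Handler = Callable[[str, str], None]


def dispatch(
    name: str,
    body: str,
    known: Dict[str, Handler],
    on_user_defined: Optional[Handler] = None,
    on_extension: Optional[Handler] = None,
) -> bool:
    """Route one header field to a handler; return True if one was called."""
    handler = known.get(name.lower())
    if handler is None:
        # Unrecognized field: pick the callback by the "X-" prefix.
        if name[:2].lower() == "x-":
            handler = on_user_defined
        else:
            handler = on_extension
    if handler is None:
        return False  # no callback supplied, so the field is ignored
    handler(name, body)
    return True
```

Passing the same callable for both `on_user_defined` and `on_extension` treats the two kinds of unrecognized field identically; passing different callables (or `None` for one of them) treats them differently.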

In order to be at all useful to implementors, registration would have to be contingent upon the existence of a stable, formal, public definition of the proposed field's syntax (with ABNF) and semantics.
Nonsense. It is useful to an implementation (especially an implementation that generates fields) to know that a field exists for a particular purpose even if its syntax and semantics have not undergone extensive review and comment or are still under development. It is by implementation that fields get stable and then they can be documented as standards.


Example:

somebody registers a "Foo" field, but provides no public syntax or semantics. As an implementor, what am I supposed to do about "Foo"?

It's difficult to see how such problems are either unique to X- or are more of a hindrance to interoperability than for other fields. As an example, consider "Status" which was in "private" use (I believe) by BSD "mailx" decades ago, and which is currently in use by several other MUAs, and which *does* leak out; there is also a formal definition of a "Status" field -- incompatible with the private usage -- defined as one of the delivery status notification fields (RFC 3464). So neither leakage nor incompatibility seem to be unique to X- fields. Note that if BSD mailx' author(s) had used X-Status for private use, there would be no conflict with the formal DSN Status field.
First of all, the "Status" field of DSN is not defined to appear in a top-level header of a [2]822 message, so there is no "incompatibility" between the two. But let's talk about the leakage: Yes, the "Status" field leaks. Now, what can be done about that? Well, it could be documented in an RFC so that everyone could know what it means. And if it turned out useful, it could be a standardized top-level header field. But what if mailx's author had instead used "X-Status"? In that case, it's DOA, because by definition it could not be documented in a standard way. So there is no way for an implementor to figure out what "X-Status" means other than by word of mouth. That invites incompatibility. So what exactly would that "X-" have gained you?

My implementation -- and I believe that it is not unique in this respect -- parses a field with a given name depending on that name and not specifically on its context. The incompatibility is that the DSN Status field has a specific syntax for the field body (viz. three dot-separated numbers) which is not matched by the BSD et al usage of Status (a string of alphabetic characters). A single field name with two different syntax definitions (and different semantics) would IMO be a bad thing, even if theoretically they could be differentiated by context; header fields do occasionally end up in message bodies due to some software inserting an empty line, and that empty line may cause top-level message headers moved into the body to appear to be MIME-part fields (i.e. if the message header still contains an appropriate Content-Type field). Mailx uses Status to store state metadata about the mail store, which is a private use. X-Status could be documented (e.g. via an Informational RFC), but not registered with IANA as an extension field, though *as* private use, there would be no reason to document it (given that X- guarantees no collision with any future registered extension field). The only reason for an implementor to care about a hypothetical X-Status field is if said implementor had reason to interact with a message store on a system that also used mailx with that message store -- that's a storage issue, not a message format or transmission issue and therefore an issue outside of the scope of IETF. So X-Status would not "invite incompatibility" in any sense in which it matters to IETF.
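To make the syntax clash concrete, here is a small sketch of what a name-keyed parser faces when it sees a Status field body. The function name `classify_status_body` and its return labels are hypothetical; the DSN pattern follows the three dot-separated numbers of the RFC 3464 status-code, and the mailx-style pattern simply assumes "a string of alphabetic characters" as described above. Comments permitted in the DSN field body are not handled.

```python
import re

# RFC 3464 status-code: DIGIT "." 1*3DIGIT "." 1*3DIGIT
DSN_STATUS = re.compile(r"\d\.\d{1,3}\.\d{1,3}")
# Assumption for illustration: the private usage is a run of letters, e.g. "RO".
MAILX_STATUS = re.compile(r"[A-Za-z]+")


def classify_status_body(body: str) -> str:
    """Guess which 'Status' syntax a field body follows."""
    text = body.strip()
    if DSN_STATUS.fullmatch(text):
        return "dsn"
    if MAILX_STATUS.fullmatch(text):
        return "mailx-style"
    return "unknown"
```

A parser selected purely by field name has to carry both grammars, or guess as above; a distinct name for the private usage would avoid the guess.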

There is one very good reason to use X- for private or experimental use, viz. interoperability. Use of X- as a field name prefix for private or experimental use guarantees that there will be no conflict with a formal field name. Use of other names can lead to conflicts
No it doesn't. Some "X-" field names are now just as formal as some non-"X-" field names.[...]

"[J]ust as formal"? Really? Which X- field names are defined in Standards-track RFCs? For that matter which X- field names are defined in *any* RFC? Or by "non-'X-' field names" do you mean something other than registered extension field names?

An issue not mentioned above is migration of X- fields to a formal definition following successful experimental use. That obviously entails a name change. However name changes are not uncommon, e.g. refer to the MIXER RFCs, which define a number of fields whose names have changed. That merely means that parsers need to recognize one name as a synonym for another.
But what about generators? Because there will be parsers out there that will only interpret the "X-" form of the field, the generators must continue to send the "X-" form. Furthermore, updating some parsers is non-trivial.

By the same argument, generators will have to generate multiple versions with a single non-X- name as syntax changes during experimentation. Presumably those participating in an experiment have a vested interest in moving forward with an official implementation at the end of the period of experimentation.

Whether or not some parsers are difficult to update is irrelevant to the issue; if a parser can be updated to handle Obsoletes/Supersedes, Expiry-Date/Expires, Content-Identifier/X400-Content-Identifier, etc., then it can be updated to handle X-Accept-Language/Accept-Language. Conversely if it cannot be updated to handle X-Accept-Language/Accept-Language, then presumably it cannot handle the other (standard) fields whose field names have been changed.
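Recognizing one name as a synonym for another can be as small as a lookup table consulted before dispatch. The sketch below is only illustrative: `SYNONYMS` and `canonical_name` are hypothetical names, and treating the first name of each pair above as the older form that maps to the second is an assumption made for the example.

```python
# Older or experimental name (lower-cased) -> name the parser handles.
# The direction of each mapping is an assumption for illustration.
SYNONYMS = {
    "obsoletes": "Supersedes",
    "expiry-date": "Expires",
    "content-identifier": "X400-Content-Identifier",
    "x-accept-language": "Accept-Language",
}


def canonical_name(field_name: str) -> str:
    """Return the name to parse under; unknown names pass through unchanged."""
    return SYNONYMS.get(field_name.lower(), field_name)
```

For example, `canonical_name("X-Accept-Language")` yields `"Accept-Language"`, so the X- form and the renamed form reach the same parsing routine.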
