Re: Getting RFC 2047 encoding right

The basic answer is that what you do with illegal input is generallynot specified - but clearly you aren't expected to make the subjectof the reply match the subject of the message being replied to inthat case.
Right.
What I cannot see is how to make something reasonable, correct andfairly simple.
In most cases I have code that is right when the input is good, andnot wrong when the input is bad. RFC 2047 just doesn't seem to makethat simple.

Well for untagged text basically you just have to guess the charset.ISO-2022-* and UTF-8 can be distinguished from other charsets simplyand fairly reliably, and you can make guesses at some of the othersusing heuristics. It's difficult to tune the heuristics, and subjectlines are too brief for them to work really well. But I really don'tsee how RFC 2047 makes determining the charset label of untagged textany worse than it inherently is.

Suppose I want to answer with "subject: re: <original> <ticketid>", then I risk having two encoded-words separated only bywhitespace, and must do magic in order to preserve that space.
why not just use an ASCII ticket id?
Why should I make "always ASCII" a requirement for that case, in codethat otherwise allows all of Unicode?

For the same reason that you should probably avoid using some forms ofemail addresses even though they are perfectly valid - such as "Keith\"Mr. Cynic\" Moore"@cs.utk.edu - corner cases that are seldom seenoften fail in practice.

If you want to be entirely reliable your code to detect ticket-ids hasto be able to find them whether or not they're embedded inencoded-words. And it's not as if you can't put a ticket-id into anencoded-word, though (as you point out) you might have to encode %20 asthe first character of that encoded-word.

<Prev in Thread]	Current Thread	[Next in Thread>
Re: Getting RFC 2047 encoding right, (continued) Re: Getting RFC 2047 encoding right, Arnt Gulbrandsen Re: Getting RFC 2047 encoding right, Keith Moore Re: Getting RFC 2047 encoding right, Arnt Gulbrandsen Re: Getting RFC 2047 encoding right, Keith Moore Re: Getting RFC 2047 encoding right, Arnt Gulbrandsen Re: Getting RFC 2047 encoding right, Keith Moore Re: Getting RFC 2047 encoding right, Michael Bell Re: Getting RFC 2047 encoding right, Keith Moore Re: Getting RFC 2047 encoding right, Charles Lindsey Re: Getting RFC 2047 encoding right, Arnt Gulbrandsen Re: Getting RFC 2047 encoding right, Keith Moore <=