ietf-822

Dealing with invalid UTF-8

2003-09-11 02:38:28

Hi,

this morning I received a single-part mail message looking like this:

Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
...
b2VrcmlzIEVuZ2luZWVyaW5nLCB0ZWNobmljYWwgZGlzY3Vzc2lvbiBtYWlsaW5nIGxpc3QNCgk+
IFt1bl1zdWJzY3JpYmU6IGh0dHA6Ly9saXN0cy5zb2VrcmlzLmNvbS9tYWlsbWFuL2xpc3RpbmZv
L3NvZWtyaXMtdGVjaA0KCT4NCgkNCgkNCg0K
_____________________________________________________________________
Soekris Engineering, technical discussion mailing list
[un]subscribe: http://lists.soekris.com/mailman/listinfo/soekris-tech

What's considered the best approach for dealing with something like this? ("Complain to GNU Mailman maintainer", while doubtless satisfying to some, is not my thing. I suppose I'm growing old.)

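RFC 2045 (section 6.8) states that "Any characters outside of the base64 alphabet are to be ignored in base64-encoded data". Here that approach leads to appended garbage: most of the plain-text footer consists of letters that are themselves in the base64 alphabet, so they get decoded along with the real data. Stopping decoding as soon as an illegal character is seen works in this case, but might it lead to truncation in other error cases?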
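
To make the two options concrete, here is a rough Python sketch of both. It is only an illustration under some assumptions: the function names are made up for this example, whitespace is treated as harmless, "=" is treated like any other non-alphabet character, and the input is just the last base64 line of the message above followed by the appended footer.

```python
import base64
import string

# The 64 data characters of the base64 alphabet ("=" is padding, not data).
B64_ALPHABET = set(string.ascii_letters + string.digits + "+/")


def _decode_clean(chars: str) -> bytes:
    """Decode a string that contains only base64 alphabet characters."""
    if len(chars) % 4 == 1:
        # A single leftover character carries fewer than 8 bits; drop it.
        chars = chars[:-1]
    chars += "=" * (-len(chars) % 4)  # restore padding for a short final group
    return base64.b64decode(chars)


def decode_ignoring(text: str) -> bytes:
    """Option 1: silently drop every character outside the alphabet."""
    return _decode_clean("".join(ch for ch in text if ch in B64_ALPHABET))


def decode_until_illegal(text: str) -> bytes:
    """Option 2: skip whitespace, stop at the first other illegal character."""
    kept = []
    for ch in text:
        if ch in B64_ALPHABET:
            kept.append(ch)
        elif ch in " \t\r\n":
            continue  # line breaks inside the body are expected
        else:
            break  # e.g. the "_" that starts the footer
    return _decode_clean("".join(kept))


# Last base64 line of the message, followed by the plain-text footer.
BODY_TAIL = "L3NvZWtyaXMtdGVjaA0KCT4NCgkNCgkNCg0K"
FOOTER = (
    "_" * 69 + "\r\n"
    "Soekris Engineering, technical discussion mailing list\r\n"
    "[un]subscribe: http://lists.soekris.com/mailman/listinfo/soekris-tech\r\n"
)
message = BODY_TAIL + "\r\n" + FOOTER

lenient = decode_ignoring(message)
strict = decode_until_illegal(message)

print(repr(strict))   # only the genuinely encoded bytes
print(repr(lenient))  # the same bytes, followed by decoded footer garbage
```

The second function only works here because the footer happens to start with a character outside the alphabet. A footer that started with ordinary letters would be decoded until the first space-free illegal character turned up, and a stray illegal character in the middle of otherwise good data would cut the message short, which is exactly the truncation worry.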

Comments? Advice?

--Arnt

