Arnt Gulbrandsen wrote:
Bruce Lilly writes:
Aside from font issues, Unicode normalization uses huge tables which might not be practical for some devices which do support email. Therefore transcoding a space- and memory-efficient charset into Unicode may well render the transformed message unusable or unreplyable if transferred (forwarded, etc.) to some devices [it is not unusual for mail to be forwarded to PDAs, mobile phones, pagers, etc.].
Some chap at a Unicode conference talked about a trie-based
implementation that squeezed the tables into 35k. I believe he was from
Psion.
35k isn't huge, not even on a handheld.
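(For reference, here is what those tables buy you on a platform that does ship normalization data. This is a hedged sketch using the standard java.text.Normalizer API, which is not assumed to exist on the handhelds discussed here; it just illustrates what NFC composition does:)

```java
import java.text.Normalizer;

public class NormalizeDemo {
    public static void main(String[] args) {
        // "e" followed by a combining acute accent (U+0301): two code points.
        String decomposed = "e\u0301";
        // NFC composes them into the single precomposed code point U+00E9.
        String composed = Normalizer.normalize(decomposed, Normalizer.Form.NFC);
        System.out.println(composed.length());          // 1
        System.out.println(composed.equals("\u00E9"));  // true
    }
}
```

The composition mappings behind that one call are exactly the table data being squeezed into 35k.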
For the Danger Hiptop (T-Mobile Sidekick) we represent all mail text and
headers on the device as UTF-8 for storage and network transmission and
UTF-16 for display. We do not have normalization tables on the device
and so far have not missed them. The tables we do have are for
character classes (since that data is required to be Java compliant) and
for sorting. We cheat a little on the sorting by not having collation
data for characters that are not in any of our fonts.
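The storage/display split above can be sketched in Java (the Hiptop's application language). A Java String is already a sequence of UTF-16 code units, so the only explicit step is the UTF-8 byte conversion for storage and transmission; the names below are illustrative, not our actual API:

```java
import java.nio.charset.StandardCharsets;

public class MailText {
    // Store and transmit mail text as compact UTF-8 bytes.
    static byte[] toStorage(String text) {
        return text.getBytes(StandardCharsets.UTF_8);
    }

    // Decode back to a String, whose chars are UTF-16 code units for display.
    static String forDisplay(byte[] stored) {
        return new String(stored, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String msg = "Gr\u00fc\u00dfe";       // non-ASCII mail text ("Grüße")
        byte[] utf8 = toStorage(msg);
        System.out.println(utf8.length);      // 7: ü and ß each take 2 bytes
        System.out.println(forDisplay(utf8).equals(msg));  // true
    }
}
```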
The only real downside I've found to not keeping the original encodings
on the device is that there is no way to reencode text whose charset was
mislabeled without refetching it over the network. But in practice I
see a lot more text that is completely unlabeled or systematically mislabeled (claiming to be ISO-8859-1 instead of windows-1252, for instance) than text that claims to be one thing but is actually something substantially different.
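The ISO-8859-1 / windows-1252 confusion is systematic because the two differ only in the 0x80-0x9F range: C1 control characters in ISO-8859-1, printable punctuation in windows-1252. A sketch of taking the label at face value versus applying the usual substitution:

```java
import java.nio.charset.Charset;

public class MislabelDemo {
    public static void main(String[] args) {
        // Bytes a Windows mailer might send while labeling them ISO-8859-1:
        // 0x93 and 0x94 are curly double quotes in windows-1252.
        byte[] body = { (byte) 0x93, 'h', 'i', (byte) 0x94 };

        // Believing the label yields C1 control characters...
        String literal = new String(body, Charset.forName("ISO-8859-1"));
        // ...while decoding as windows-1252 recovers the intended text.
        String fixed = new String(body, Charset.forName("windows-1252"));

        System.out.println((int) literal.charAt(0));  // 147 (U+0093, a control char)
        System.out.println(fixed);                    // curly-quoted "hi"
    }
}
```

Once only the decoded UTF-16/UTF-8 form is kept on the device, the original bytes needed for this kind of re-decode are gone, which is the downside described above.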
Eric