ietf
[Top] [All Lists]

Re: I-D file formats and internationalization

2005-12-01 11:04:45

On Dec 1, 2005, at 8:34 AM, Paul Hoffman wrote:
The suggestion of the HTML escape would ensure readability.

Fully disagree. Listing an author as Patrik Fältström is not "readable".

The suggestion was for an alternate field in the XML input file to contain non-ASCII versions of authors and titles of references used to augment the ASCII version when the output form allows. The idea of a escape mechanism was to allow the definition of non-ASCII characters elsewhere as perhaps could be needed to clarify protocol related issues.

It
would allow an alternative display that remains compatible with an ASCII
limitation as the authoritative version.

It would mix escaped HTML into text/plain documents. It would also make RFCs that talk about HTML extremely difficult to read, because the reader would not know if entities in examples are supposed to be the escaped or unescaped versions.

This would require a convention with respect to what gets converted. How often are numeric HTML Unicode characters used? If this appears to be problematic, then simply retain the escape sequence in all output forms.


UTF-8 use would require
additional considerations regarding searching however.

Please list those; they would be valuable for the Internet Draft.

You talk about some of the issues in your draft. Even when just ASCII is used, there is difficulty discerning differences between characters, where one's ability is largely determined by familiarity with a particular font style. Cyrillic could be an example of there being more than one character-repertoire that may be used. Going from 94+ characters to thousands is obviously more of a challenge for those wishing to pose a search.

Perhaps within a few years, as Unicode becomes more ubiquitous and such issues have been resolved with better tools, resistance against full adoption may be less. In my view, the place to start would be with the ID and alternative output forms, and not the ASCII RFC. It seems regimenting where exceptions are allowed makes sense, where an authoritative ASCII document is retained for now. At some point in time, you will be right.

-Doug

_______________________________________________
Ietf mailing list
Ietf(_at_)ietf(_dot_)org
https://www1.ietf.org/mailman/listinfo/ietf