ietf-xml-mime
[Top] [All Lists]

Fw: Re: XML media types, charset, TAG findings

2004-10-07 18:49:39


Forwarded by MURATA Makoto (FAMILY Given) 
<EB2M-MRT(_at_)asahi-net(_dot_)or(_dot_)jp>
----------------------- Original Message -----------------------
From:    Chris Lilley <chris(_at_)w3(_dot_)org>
To:      Bjoern Hoehrmann <derhoermi(_at_)gmx(_dot_)net>
Date:    Thu, 7 Oct 2004 17:56:03 +0200
Subject: Re: XML media types, charset, TAG findings
----

On Thursday, October 7, 2004, 5:27:53 PM, Bjoern wrote:


BH> * Chris Lilley wrote:
Coupled with the deprecation of the text/xml and
text/xml-external-parsed-entity types (and thus insulation from the
particular encoding testrictions of text/*) we are now, in this revision
of the document, in a position to be a little stronger:

 The encoding declaration in an XML document and the charset (if
 provided) MUST be consistent.

BH> That is insufficient as it does not define what it means for these to be
BH> consistent, how implementations are required to determine whether this
BH> requirement has been met and what processors are required to do when
BH> these are determined to be inconsistent. Without a complete proposal it
BH> is most difficult to cite any reactions on this matter. I would
BH> generally support removing the often ignored complexity that the charset
BH> parameter introduces, with your proposal however, even if completely
BH> specified, I would worry that this increases the complexity rather than
BH> removing it in which case this would seem counter-productive.

This is a reasonable worry.

My preference would be to not have a redundant charset parameter, since
that would remove the ambiguity. However, I realize that people are
uncomfortable with that and thus propose this solution.

Consistent means that the encoding determined by
F Autodetection of Character Encodings (Non-Normative)
http://www.w3.org/TR/REC-xml/#sec-guessing

is either the same as the value of the charset pArameter, or the charset
parameter is not provided.

There are examples of this in the current internet draft, for various
cases including a specified encoding declaration, an absent encoding
declaration with or without assorted BOMs.



-- 
 Chris Lilley                    mailto:chris(_at_)w3(_dot_)org
 Chair, W3C SVG Working Group
 Member, W3C Technical Architecture Group

--------------------- Original Message Ends --------------------



<Prev in Thread] Current Thread [Next in Thread>