wayne wrote:
[RFC 3986 bug wrt <uric>]
do people agree that this is really an error in the RFC?
Yes, it's IMO a clear bug in the STD, but the authors refuse to
acknowledge it, I've asked them twice about it on the URI list.
Here's my history for the news+nntp URIs ("work in progress"):
<http://article.gmane.org/gmane.org.w3c.uri/613>
Message-ID: <42E466AD(_dot_)13DA(_at_)xyzzy(_dot_)claranet(_dot_)de>
Date: Mon, 25 Jul 2005 06:12:29 +0200
Subject: STD 66 questions (problems ?)
Hi, I've found two or three potential problems in RfC 3986 appendix D.2,
or I simply don't get the idea (still hunting those non-uric characters
known as "unsafe" in RfC 1738, or as <delims> and <unwise> in RfC 2396).
1 - uric, with appendix D.2 I get:
1738 XCHAR: ALNUM ! $ % & ' ( ) * + , - . / : ; = ? @ _
2396 URIC : ALNUM ! $ % & ' ( ) * + , - . / : ; = ? @ _ ~
2396 URIC : ALNUM ! $ % & ' ( ) * + , - . / : ; = ? @ _ ~
3986 URIC_D2: ALNUM $ % & + , - . / : ; = ? @ _ ~
3986 D.2 doesn't add <reserved> like 1738 and 2396. If I try to fix it
by adding <reserved> also in 3986 the result is more plausible:
2396 URIC : ALNUM ! $ % & ' ( ) * + , - . / : ; = ? @ _ ~
3986 URIC3: ALNUM ! # $ % & ' ( ) * + , - . / : ; = ? @ [ ] _ ~
1 - mark, with appendix D.2 I get:
1738 UNRESERVED1: ALNUM ! $ ' ( ) * + , - . _
2396 UNRESERVED2: ALNUM ! ' ( ) * - . _ ~
1738 SAFE_EXTRA: ! $ ' ( ) * + , - . _
2396 MARK : ! ' ( ) * - . _ ~
In other words <mark> is the same as <unreserved> excluding <alphanum>.
2396 UNRESERVED2: ALNUM ! ' ( ) * - . _ ~
3986 UNRESERVED3: ALNUM - . _ ~
2396 MARK : ! ' ( ) * - . _ ~
3986 MARK3: ! ' ( ) * - . _ ~
In 3986 D,2 it's the same old <mark>, no proper subset of <unreserved>.
IMHO it should be only "-", ".", "_", "~".
3 - nouric, determined indirectly as all VCHAR excl. the (fixed) <uric>:
1738 UNSAFE : " # % < > [ \ ] ^ ` { | } ~
2396 DELIM_UNWISE: " # % < > [ \ ] ^ ` { | }
3986 NOURIC3 : " < > \ ^ ` { | }
Is that correct ? Is it an omission in appendix D.2 ? Something like:
| delims | <"> / "<" / ">" |
| unwise | "\" / "^" / "`" / "{" / "|" / "}" |
Apparently (?) the complete set of excluded ASCII characters would be:
ugly = CTL / SP / DQUOTE / "<" / ">" / "\" / "^" / "`" / "{" / "|" / "}"
Bye, Frank
-------
Sender Policy Framework: http://www.openspf.org/
Archives at http://archives.listbox.com/spf-discuss/current/
To unsubscribe, change your address, or temporarily deactivate your
subscription,
please go to
http://v2.listbox.com/member/?listname=spf-discuss(_at_)v2(_dot_)listbox(_dot_)com