XML2RFC must die, was: Re: Two different threads

My apologies for the subject line. I'm very disappointed that thesilent majority of draft authors isn't speaking up. I can't imaginethat the vast majority of draft authors has absolutely no problemswith XML2RFC. So I'm assuming they've been ignoring the thread,hopefully the new subject line will get some of them to chime in. Ifthat doesn't happen I'll shut up and try to figure out why I have somuch trouble with something that nobody else finds difficult.


On 4 jul 2009, at 13:27, John Levine wrote:

I think it's reasonable to assume that going forward the vast majority
of users who read online documents will be able to use software that
can reformat them in various ways.  This tells me that although the
publication form has to be readable in a pinch as plain text, it's
more important that it's amenable to mechanical processing.  Tidily
formatted xml2rfc would be a reasonable candidate

No, it's not. The problem with XML2RFC formatted drafts and RFCs isthat you can't display them reasonably without using XML2RFC, andalthough XML2RFC can run on many systems in theory, in practice it'svery difficult to install and run successfully because it's written inTCL and many XML2RFC files depend on the local availability ofreferences. When those aren't present the conversion fails.

The philosophy behind XML2RFC is to encode meaning in the XML whereverpossible, rather than simply display text. There are several problemswith that:

1. It makes it hard to write source files, because now rather thantype "Experimental" at the top of the file, I have to know whatXML2RFC looks for to determine the draft's status. Same thing withboilerplate, references, etc.

2. It makes it hard to read source files for the same reason. Youcan't read an XML2RFC formatted XML file without prior knowledge andget all the information that would be displayed in the final draft/RFCformat.

3. It gets it wrong. XML2RFC "knows" that you create a name from aninitial, a period, a space and a last name. So initial "I" and lastname "Van Beijnum" becomes "I. Van Beijnum". However, XML2RFC doesn'tknow that in Dutch, certain last name prefixes are capitalized if theyappear at the beginning of the name (Van Beijnum) but not if they'rein the middle because there are first names or initials: "I. vanBeijnum".

This means that the makers of XML2RFC spent a lot of time making thetool require the authors to spend a lot of time to create somethingthat is sometimes incorrect, with no means to correct the problem. Anall-around waste of time.

Then there is the problem with XML in general. Now apparently thereare XML editors that can make sure you create syntactically correctXML without having to take care of all the details manually. But assomeone who has otherwise no need to write XML, I'm not familiar withthose tools. So I write my XML2RFC source by hand. The result is thatI invariably get error messages that the <section> and </section> tagsdon't match properly. This is a problem that is extremely hard todebug manually, especially as just grepping for "section" isn'tenough: there could be a , </middle> etc somewhere between a<section> and </section> that breaks everything.

First writing a source file and then compiling it into an output fileis no longer something something that is familiar to most people. WhenI write anything other than a draft, I can simply select "header level2" and I know that everything will be taken care of. I don't have toexplicitly tell my word processor where the text following a headerlevel 2 ends, because the presence of another header makes that clearboth to me and to the software.

What we need is the ability to write drafts with a standard issue wordprocessor. I'm sure that sentence conjured up nightmares of Worddocuments with insane formatting being mailed around cluelessbeaurocracies, but that's not what I mean. Word processors use stylesto tag headings, text, quotes, lists and so on: the exact same stuffthat you can do in XML but rather than having to think about it(especially closing all tags correctly) it happens easily,automatically and without getting in the way. (I can even change thestyle for an entire paragraph with a single menu selection or functionkey without having to find the beginnings and ends of that paragraph.)

Formatting is then based on the style tags, with all explicitformatting aplied by the word processor removed. This is standardoperating procedure in 99% of publishing. (The other 1% beingscientific/engineering books where the authors send in Latex.)

All the stuff that can't be handled by styles should just be copiedand pasted from the boilerplate, without the need for tools to knowabout the structure of these things. (At least not in the draft stage,perhaps this can be useful in the final stages of RFC editing.)

_______________________________________________
Ietf mailing list
Ietf(_at_)ietf(_dot_)org
https://www.ietf.org/mailman/listinfo/ietf

XML2RFC must die, was: Re: Two different threads - IETF Document Format