Re: Gen-ART LC/Telechat review of draft-freed-sieve-in-xml-05

The thread so far has gotten difficult to follow, so I'm going to tryto reset the conversation. I think we have been disagreeing in 2 areas.

The first is how much the document should say about how much semanticknowledge of sieve is expected for editors. On rereading the thread, Ithink we're expending a lot of energy here on something that reallyisn't that important. So I yield that point.

OTOH, I think we've highlighted some interoperability could beimportant.

First, I think we are talking past each other about applyingrequirements to sieve editors in general--that is, editors that do notimplement this spec. I never intended my comments to apply to non-XMLbased sieve editors. It would be nice if they interoperated, butthat's not what this draft is about. (For that matter, if there is anyattempt to standardize their behavior beyond a hope that they allcreate correct sieve scripts, I am not aware of it.

But I think the draft creates opportunities for failure betweenimplementations of the xml format. Let me try to make my previousquestions on work group more concrete. I see 3 interop scenariosentirely among implementations of this draft. Again, _all_ of mycomments and questions _only_ concern the set of editors andprocessors that implement this draft.

1) Interop between an arbitrary xml-based editor and an arbitrary xmlto native sieve processor - I think this is covered reasonably well,and is explicitly mentioned as a goal of the draft.

2) Interop among different xml-based editors - that is, can editor Boperate on XML created by editor A, and can A operate on the resultwithout losing data? (I am not expecting that B would be able to editor render every feature supported by A). This is not well supported bythe current draft due the fact editor B would be allowed to remove anymetadata inserted by editor A.

This could be fixed with a normative requirement to not deletemetadata elements that you don't understand, and possibly not todelete elements from xml namespaces you don't understand. Failingthat, it would help a lot to offer non-normative guidance that anymetadata or extension namespace elements are likely to go AWOL betweenediting sessions.

3) Interop among different xml to sieve conversion processors - thatis, can I take an sieve script in xml, convert it to native sieve withprocessor C and convert back to xml with processor D without losingdata. Can I do the same converting from native sieve to XML and backto sieve? This scenario is jeopardized by the SHOULD (rather thanMUST) level requirement to use the structured comment format to storemetadata.

This could be fixed by strengthening the requirement to a MUST.Failing that, it would help to include the motivation for making thisa SHOULD rather than a MUST, and guidance on the consequences of notfollowing the SHOULD.

Does the work group expect to support scenarios 2 and/or 3? If theanswer is "no", then I think the draft needs a scope or applicabilitystatement to that fact. Otherwise, I am afraid implementors willapproach this with incorrect interoperability expectations.

You mentioned that there were practical engineering considerationspreventing MUST level requirements for scenarios 2 and 3--do thoseconsiderations still apply if we limit the discussion to XML-basededitors and processors that implement this draft.?

If the answer is yes to 2 or 3, then I think the draft needs thestronger normative language, or non-normative guidance I mention above.





On Aug 18, 2009, at 7:54 PM, Ned Freed wrote:

On Aug 16, 2009, at 11:01 AM, Ned Freed wrote:
[...]
>> it would be helpful to have a sentence or two somewhere (maybe
>> in the intro) to explicitly say so. My confusion might be aroundthe
>> meaning of the term "client" in this context.
>
> No, I think your confusion is that you read a lot more into thetext
> than it
> actually says. There's a pretty big difference between "no semantic
> understanding whatsoever" and "an incomplete semanticunderstanding'.
I think the confusion is that the text says very little one way ortheother. You have assumptions in mind about the semantic knowledge ofan
editor that are not explicitly stated.
On the contrary, we have made _no_ assumptions whatsoever about it.And thedraft reflects that. You, OTOH, appear to have approached this witha set ofassumptions I for one frankly don't comprehend in your head. Perhaps- and thisis just speculation on my part - this is because, as you havestated, youhaven't done much work using XML tools. If so, then you need tounderstand thatthis document assumes considerable familiarity with XML and thetools used to
manipulate it. And given the topic of the document this is a perfectly
reasonable assumption to make IMO.
A reader that was not privy to
the process of creating this draft  may come with a different set of
assumptions, and may not draw the inferences you expect them to.
In my case, it seemed counter-intuitive that an implementer would be
willing to implement sieve semantics but unwilling to deal with the
syntax.
And this is a case in point. The purpose of this specification is toprovide ameans of representing Sieve using an alternate syntax withoutchanging any ofthe language semantics. As such, the audience is *exactly* the groupof peoplewho are "willing to implement sieve semantics but unwilling to dealwith the
syntax". (And from all indications - there are now alternative XML
representatiions for many other applications formats - this is apretty large
group.)
I have to say that approaching such a specification with the ideathat it'sentire goal is counterintuitive is a pretty good recipe forconfusion on yourpart. And I don't think any amount of clarifying prose can possiblyassist you
in dealing with such a fundamental expectation mismatch.
Your "template" comment below illustrates a case where that
makes more sense.
Again, the extent to which an editor understands and can deal withSievesemantics is largely orthogonal to the representation format. Thereare extantSieve editors that don't use the XML representation and whichunderstandessentialy no Sieve semantics at all - they are controlled byembedded commentsin special formats only, and treat the Sieve material between thecomments asopaque. Just think how easy it would be for some other Sievegeneration
facility to confuse such an editor.
>
>> Is the expectation that
>> an "editor" must be semantically aware of sieve, but a processordoes
>> not (beyond the list of "controls")?
>
> The expectation is that the amount of semantic understanding an
> editor is going
> to need will very much depend on the range of operations the editor
> is able to
> perform. Simple template-based systems will only manipulatelabelled
> blocks of
> Sieve code without any understanding of what that code does. A more
> sophisticated editor might need to have a detailed knowledge of how
> blocks in
> Sieve work, or how to build conditional expressions, or even the
> details
> sematics of various tests and actions.
That paragraph clarifies a lot. I think it would be helpful toinclude
it in the draft.
I disagree. The above paragraph might make sense to have in somesort of Sieve
usage document. It's unnecessary and distracting here.
>
>> ...
>
>> Instead of round trip "conversion", I should have said round-trip
>> "editing". My concern is, if I create a script using Editor A,then
>> later edit it with Editor B, any metadata created by Editor A is
>> likely to be lost.
>
> And that's a valid concern to have. Again, there are going to be
> cases where
> one editor has no choice but to strip the information added by
> another. This is
> simply how things are; there's nothing this or any other
> representation scheme
> can do to eliminatte this possibility.
>
>> Is that the intent?
>
> It's not a matter of intent. It is simply an unavoidable reality.
>
>> If so, it's probably worth
>> mentioning that an editor needs to be able to deal rationally with
>> the
>> loss of its own metadata.
>
> First, while it is certainly desireable for all editors to havethis> characteristic, there are going to be cases where it cannotpossibly
> work this way. So this can't be a requirement.
So am I understanding correctly that it's unreasonable to expect an
editor to just leave metadata alone if it doesn't understand it,
it depends on the context. Hopefully the XML format will help makeit a little
easier to do this in some cases. But certainly not all.
and
it's also unreasonable to expect an editor to behave in a sane manner
if its metadata gets stripped?
Again, it depends on the context.
It seems like there are three choices here: You can expect editors to
preserve metadata from other editors, you can allow stripping of
metadata and expect editors to deal rationally with its loss, or you
can expect that if a user uses more than one editor over the lifetime
of a script, one or both of the editors is likely to fail in a non-
graceful way.
Did the working group really choose the third option?
It isn't a question of what was chosen. The WG came up with one ofthe simplestlanguage syntaxes imagineable - the ABNF for Sieve is *tiny* - butany languagewith sufficient flexibility to represent any sort of useful subsetof thescripts people want to write to process email is going to be onethat's toocomplex for many editors to want to understand fully. And sinceeditors aren'talways going to have full semantic understanding, they cannot beexpected inall cases to be able to manipulate the full set of possible sievesproducing by
other systems without screwing up.
Of course the WG could have imposed some requirements on this,saying in effect"you must fully inderstand Sieve in order to be a compliant editor".But such arequirement would either have been roundly ignored, or implementorswouldchoose some other language that doesn't have such requirements. Andagain, thisdocument is absoutely not the place for stating such requirements,even if they
made sense to have, which IMO they do not.
Put another way, the language you appear to be seeking here is onethat is
trivially shown to be overconstrained by engineering realities into
nonexistance.
>
> Second, even if it were appropriate to make this a requirement,this
> document
> isn't the place for it. All this document does is describe an XML
> representation for Sieve. All of the requirements it imposes are
> directed at
> the representation and the process of converting to or from that
> representation.
>
> But since there is no requirement that a Sieve editor use this XML
> representation at all - and in practice most extant Sieve editors
> operate
> directly on the native Sieve format - imposing requirements on
> editors here
> makes little if any sense.
I fail to understand why it is acceptable to put requirements on
processors but not on editors. Certainly no one would expect aneditor
that does not implement this specification to be bound by any
requirements in it.
And that's precisely the problem. Most editors operate directly onthe regularSieve representation, not the XML representation. If you want toimpose arequirement on Sieve editors, this is not the place to do it becauseyou're
only hitting a fraction of the audience.
For that matter, you already have (admittedly
weak)  2119 language referring to editors
Actually, there is exactly one constraint the document imposes oneditors (theother compliance language explains a couple thinkgs editors areexplicitly
allowed to do), which has to do with the contents of displayblock and
displaydata not being allowed to include comment close sequences.This is doneto simply conversion processing and, unlike the requirements youwant toimpose, applies only to Sieve editors operating on the XMLrepresentation. So
it is appropriate for this document to state such a requirement.
(That said, properly speaking this should be a Schema and RNGconstraint, butit turns out to be very difficult to do in those languages, so wecheated anddid it as a prose constraint. In other words, this is a kluge to getaround alimitation in the specification language, just like textdescriptions attachedto ABNF do similar stuff on a regular basis in many otherspecifications.)
But if you are unwilling to place normative requirements around this,
It isn't a question of what I'm willing or unwilling to do, butrather what I,as an individual author working on WG document, is able or unable todo. Thestuff you appear to be affter clearly doesn't belong in thisdocument or AFAICTin any other document the WG plans to produce. If you want to seevariousgeneral requirements on Sieve editors written down somewhere you'regoing to
have to convince the WG that such an effort is worth it.
it would still help quite a bit to have some non-normative guidanceto
the effect that, since there is no requirement for an editor to
preserve metadata from another editor, an editor implementation can
expect to have its metadata removed from any given script. It it does
not handle this gracefully, bad user experiences are likely toresult.
Again, while such discussion might arguably be useful, this is notthe place
for it and I'm not the one you need to convince to do it.
>> >> Why not MUST? Wouldn't violation of this requirement introduce
>> >> interoperability problems between different implementations?
>> >
>> > It's a SHOULD because the WG believed that there may be some
>> > exception cases
>> > where an alternate format makes more sense.
>
>> Can you offer (in the text) some examples of those exceptionalcases,
>> and the consequences thereof?
>
> I see no need to.
>
>> My concern is that it seems like violating the should would pretty
>> much break interoperability between processors, wouldn't it?
>
> Sure, which is why it's a SHOULD, not a MAY. Again, this is the
> compliance
> level the WG decided was appropriate. Even if I agreed with you,
> this is not a
> simple editorial nit that I can change on my own.
It has been my experience that SHOULD level requirements that both
significantly impact interoperability and offer no explicit guidance
about the consequences of violation are some of the biggest sourcesof
interoperability problems in existing specs.
I'm starting to think that the WG had very limited expectations of
interoperability between implementations that use this format.
Realistic expectations would be closer to the mark. But again, youpersist inconfusing issues inherent in automatic generation and modificationof Sievecode with this specific representation format. To the extent thisspecificationattempts to address this, it is by relieving implementorz of theburden ofhaving yet another parser and supporting yet another syntax, and byselecting asyntax which has a vast array of very powerful manipulative toolsavailable. Wehope that this will help make some of the problems inherent in thisspace a
little easier to overcome.
I
recall a sentence stating that you expected interoperability between
editors and processors. I think an average reader would expect
interoperability among multiple editor implementations and among
multiple processor implementations. If the work group did not intend
that degree of interop, it would be extremely helpful to have some
sort of applicability statement to that effect.
Again you're asking for all sorts of stuff that far, far, farexceeds the
purview of this specification.
>
>> Or at
>> least cause encoded metadata to get lost if you convert from XMLto
>> sieve using one processor, and back to xml with another?
>
> That's the obvious case where such a loss would occur.
>
>> >
>> >> -- Security Considerations, last paragraph:
>> >
>> >> You mention that potentially executable content can be
>> introduced via
>> >> other namespaces, and that "appropriate security precautions"
>> should
>> >> be taken. I think this needs more discussion, as I am notsure an
>> >> implementor will understand what the authors considered
>> appropriate.
>> >
>> > The point of Sieve namespaces is to allow multiple XMLvocabularies
>> > to be used
>> > in a single document. This is a completely open endedmechanism and
>> > it is not
>> > our intent to label any particular use as inappropriate. Assuch,
>> > unless you
>> > have some specific text in mind, I for one fail to see whatcould
>> be
>> > added here
>> > that would be useful.
>
>> Maybe an examples of the sorts of bad behavior that could beenabled
>> by this would help.
>
> I think introducing another XML vocabulary into this documentsimply
> for
> purposes of showing that you can put bad stuff in XML would be
> belaboring the
> obvious.
>
>> Are you concerned that a scriptable editor that
>> stores scripts in metadata could be attacked by hand codingscripts
>> into structured comments in native Sieve?
>
> For that to happen there would have to be a pretty serious bug inthe
> conversion process, so no, this is not the concern here at all.
>
>> Buffer overflow attacks on
>> conversion processors?
>
> This would be another sort of conversion process bug and not
> relevant to the
> concern at hand.
>
> All this text is doing is point out the rather obvious fact thatXML
> namespaces allow you to mix vocabularies in a single document. As
> such, it
> is possible to drag in some other vocabulary that has its own setof
> security
> problems.
>
> If this still isn't clear to you I'm sorry, but I'm at a loss as to
> how
> to explain it further.
I think it's clear to me after reading your explanation. Am I correct
in understanding that the point of that sentence was that any given
namespace mayl have its own set of security considerations, and that
is beyond the scope of this document? If that is a correct
understanding, then I suggest replacing the last sentence with
something to the effect of:
"Such facilities will come with their own sets of security
considerations, which are beyond the scope of this document."
I really don't think this is that much clearer, but I can live withchanging
it to read:

Such material will necessarily have its own security
considerations, which are beyond the scope of this document.
Also, you elided one of the questions from my previous email without
responding:
>
>
>> -- Section 4.1, paragraph 11:  "Implementations MAY use this to
>> represent complex data
>> about that sieve such as a natural language representation ofsieve
>>   or a way to provide the sieve script directly."
>
>> I'm not sure I understand the last part --are you saying thiscan be
>> used as an alternate encoding of the script?
>
> Of course not. Since when do we have programs capalable of taking
> completely
> arbitrary natural language statements and reliably encoding theminto
> programming language statements?
>
> I see nothing unclear about this at all.
I get the part about representing a "natural languagerepresentation",
but what did you intend by "... or a way to provide the script
directly"?
My intent was to say exactly what was said - a UI could present Sieve
statements directly to the user. Really, I cannot see anythingunclear about
this at all and I am completely at a loss to explain it furhter.

                                Ned


_______________________________________________
Ietf mailing list
Ietf(_at_)ietf(_dot_)org
https://www.ietf.org/mailman/listinfo/ietf