Re: [xsl] Cannot write more than one result document to the same URI

At 2013-04-04 19:37 -0700, Dan Vint wrote:

I can live with the rule, just would like to understand the logic.


Consider the following scenario.  An XML document has two elements <b>:

  <a>
    <b id="1">...</b>
    <b id="2">...</b>
  </a>

An XSLT stylesheet uses the built-in template rule for <a> and has atemplate rule for :


   <xsl:template match="b">
     <xsl:result-document href="output.xml">
       <xsl:copy-of select="."/>
     </xsl:result-document>
   </xsl:template>

If the specification allowed this, then without considering theopportunities of parallelism, one might come to the conclusion thatthe file "output.xml" would always contain:


    <b id="1">...</b><b id="2">...</b>

The problem is that the specification does not require the XSLTprocessor to complete the processing of the first before startingor even ending the processing of the second . Sure asingle-process implementation "X" likely would. But a parallelized(is that a word?) implementation "Y" running on multiple CPUs couldvery well fully process the second before the first if itchose to do so. Its only obligation is to arrange the resulting treewith the result of processing the first before the result ofprocessing the second . This obligation ensures that the resultof processing by "X" is identical to the result of processing by"Y". But there is no obligation on what the processor does to get tothat result.

When using <xsl:result-document> the processor is not building theresult tree. It is creating a completely separate result. If theinstruction required "re-opening" of the file for append, processor"X" likely would produce the expected result, but processor "Y" inthe situation above would produce an unexpected result. Twoprocessors would produce two results.

And this is also why one cannot assert that the writing to the fileis even finished before the next attempt to write to the filestarts. The file handle could very well still be left open by oneparallel process when the other is ready to open it for itself. Soit can't be used even if the file is opened for write and not for append.

Note that some of my students have come to class thinking that youhave to fully complete an <xsl:result-document> before startinganother one to another URI, but I tell them that is not thecase. You can nest <xsl:result-document> instructions to differentURI target locations, and the nested <xsl:result-document> willcomplete the nested file and resume the "outer" file output whendone, without having to close and re-open the outer file. This hasbeen a very handy feature when fragmenting files. And this would beanother reason not to allow the same URI to be used.

I know the what now, even though I don't understand why the rule exists ;-)


I hope the explanation above has helped.

. . . . . . . . . Ken

--
Contact us for world-wide XML consulting and instructor-led training |
Free 5-hour lecture: http://www.CraneSoftwrights.com/links/udemy.htm |
Crane Softwrights Ltd.            http://www.CraneSoftwrights.com/s/ |
G. Ken Holman                   mailto:gkholman(_at_)CraneSoftwrights(_dot_)com 
|
Google+ profile: https://plus.google.com/116832879756988317389/about |
Legal business disclaimers:    http://www.CraneSoftwrights.com/legal |


--~------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--