xsl-list
[Top] [All Lists]

RE: How to get a list of XPath out of XMl schema documents - clarification

2005-02-25 03:44:18
Nice to see you quoting David Wheeler - the inventor of the subroutine, it
is said. He was a great computer scientist, with a wry sense of humour, but
an appallingly bad lecturer. I think I was one of only two students who
stuck it to the end of his course on numerical algorithms, and that was more
out of sympathy than because I was learning anything. 
 
It seems that you do indeed want to work from the schema rather than from an
instance document.

There are several difficulties in doing this. 

Firstly, any global element declaration in the schema is a potential
top-level element in an instance document. You could get around this by
supplying the root element name as an input to your generator, which would
then only produce paths starting at this element.

Secondly, wildcards. Simplest solution would be, if you find an element
wildcard (xs:any), just output the path with a trailing "//*".

Thirdly, namespaces. If you only want a solution for your own schemas and
your own schemas don't use namespaces, then you can ignore this problem and
the it becomes about 90% easier. But of course a general purpose solution
has to tackle this.

Fourthly, include/import/redefine. Again, you can wish this problem away if
it doesn't affect you.

Your particular schema doesn't even include named model groups or named
complex types, which makes it very simple. However, sooner or later you're
going to find your schema using these extra levels of indirection, and as
your quote from David Wheeler rightly points out, this is going to create
extra problems.

So the question really is, how general purpose do you want your solution to
be? If you restrict it to simple examples like the one quoted, it's very
easy, but to handle schemas in general, it's tough.

Michael Kay
http://www.saxonica.com/


-----Original Message-----
From: "Michael Lindenau (geschäftlich)" 
[mailto:michael(_dot_)lindenau(_at_)loesungen-und-ideen(_dot_)de] 
Sent: 25 February 2005 10:21
To: xsl-list(_at_)lists(_dot_)mulberrytech(_dot_)com
Subject: [xsl] How to get a list of XPath out of XMl schema 
documents - clarification

First - thanks to everybody for the fast answers and for the hints so 
far (even that I didn't get Dimitres example to run - using Stylus 
Studio with MSXML/MSXML 4.0). This seems to me a great and helpful 
mailing list.

Second - The XPaths of the XML schema documents are used to create 
mappings between EDI field descriptions and the XML file in 
which they 
should be transferred. The  XML schemas are the base for the mapping 
design. First intention is, to control if every given XPath in the 
mapping documents (usually Excel sheets) is correct written. Next 
intention is, to get an easy diff about the ongoing changes 
in the XML 
schema documents. So hierarchy doesn't matter. It's only 
important that 
the list contains every used unique XPath to an named element. 
Namespaces are not used. Indeed the XML schemata are very simple and 
don't use much of the possibilities that an XML schema offers. But at 
least they group elements to complex elements and use 
references. There 
is no elaborated software intended, which will use this information.

Below is a concrete example of such a schema

The desired XSLT would produce a list like

/OBBISOFT/MESSAGE/MESSAGE_NO
/OBBISOFT/MESSAGE/MESSAGE_VERSION_NO
/OBBISOFT/MESSAGE/MESSAGE_SENDER/MAIN
/OBBISOFT/MESSAGE/MESSAGE_SENDER/CENTER
/OBBISOFT/MESSAGE/MESSAGE_SENDER/TEAM
...

which lists only the elements.

A list like
/OBBISOFT
/OBBISOFT/MESSAGE
/OBBISOFT/MESSAGE/MESSAGE_NO
wouldn't be the desired output.

<?xml version="1.0" encoding="UTF-8"?>
<!--***************************************************-->
<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema"; 
elementFormDefault="qualified">
    <!--***************************************************-->
    <!--** SIMPLE TYPES-->
    <!--***************************************************-->
    <xs:simpleType name="NotEmptyString">
        <xs:restriction base="xs:string">
            <xs:minLength value="1"/>
        </xs:restriction>
    </xs:simpleType>
    <xs:simpleType name="DurationType">
        <xs:restriction base="xs:string">
            <xs:minLength value="1"/>
        </xs:restriction>
    </xs:simpleType>
    <xs:simpleType name="BooleanType">
        <xs:restriction base="xs:string">
            <xs:enumeration value="TRUE"/>
            <xs:enumeration value="FALSE"/>
        </xs:restriction>
    </xs:simpleType>
    <!--***************************************************-->
    <!--** ELEMENTS-->
    <!--***************************************************-->
    <!--ROLEType-->
    <!--NotEmptyString-->
    <xs:element name="CENTER" type="NotEmptyString"/>
    <xs:element name="CTO_NO" type="NotEmptyString"/>
    <xs:element name="ERROR_ID" type="NotEmptyString">
        <xs:annotation>
            <xs:documentation>Type of error</xs:documentation>
        </xs:annotation>
    </xs:element>
    <xs:element name="ERROR_KIND" type="NotEmptyString">
        <xs:annotation>
            <xs:documentation>Describes the nature of the error

See list of domain values</xs:documentation>
        </xs:annotation>
    </xs:element>
    <xs:element name="ERROR_MEANING" type="NotEmptyString"/>
    <xs:element name="EXT_SYSTEM" type="NotEmptyString">
        <xs:annotation>
            <xs:documentation>For external message consignee or 
dedicated system name</xs:documentation>
        </xs:annotation>
    </xs:element>
    <xs:element name="MAIN" type="NotEmptyString"/>
    <xs:element name="MESSAGE_NO" type="NotEmptyString">
        <xs:annotation>
            <xs:documentation>Running counter of the message 
for unique 
identification</xs:documentation>
        </xs:annotation>
    </xs:element>
    <xs:element name="REFERENCE" type="NotEmptyString"/>
    <xs:element name="TEAM" type="NotEmptyString"/>
    <xs:element name="TOURNEE" type="NotEmptyString"/>
    <!--***************************************************-->
    <!--long-->
    <!--***************************************************-->
    <!--int-->
    <xs:element name="MESSAGE_VERSION_NO">
        <xs:annotation>
            <xs:documentation>Design version of the 
message</xs:documentation>
        </xs:annotation>
        <xs:simpleType>
            <xs:restriction base="xs:integer">
                <xs:minInclusive value="0"/>
                <xs:maxInclusive value="999"/>
            </xs:restriction>
        </xs:simpleType>
    </xs:element>
    <!--***************************************************-->
    <!--decimal-->
    <!--***************************************************-->
    <!--date-->
    <!--***************************************************-->
    <!--duration-->
    <!--***************************************************-->
    <!--boolean-->
    <xs:element name="TRUCK_NO" type="NotEmptyString"/>
    <xs:element name="OBBISOFT">
        <xs:complexType>
            <xs:sequence>
                <xs:element name="MESSAGE">
                    <xs:complexType>
                        <xs:sequence>
                            <xs:element ref="MESSAGE_NO"/>
                            <xs:element ref="MESSAGE_VERSION_NO"/>
                            <xs:element name="MSG_SENDER">
                                <xs:annotation>
                                    <xs:documentation>Organisational 
unit sending the message</xs:documentation>
                                </xs:annotation>
                                <xs:complexType>
                                    <xs:choice>
                                        <xs:annotation>
                                            <xs:documentation>1 or 
Both</xs:documentation>
                                        </xs:annotation>
                                        <xs:choice>
                                            <xs:element 
name="ORGANIZATION">
                                                <xs:annotation>
                                                    
<xs:documentation>For DHL internal message 
consignee</xs:documentation>
                                                </xs:annotation>
                                                <xs:complexType>
                                                    <xs:sequence>
                                                        <xs:element 
ref="MAIN"/>
                                                        <xs:element 
ref="CENTER"/>
                                                        <xs:element 
ref="TEAM"/>
                                                    </xs:sequence>
                                                </xs:complexType>
                                            </xs:element>
                                            <xs:element 
ref="EXT_SYSTEM"/>
                                        </xs:choice>
                                        <xs:sequence>
                                            <xs:annotation>
                                                <xs:documentation/>
                                            </xs:annotation>
                                            <xs:element 
name="ORGANIZATION">
                                                <xs:annotation>
                                                    
<xs:documentation>For DHL internal message 
consignee</xs:documentation>
                                                </xs:annotation>
                                                <xs:complexType>
                                                    <xs:sequence>
                                                        <xs:element 
ref="MAIN"/>
                                                        <xs:element 
ref="CENTER"/>
                                                        <xs:element 
ref="TEAM"/>
                                                    </xs:sequence>
                                                </xs:complexType>
                                            </xs:element>
                                            <xs:element 
ref="EXT_SYSTEM"/>
                                        </xs:sequence>
                                    </xs:choice>
                                </xs:complexType>
                            </xs:element>
                            <xs:element name="MSG_CONSIGNEE">
                                <xs:annotation>
                                    <xs:documentation>Organisational 
unit receiving the message</xs:documentation>
                                </xs:annotation>
                                <xs:complexType>
                                    <xs:choice>
                                        <xs:annotation>
                                            <xs:documentation>1 or 
Both</xs:documentation>
                                        </xs:annotation>
                                        <xs:choice>
                                            <xs:element 
name="ORGANIZATION">
                                                <xs:annotation>
                                                    
<xs:documentation>For DHL internal message 
consignee</xs:documentation>
                                                </xs:annotation>
                                                <xs:complexType>
                                                    <xs:sequence>
                                                        <xs:element 
ref="MAIN"/>
                                                        <xs:element 
ref="CENTER"/>
                                                        <xs:element 
ref="TEAM"/>
                                                    </xs:sequence>
                                                </xs:complexType>
                                            </xs:element>
                                            <xs:element 
ref="EXT_SYSTEM"/>
                                        </xs:choice>
                                        <xs:sequence>
                                            <xs:element 
name="ORGANIZATION">
                                                <xs:annotation>
                                                    
<xs:documentation>For DHL internal message 
consignee</xs:documentation>
                                                </xs:annotation>
                                                <xs:complexType>
                                                    <xs:sequence>
                                                        <xs:element 
ref="MAIN"/>
                                                        <xs:element 
ref="CENTER"/>
                                                        <xs:element 
ref="TEAM"/>
                                                    </xs:sequence>
                                                </xs:complexType>
                                            </xs:element>
                                            <xs:element 
ref="EXT_SYSTEM"/>
                                        </xs:sequence>
                                    </xs:choice>
                                </xs:complexType>
                            </xs:element>
                            <xs:element name="MSG_ERROR" 
minOccurs="0" 
maxOccurs="unbounded">
                                <xs:annotation>
                                    
<xs:documentation>Contains the error 
messages generated by the MessageManager module during 
integration/extraction</xs:documentation>
                                </xs:annotation>
                                <xs:complexType>
                                    <xs:sequence>
                                        <xs:element ref="ERROR_ID"/>
                                        <xs:element 
ref="ERROR_MEANING"/>
                                        <xs:element ref="REFERENCE" 
minOccurs="0"/>
                                    </xs:sequence>
                                </xs:complexType>
                            </xs:element>
                            <xs:choice>
                                <xs:element name="STANDARD_MESSAGE">
                                    <xs:annotation>
                                        <xs:documentation>NPS 
messages</xs:documentation>
                                    </xs:annotation>
                                    <xs:complexType>
                                        <xs:choice>
                                            <xs:choice>
                                                <xs:annotation>
                                                    
<xs:documentation>NPS Messages</xs:documentation>
                                                </xs:annotation>
                                                <xs:element 
name="DC_INPUT">
                                                    <xs:annotation>
                                                        
<xs:documentation>Input from the Swedish Tour Optimisation Software

Please note: The MAIN is specified in  ../MSG_CONSIGNEE/MAIN
</xs:documentation>
                                                    </xs:annotation>
                                                    <xs:complexType>
                                                        <xs:sequence>
                                                            
<xs:element 
name="DC_INPUTS" maxOccurs="unbounded">
                                                                
<xs:complexType>
                                                                    
<xs:sequence>
                                                              
          
<xs:annotation>
                                                              
          
    <xs:documentation>CTO_NO
Contains the DHL identification number of the (delivery-) 
operation that 
has been optimised.

SUBBAY_ID is a dedicated area inside a terminal where the 
goods for the 
above operation will be stored

TOURNEE contains the standard round trip to be used to 
execute the operation

TRUCK_NO =
registration plate number of the truck performing the round trip
</xs:documentation>
                                                              
          
</xs:annotation>
                                                              
          
<xs:element ref="CTO_NO"/>
                                                              
          
<xs:element name="SUBBAY_ID" type="NotEmptyString"/>
                                                              
          
<xs:element ref="TOURNEE" minOccurs="0"/>
                                                              
          
<xs:element ref="TRUCK_NO" minOccurs="0"/>
                                                                    
</xs:sequence>
                                                                
</xs:complexType>
                                                            
</xs:element>
                                                        </xs:sequence>
                                                    </xs:complexType>
                                                </xs:element>
                                            </xs:choice>
                                        </xs:choice>
                                    </xs:complexType>
                                </xs:element>
                            </xs:choice>
                        </xs:sequence>
                    </xs:complexType>
                </xs:element>
            </xs:sequence>
        </xs:complexType>
    </xs:element>
</xs:schema>


-- 

Kind regards

Michael Lindenau
Lösungen+Ideen

"Any problem in computer science can be solved with
 another layer of indirection. But that usually will
 create another problem!" <David Wheeler>

mobile: +49 (0) 179 29 28 834
e-mail: michael(_dot_)lindenau(_at_)loesungen-und-ideen(_dot_)de

-------------------------------------------------------
Lösungen+Ideen

Alejandro Puskin No 1, Portal H, 5C
29011 Málaga
ES - España
-------------------------------------------------------

This email is confidential. If you are not the intended
recipient, you must not disclose or use the information
contained in it. If you have received this mail in error,
please tell me immediately by return email and delete
the document.


--~------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: 
<mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--





--~------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--



<Prev in Thread] Current Thread [Next in Thread>