xsl-list
[Top] [All Lists]

RE: [xsl] XPath Question (related to Java)

2007-06-24 12:18:39
In the XPath data model, you see nodes rather than markup. That's why
there's no "<" present. Instead, the Definition element will have a child
that is a <sub> element.

Evaluating the expression as a string will give you the string value of the
node, this is the concatenation of all the contained text, ignoring the
markup.

You seem to want to serialize the node as XML, to reinstate the markup.
There's no direct way of doing that in the XPath API; you probably have to
do an identity transformation from a DOMSource containing the node to a
StreamResult. (You'll have to change your call to retrieve a NODESET rather
than a STRING). Alternatively there may be a method such as toXML() on the
DOM Node object - I've forgotten.

Michael Kay
http://www.saxonica.com/ 

-----Original Message-----
From: Grant Slade [mailto:grant(_dot_)slade(_at_)gmail(_dot_)com] 
Sent: 24 June 2007 19:03
To: xsl-list(_at_)lists(_dot_)mulberrytech(_dot_)com
Subject: [xsl] XPath Question (related to Java)

Hi, I have the following xml which gets read from a file as 
part of a Node:
            <Definition> An organic compound in which the 
aldehyde group (HC=O) is connected to a branched or 
unbranched open chain of carbon atoms rather than a ring. 
Some aldehydes are created during the reactions of oxidants 
used as disinfectants, particularly ozone (O<sub>3</sub>), 
with natural organic matter. </Definition>

When I run it through the following method  it ignores the 
<sub></sub>:
      public String getDefinitionFromNode(Node node) throws 
javax.xml.xpath.XPathExpressionException
      {
            XPath xpath = XPathFactory.newInstance().newXPath();
            String definitionExpression = "Definition";
            String definition = (String) 
xpath.evaluate(definitionExpression, node, XPathConstants.STRING);
            if(definition.contains("<"))
                  System.out.println ("found a <");
            else
            {
                  System.out.println ("did not find a <");
            }
            return definition;
      }

When the program runs, it outputs the following:

did not find a <
--------------------------------
<dictionary n=""><TermName>aliphatic 
aldehyde</TermName><Definition>An organic compound in which 
the aldehyde group (HC=O) is connected to a branched or 
unbranched open chain of carbon atoms rather than a ring.
Some aldehydes are created during the reactions of oxidants 
used as disinfectants, particularly ozone (O3), with natural 
organic matter.</Definition></dictionary>

How do I get it to output the <sub></sub> elements?

The complete node is:
        <Term>
            <Entry> aliphatic aldehyde </Entry>
            <Definition> An organic compound in which the 
aldehyde group (HC=O) is connected to a
                branched or unbranched open chain of carbon 
atoms rather than a ring. Some aldehydes
                are created during the reactions of oxidants 
used as disinfectants, particularly
                ozone (O<sub>3</sub>), with natural organic matter.
</Definition>
            <SeeAlso>disinfection by-product</SeeAlso>
            <IMAGE fileName="A-17.gif"/>
        </Term>

--~------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: 
<mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--



--~------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--

<Prev in Thread] Current Thread [Next in Thread>