Are you trying to do all this using the standalone XSLT or XQuery
processor or are you thinking of writing a Java application that
queries the directory content to get the small files ? (otherwise if
the big file has no reference to the small files you can't "load"
them).
In any case if memory is not a problem (both the XSLT and the XQuery
processor operate to the in-memory representation of the full file) I
would say use the language you are more familiar with. You should be
able to do what you want in both.
If memory is a problem you may want to look at the streaming APIs in
"nux" (http://dsd.lbl.gov/nux/). You could use XPath to scan the big
file and XQuery to match the single node with the small files. It uses
Saxon as the XPath/XQuery processor. It may not be powerful enough (in
order to "stream" it limits the XPath functionalities that you can use)
but it's worth a look.
-- Raffaele
On Jan 5, 2005, at 2:53 PM, alan m wrote:
I did a post of this in another forum but did not yet
get a reply.
I have extremely large (over 300 MB) XML file and tens
of thousands of small xml files generated after
applying various XSLT on the one big XML file.
I am using Saxon for XSLT and will be using it also
for XQuery.
Is Xquery or XSLT is better solution for this problem?
Query each text node in the big xml file and verify
that this content is present in one of the results xml
files. Based on this information generate a report
that shows which content is present and in which file
and in a separate section which content was not found
in result xml files and also show this content parent
element or other markup to indicate its position in
the big xml file.
All the small xml files are stored as flat files in
various directories on Windows File system although
most files are in one directory. The big XML file is
fairly complex with multiple levels of nesting
elemenents.
Any comments or suggestions?
Thank you
__________________________________
Do you Yahoo!?
Yahoo! Mail - now with 250MB free storage. Learn more.
http://info.mail.yahoo.com/mail_250
--~------------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--
!DSPAM:41dc7a2d37284051019849!
--~------------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--