xsl-list
[Top] [All Lists]

Re: [xsl] Efficient way to do an identity transform, eliminating duplicate elements, in XSLT 1.0?

2013-12-13 12:15:27
So, how big is "big"?

My new home PC I bought in May has 32GB RAM.

It seems obvious that paying $2000 for a decent computer is so much
cheaper than paying programmers (myself included) to invent tricks ---
at least for this kind of problems.





On Fri, Dec 13, 2013 at 6:33 AM, Costello, Roger L. 
<costello(_at_)mitre(_dot_)org> wrote:
Hi Folks,

I need to do an identity transform on XML files like this:

<Document>
    <First>
        <id>A</id>
        <blah>B</blah>
        <id>A</id>
    </First>
    <Second>
        <id>C</id>
        <blah>D</blah>
        <id>C</id>
    </Second>
</Document>

I want the identity transform to remove duplicate elements in <First> and 
remove duplicate elements in <Second>. So the output should be:

<Document>
    <First>
        <id>A</id>
        <blah>B</blah>
    </First>
    <Second>
        <id>C</id>
        <blah>D</blah>
    </Second>
</Document>

I need to use XSLT 1.0 to implement this.

I created an implementation, but it uses <copy> statements. The actual XML 
document that I am transforming is huge, nearly 1 GB. When I run my XSLT 
implementation the processor runs out of memory. I think it's due to the 
<copy> statements. I need a very efficient implementation. Any suggestions?

/Roger

--~------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: 
<mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--




-- 
Cheers,
Dimitre Novatchev
---------------------------------------
Truly great madness cannot be achieved without significant intelligence.
---------------------------------------
To invent, you need a good imagination and a pile of junk
-------------------------------------
Never fight an inanimate object
-------------------------------------
To avoid situations in which you might make mistakes may be the
biggest mistake of all
------------------------------------
Quality means doing it right when no one is looking.
-------------------------------------
You've achieved success in your field when you don't know whether what
you're doing is work or play
-------------------------------------
To achieve the impossible dream, try going to sleep.
-------------------------------------
Facts do not cease to exist because they are ignored.
-------------------------------------
Typing monkeys will write all Shakespeare's works in 200yrs.Will they
write all patents, too? :)
-------------------------------------
I finally figured out the only reason to be alive is to enjoy it.

--~------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--