At 03:33 PM 8/28/2006, Andrew wrote:
Wendell Piez wrote:
Dear Dimitre,
At 08:41 PM 8/27/2006, you wrote:
I want to use a single, short word to express the act of removing
duplicates from a node-set. I remember seing the word "de-duplication"
used, however it sounds ugly.
Normalisation
Normalization (or 'normalisation' for those who prefer British
orthography) would rather be the general process of transforming a
set of values into their normalized forms. So,
<date value="2006">May Day 2006</date>
<date value="2006-05-01"/>
<date value="5-1-2006">May 1 2006</date>
might be normalized as
<date value="2006-05-01">May 1 2006</date>
<date value="2006-05-01">May 1 2006</date>
<date value="2006-05-01">May 1 2006</date>
but this would not deduplicate them.
These are very different problems, especially for XSLT. Generally
speaking, deduplicating requires normalization first since
deduplication works only over canonical forms (or comparing them to
see which are duplicates becomes very difficult).
Cheers,
Wendell
--~------------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--