xsl-list

Re: [xsl] convert XML to NFC

2011-04-28 07:38:28
Does this imply that élem (with LATIN SMALL LETTER E WITH ACUTE) and
élem (with LATIN SMALL LETTER E and COMBINING ACUTE ACCENT) are two
different element names? That would be a source of hidden errors,
because no editor will show the difference.
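
To make the distinction concrete, here is a minimal Python sketch (Python is used only for illustration; the variable names are invented for this example). It builds the two spellings from explicit code points and shows that they compare as different strings until one of them is normalised to NFC.

```python
import unicodedata

# "élem" spelled with LATIN SMALL LETTER E WITH ACUTE (U+00E9)
precomposed = "\u00e9lem"
# "élem" spelled with LATIN SMALL LETTER E + COMBINING ACUTE ACCENT (U+0301)
decomposed = "e\u0301lem"

# The two strings render alike but are different code point sequences.
print(precomposed == decomposed)

# NFC composes the base letter and the combining accent into U+00E9.
print(unicodedata.normalize("NFC", decomposed) == precomposed)
```

A name comparison done code point by code point would therefore treat the two spellings as distinct unless both sides are normalised first.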

Simon

On Wed, Apr 27, 2011 at 21:39, Liam R E Quin <liam(_at_)w3(_dot_)org> wrote:
On Wed, 2011-04-27 at 17:14 +0100, David Carlisle wrote:
Well, it's probably quicker to use a tool that treats the file as plain
text (since Unicode normalisation should not break the XML).

One breakage to watch for: XML names don't have to be normalised, and
normalisation changes them, so ID values, entity names and element
names may be affected and might no longer match the corresponding names
in a DTD or schema. Cross-document links may also be affected.

If this is the case I'd argue the original input was broken, but if you
need to normalise it, you already suspect that...

For that reason, you might want to use XSLT with a slightly-modified
identity transform as suggested, to leave the markup unaffected.
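
The same idea can be sketched outside XSLT. The following is a rough Python illustration, not the stylesheet suggested in the thread: it assumes the standard library's xml.etree.ElementTree is acceptable for the document at hand, and the helper name normalise_text_nodes is hypothetical. It normalises only character data (text nodes) to NFC and deliberately leaves element names, attribute names and attribute values alone, since ID values live in attributes.

```python
import unicodedata
import xml.etree.ElementTree as ET

def normalise_text_nodes(root):
    """Normalise character data to NFC; leave names and attributes untouched.

    Hypothetical helper for illustration only.
    """
    for elem in root.iter():
        if elem.text is not None:
            elem.text = unicodedata.normalize("NFC", elem.text)
        if elem.tail is not None:
            elem.tail = unicodedata.normalize("NFC", elem.tail)
    return root

# Sample input: decomposed accents in the element name, an attribute
# value and the text content.
source = "<e\u0301lem note='cafe\u0301'>re\u0301sume\u0301</e\u0301lem>"
doc = normalise_text_nodes(ET.fromstring(source))

print(ascii(doc.tag))          # element name, unchanged
print(ascii(doc.get("note")))  # attribute value, unchanged
print(ascii(doc.text))         # text content, now precomposed
```

Note that ElementTree discards comments and the DOCTYPE when parsing this way, so this is only a sketch of the principle and not a lossless round trip.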

Liam

