Re: Was: [xsl] mode and moved to Namespaces

Hi Liam,

Thank you for your interesting response.

I agree with you that, as I noted also, namespaces take some getting useto, and typos can be ... surprising.

Let's also be clear that I have no document with 80 or so namespaces,only stylesheets that deal with a lot of different document types.

As for translating dictionaries, what I tried to point out was not thatthere was a syntactic link between namespaces and content languages,only that conceptually natural languages, for example, were "conceptual"namespaces, implying, or trying to imply that they may be a naturalmatch for (XML) namespaces, in a translation dictionary context. Yes Ican create a dictionary like

<dic>
<word>
<instance xml:lang="en" gender="m">Mr</instance>
<instance xml:lang="en" gender="f">Mrs</instance>
<instance xml:lang="fr" gender="m">M.</instance>
<instance xml:lang="fr" gender="f">Mme</instance>
        ...
</word>
</dic>

but, given the proper namespace declarations, I could also have it as
<dic>

<word en:instance="Mr" en-f:instance="Mrs" fr:instance="M."fr-f:instance="Mme" ... />

    ...
</dic>

in both cases, source documents are using xml:lang to determine what topick, but:

1. while the second version could use en, en-f, fr, fr-f for values, thefirst would either require a second attribute for gender or a piece ofcode to split the xml:lang attribute, to match two attributes instead ofa namespace2. in the first version, the example entry uses 17 nodes while thesecond version uses 5, and more languages would increase the difference3. as you point out, namespaces are syntactic constructs, that probablyget resolved at compile time, rather than run time

Which brings up the question that the first version is trying to getaround using namespaces for something that is conceptuallynamespace-based, with some real physical, transformational, and designcosts.

I did not want to get into the discrepancies of RDF, but just triedpointing out that stylesheets and transformation pipelines may have todeal with multiple document types, with the associated namespaces.OTOH, I have been thinking and addressing the RDF discrepancies andwould go beyond what you suggest, but it may be more appropriate to takethis off-list, if you care.

Having been using 80 namespaces in a stylesheet, I quite realize thatthere are no "8 namespaces" limit. It was Andrew, and Gerritindirectly, that suggested that 8 namespaces could be a reasonablemaximum for a stylesheet. As well, both Michael and Gerrit pointed outthat a binary search may prove more efficient for managing namespaces,beyond that "reasonable" maximum. I question the reasonableness of thesuggested maximum and feel that indeed, a binary namespace search wouldbe a useful feature.

As for namespace-based specific knowledge domain transformations, onemay have, for example, many attributes and elements, in transformationpipeline streams, for example, that hold, let's say GML content that mayrequire typical parsing or transformation of some sort. So either eachtransformation stylesheets has to know and match all possible attributeand element names that may contain such data, or all these attributesand elements are put in a common namespace and a template simply matchesthe namespace, to processes all possible related items. Designwise, Imuch prefer the latter, even if it implies defining a namespace for timeintervals and another for space intervals, and another for ...

As you suggest, one could also put all the target items into sub elementunder some convention and systematically impose that convention,somewhat like:

<timeinterval>
<morning>...</morning>
<after-noon>...</after-noon>
</timeinterval>

knowing that every element under <timeinterval> should be processedaccording to time interval transformations (e.g. <xsl:apply-templatesselect="timeinterval/*" mode="timeinterval"/>)but, as this involves more nodes, less flexibility, and more processing,again, it feels like a workaround and patch for not using namespaces,when namespaces fit naturally, more efficiently, and more simply.


As for remembering the namespaces and prefixes, I would recommend:
1. define only required namespaces
2. define namespaces and prefixes carefully

3. break large stylesheets into smaller ones, typically around thenamespaces and associated application domains4. look at the top of the stylesheet to find (and define) namespaces andtheir prefixes5. its mostly the stylesheets, the stylesheet documentation, thestylesheet authors, and the individual domain document authors that needto remember the namespaces that each is working withThis is still better than having to remember the dummy elements thatwould have to be added otherwise, apart from the additional associateddocument and transformation overhead. These special items would alsonot be formally declared at the top of the stylesheets, making redundentdocumentation a requirement. Note also that typos on the markingelements could also create issues.

Again, those RDF issues, especially for semantic relevance andcoherence, have been resolved with much better grounded foundations, andI would be happy to discuss and present on this, in a more appropriatecontext.


Regards,
Andre

On Sun, 2011-04-17 at 19:30 -0400, ac wrote:

I am surprised that, with all these XML and XSLT gurus around the table,
using more than 8 namespaces in a stylesheet or application, seems like
such a strange, "out of bounds", thing.

There's no hard limit, but in general the more you use, the harder life
will get.  It's up to you to remember them all... declare them... debug
errors when an XSLT template doesn't match because of a typo in a prefix
or URI...

Don't natural languages at least each have their own "natural"
namespace? If an application supports i18n and localization, should it
use less namespaces than the number of locale it supports?

Namespaces are 100% unrelated to content language.

Use xml:lang to indicate language. You don't need a different namespace
for different content.

  Should one not use RDF when using StratML, or XSD,
or Atom?

Well, it's unfortunate that the RDF people were farly clueless about XML
when they insisted on the namespace deign we have today.  RDF/XML
confuses the syntactic and the semantic, the envelope and the contents.
But then, RDF/XML confused a lot of things (if Jonathan is reading,
"Mona lisa is a jpeg, and she is 700 pixels wide and hangs in the
Louvre" or one of my books had similarly crazy examples based on the
not-even-well-formed vcard examples in the rdf spec.)

<triple><r>resource 1</r><rel>relation</re><r>resource 2</r></triple>
would have done just fine, with, if needed,
<triple><r>resource 1</r><rel auth="MESH">relation</re><r>resource
2</r></triple>
to identify a naming authority.

But it's not that common for a single document to be using all those
different things. And just because the RDF community thinks everything
is a URI is no reason to start thinking everything is a namespace.

To answer your question, though, yes, people often do mix vocabularies.
Sometimes it's better to map to and from other vocabularies only at the
boundaries of your system, and to have a single, simpler language you
use internally.

Should names like "position" be in the same namespace whether it is
referring to time, or space, or both?

I don't see why not. All the namespace does is disambiguate elements
from *DIFFERENT* organizations or maintainers.  If it's you that's
writing the vocabularies, you're not likely to need namespaces to avoid
conflicts with your own names.  Namespaces in XML are purely syntactic.
For that matter,
<staff><position><name>Director</name><duties.... is fine too, in
context.

For XSLT you can have match="staff/position" as easily (more easily)
than you can have match="staff:position".

What kind of XML data should stylesheets transform, and to what XML data
should they transform it to, so that stylesheets do not use more than 8
namespaces?

There's no limit of 8 namespaces.

For that matter there's no 8 megabyte limit on the length of an element
name in XML. Nor 8 gigabytes.

Sure, you can do pretty neat things with stuff that's syntactically XML
but that's out well beyond the norm... sometimes if you do, though,
you'll end up fighting both tools and people.  There's no hard cut-off,
and if what you're doing catches on, maybe the XML community will change
over time in this direction. So I won't say it's bad, or it's good, but
only, it's not commonly done today to have 80 or more namespaces in one
document.  Once you get over half a dozen they are going to get pretty
hard for most people to remember!

Liam


--~------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe(_at_)lists(_dot_)mulberrytech(_dot_)com>
--~--