Ella,
Do you (or anyone else) happen to know how Teletex strings
are to be sorted?The only reference that I can find is
the caseIgnoreOrderingMatch attribute, which returns True
if if the same collation order results after lower case
characters are replaced with upper case, but it doesn't
say what the collation order is.
If diacritical marks are considered nonspacing and
appear BEFORE the character they modify, sorting
the octets will lead to a very strange result --
all of the standard characters will appear first, followed by
all of the diacritical marks. That certainly isn't what one
would expect in a standard dictionary.
For example, my French-English dictionary lists the following
words in order (I hope that all of the foreign characters survive
the mail process):
bbtard, bateau, bbtiment, bbtir, bbton, batterie, biatitude.
My German-English dictionary lists Strapaze, Stra_, Strategie, so
the s-zet (looks like beta) character is sorted as though it were
the double-s which it replaces.
In Spanish, canto is followed by caqon, then caoba (mahagony),
so the n-tilde follows n.
I have no idea what the standard lexicographic ordering is for
punctuation signs, currency symbols, etc.
Surely this is defined somewhere? Glad I don't have to write the
code to support it!
Bob