Re: [xsl] Unicode question

On Wed, May 23, 2018 at 05:41:44PM -0000, Erik Siegel 
erik(_at_)xatapult(_dot_)nl scripsit:

   I have a problem that is Unicode related. Some Unicode characters (for
   instance emojis) can have some code *following* the actual character to
   indicate a variant. For instance in the following stylesheet, the emoji
   character in $x (U+1F61C) is followed by U+DE1C. When I look in oXygen it
   shows me this. But when I run the stylesheet it reports a string length of
   1 and only a single codepoint.

   I suppose that is true, it is only  single character. But how can I find
   out (in XPath) what the value of the second “character” (indicator?) is?
   Or is that impossible anyway?


If I try to look up U+DE1C, I am informed that this is not a Unicode
code point.   It is the second half the UTF-16 surrogate pair --
D83D DE1C -- use to represent U+1F61C in UTF-16.

(See <https://apps.timwhitlock.info/unicode/inspect?s=%F0%9F%98%9C> )

I would suppose that oXygen is showing you UTF-16 source but the
processing is happening in UTF-8, where the emoji is a single code point
and corresponding glyph.

-- Graydon
--~----------------------------------------------------------------
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
EasyUnsubscribe: http://lists.mulberrytech.com/unsub/xsl-list/1167547
or by email: xsl-list-unsub(_at_)lists(_dot_)mulberrytech(_dot_)com
--~--

<Prev in Thread]	Current Thread	[Next in Thread>
[xsl] Unicode question, Erik Siegel erik(_at_)xatapult(_dot_)nl Re: [xsl] Unicode question, Michael Kay mike(_at_)saxonica(_dot_)com Re: [xsl] Unicode question, Graydon graydon(_at_)marost(_dot_)ca <= Re: [xsl] Unicode question, David Carlisle d(_dot_)p(_dot_)carlisle(_at_)gmail(_dot_)com Re: [xsl] Unicode question, Julian Reschke julian(_dot_)reschke(_at_)gmx(_dot_)de Re: [xsl] Unicode question, David Carlisle d(_dot_)p(_dot_)carlisle(_at_)gmail(_dot_)com

Previous by Date:	Re: [xsl] Unicode question, Michael Kay mike(_at_)saxonica(_dot_)com
Next by Date:	[xsl] How to access PSVI in XSLT3, Jason Polis jason(_at_)x-x(_dot_)net
Previous by Thread:	Re: [xsl] Unicode question, Michael Kay mike(_at_)saxonica(_dot_)com
Next by Thread:	Re: [xsl] Unicode question, David Carlisle d(_dot_)p(_dot_)carlisle(_at_)gmail(_dot_)com
Indexes:	[Date] [Thread] [Top] [All Lists]