Re: CGI::Util unescape() after escape() loses utf8 flag

> On closer inspection of the source, I see that the "unescape" function can
> accept a string like /u([0-9a-fA-F]{4})/ as well as /%([0-9a-fA-F]{2})/,
> and it will correctly set the utf8 flag when decoding a string that matches
> the former form; but the "escape" function can only produce the latter
> form.
>
> Perhaps it might be time to add an optional argument to "escape", that
> would allow for creating the "uHHHH" form? It would be simple enough to
> do, I'd expect.

Isn't this what the first match is doing? And should that be
/u[0-9a-fA-F]{4,6}/ to allow for multi-lingual plane stuff?


Martin Hosken
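
One thing to check before widening the quantifier: a variable-length match is greedy, so it can absorb literal characters that happen to look like hex digits. The sketch below compares the two quantifiers on the same input. It is only an illustration and not CGI::Util's code: decode_u is a hypothetical helper, and the "%u" prefix on the escape is an assumption about how the form appears on the wire.

```perl
use strict;
use warnings;

# Hypothetical patterns for comparison; the "%u" prefix is an assumption.
my $fixed_re = qr/%u([0-9a-fA-F]{4})/;     # exactly four hex digits
my $wide_re  = qr/%u([0-9a-fA-F]{4,6})/;   # four to six hex digits

# Hypothetical helper: replace each escape with the character it names.
sub decode_u {
    my ($text, $re) = @_;
    $text =~ s/$re/chr(hex($1))/ge;
    return $text;
}

# "%u00e9" followed by the literal text "ab".
my $fixed = decode_u('%u00e9ab', $fixed_re);
my $wide  = decode_u('%u00e9ab', $wide_re);

# A code point above U+FFFF needs more than four digits.
my $astral = decode_u('%u1F600', $wide_re);

printf "fixed:  %d chars, first is U+%04X\n", length($fixed),  ord($fixed);
printf "wide:   %d chars, first is U+%04X\n", length($wide),   ord($wide);
printf "astral: %d chars, first is U+%04X\n", length($astral), ord($astral);
# fixed:  3 chars, first is U+00E9
# wide:   1 chars, first is U+E9AB
# astral: 1 chars, first is U+1F600
```

With {4,6} the trailing "ab" is absorbed into the code point, so a wider form would probably need a delimiter or a fixed width to round-trip safely.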
