perl-unicode

Re: Weird interaction of ord, split, and substr with UTF-8?

2000-10-31 09:58:03
At 6:20 AM +0100 10/31/00, Andreas J. Koenig wrote:
>>>>> On Mon, 30 Oct 2000 19:02:25 -0800, Paul Hoffman <phoffman(_at_)proper(_dot_)com> said:

> Has anyone else come across this? Is there a way to use ord in a loop after a split
 > that works?

The bug has been fixed in the development version a while after 5.7.0
came out. You find instructions on how to get at the patches in the
perlhack manpage.

Thanks. However, I can't find a patch at <http://www.xray.mpe.mpg.de/mailing-lists/perl5-porters/> that seems related to the bug. I searched for "utf-8 ord". Is there a patch number you can give me?

Also, I'd like to distribute my code to others who probably won't have a patched system. Thus, I'd love to find a way, even a kludgy way, in 5.6.0 to split up a string into utf-8 characters that will work with ord. If need be, I could even use Unicode::String, convert to a UCS-4, slice into four-octet chunks, then convert them back to a UTF-8, but I'd like something less ugly to show the public.