On Fri, Aug 29, 2003 at 11:00:38AM +0200, Sven Neuhaus wrote:
> Hi,
> I'm seeing a script using XML::Simple go berserk (eats CPU + Memory) when
> feeding it XML with illegal UTF-8.
> The perl debugger is telling me it's jumping around in Encode.pm
> between line 187 ("sub decode_utf8") and line 246 ("*decode = sub {...").
> It's doing something like:
>     my $str = Encode::decode_utf8($octets);
> and then
>     return undef unless utf8::decode($str);
> (2 functions calling each other).
> Is this a known bug in Encode.pm? Has it been fixed?
Maybe and maybe. Could you show the illegal UTF-8?
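One way to pin down the offending bytes is to check the octets for well-formed UTF-8 and hex-dump them. What follows is only a minimal sketch, not taken from the original report. It assumes a reasonably recent Encode in which decode() with FB_CROAK dies on malformed input; try_decode_utf8 is a hypothetical helper name and the sample strings are made up.

```perl
use strict;
use warnings;
use Encode ();

# Hypothetical helper: returns the decoded string, or undef when
# $octets is not well-formed UTF-8.
sub try_decode_utf8 {
    my ($octets) = @_;

    # Decode a copy: with a CHECK argument, decode() may modify its input.
    my $copy = $octets;

    # FB_CROAK makes decode() die on malformed input; eval traps that
    # and leaves $str undefined.
    my $str = eval { Encode::decode('UTF-8', $copy, Encode::FB_CROAK) };
    return $str;
}

# Made-up samples: "caf" plus a two-byte e-acute, then "caf" plus a
# lone Latin-1 byte, which is not valid UTF-8.
for my $octets ("caf\xc3\xa9", "caf\xe9") {
    my $hex = unpack 'H*', $octets;    # hex dump of the raw bytes
    my $ok  = defined try_decode_utf8($octets);
    printf "%s: %s\n", $hex, $ok ? 'well-formed' : 'malformed';
}
```

The hex dump of whichever input is reported as malformed is the sort of thing that would be useful to post.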
> My Encode.pm is Version 1.75. It's part of the perl debian package
> 5.8.0-19 (debian unstable).
> I wish Encode.pm would handle invalid UTF-8 in a graceful manner...
> Cheers,
> -Sven
--
Jarkko Hietaniemi <jhi(_at_)iki(_dot_)fi> http://www.iki.fi/jhi/ "There is this special
biologist word we use for 'stable'. It is 'dead'." -- Jack Cohen