perl-unicode

Re: In-Band Information Considered Harmful

1998-10-24 13:30:56
You, John Macdonald, wrote:
++ 
++ The question is which of the following lines the pattern:
++ 
++     print if /Perl is terrific/i;
++ 
++ should match:
++ 
++     perl<!-- capitalize? --> is terrific
++     perl is<? AUDIO heavenly chorus ?> terrific
++     Perl<fnote isbn="1-56592-149-6" /> is terrific
++     perl is<emph>not so</emph> terrific
++     perl <quot>is terrific</quot>, commented Bray
++     <!ENTITY adjective "terrific">perl is &adjective;!
++ 
++ The last one requires parsing of XML before it can be made to match.
++ Other than that, though, simply ignoring all of the metadata is
++ sufficient to match correctly on lines 1, 2, 3, and 5 and to
++ properly not match on line 4.
++ 
++ So, it looks like my memory is faulty: I had thought that there was
++ at least one example that had text between a pair of <xx> and </xx> tags
++ where the meaning of xx was such that the text should *not* be
++ considered for matching purposes.  Unless the third line could be
++ written as:
++ 
++     Perl<footnote> Programming Perl, Wall et al., isbn="1-56592-149-6"
++     </footnote> is terrific


Perl<object src = "excl.gif">!</object> is terrific.
Perl is <img src = "not.gif" alt = "NOT"> terrific.
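
For anyone who wants to try this, here is a rough sketch of the "simply
ignore all of the metadata" approach, run over the six quoted lines and the
two above. The strip_markup helper is hypothetical and is not an XML parser:
it assumes that deleting anything in angle brackets and collapsing runs of
whitespace is what "ignoring the metadata" means.

```perl
use strict;
use warnings;

# Hypothetical helper, for illustration only: a crude tag stripper,
# not an XML parser. Text between an open and a close tag is kept.
sub strip_markup {
    my ($line) = @_;
    $line =~ s/<!--.*?-->//gs;   # comments
    $line =~ s/<[^>]*>//g;       # tags, PIs, declarations
    $line =~ s/\s+/ /g;          # assumption: collapse whitespace runs
    return $line;
}

my @lines = (
    'perl<!-- capitalize? --> is terrific',
    'perl is<? AUDIO heavenly chorus ?> terrific',
    'Perl<fnote isbn="1-56592-149-6" /> is terrific',
    'perl is<emph>not so</emph> terrific',
    'perl <quot>is terrific</quot>, commented Bray',
    '<!ENTITY adjective "terrific">perl is &adjective;!',
    'Perl<object src = "excl.gif">!</object> is terrific.',
    'Perl is <img src = "not.gif" alt = "NOT"> terrific.',
);

# 1 if the stripped line matches the pattern, 0 otherwise.
my @matched = map { strip_markup($_) =~ /Perl is terrific/i ? 1 : 0 } @lines;

for my $i (0 .. $#lines) {
    printf "%d: %s\n", $i + 1, $matched[$i] ? 'match' : 'no match';
}
```

Under those assumptions, lines 1, 2, 3, and 5 match and lines 4 and 6 do not,
as described in the quoted text. Of the two lines above, the <object> one
stops matching because the "!" between the tags is kept, and the <img> one
matches because the alt text is thrown away with the tag.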



Abigail