mhonarc-users

Re: How to avoid auto-linking in non-ascii URLs

2006-03-24 19:03:01
In <200603242013(_dot_)k2OKDq0V003723(_at_)gator(_dot_)earlhood(_dot_)com>,
 earl(_at_)earlhood(_dot_)com wrote:
If so, I think that MHonARC, even in default settings, should treat
these characters as invalid URL characters in URL linking code.

The URL linking code is a single regex operation.

Entity reference resolution would need to be done, and I am not
sure it is worth the effort for url linking.

Patches are welcome.

Okay, I see.

The following is another simple workaround for URL linking in default
setting.

Thanks,
Masao


diff -rub MHonArc-2.6.15.orig/lib/mhtxtplain.pl MHonArc-2.6.15/lib/mhtxtplain.pl
--- MHonArc-2.6.15.orig/lib/mhtxtplain.pl       2005-06-20 13:25:18.000000000 
+0900
+++ MHonArc-2.6.15/lib/mhtxtplain.pl    2006-03-25 09:48:19.861240096 +0900
@@ -43,7 +43,7 @@
                         q/[^\s\(\)\|<>"'\0-\037]+/ .
                         q/[^\.?!;,"'\|\[\]\(\)\s<>\0-\037]/;
 $HUrlExp        = $readmail::UrlRxStr .
-                        q/(?:&(?![gl]t;)|[^\s\(\)\|<>"'\&\0-\037])+/ .
+                        
q/(?:&(?!(?:[gl]t|\#x[0-9A-F]+);)|[^\s\(\)\|<>"'\&\0-\037])+/ .
                         q/[^\.?!;,"'\|\[\]\(\)\s<>\&\0-\037]/;
 $QuoteChars    = '[>]';
 $HQuoteChars   = '&gt;';

<Prev in Thread] Current Thread [Next in Thread>