Re: If VARIABLE = (this|orthis|orthat)

At 10:26 2001-11-29 -0600, David W. Tamkin wrote:

Sean suggested to Timothy,

| * REPLYTO ?? 
(\<|^)(joe(_at_)aol\(_dot_)com|ed(_at_)msn\(_dot_)com|john(_at_)yahoo\(_dot_)com)(\>|$)

\< and \> already match newlines, so alternating them with ^ and $ is
unnecessary.

While it applies to the trailing newline, it DOES NOT with the leadinganchor, since there is no _newline_ present at that end of the string(you're welcome to make a test script to demonstrate this toyourself). Since formail typically extracts a space in front of thereturned header, the WORD BREAK matches (on a space, not a newline). Ifyou've extracted the header as a raw address without the leading space(perhaps because you plan to do something else with it), say, like follows:


SENDER=`formail -b -xFrom:`

# Strip leading whitespace from the sender
:0
* SENDER ?? ^[  ]*\/[^  ].*
{
        SENDER=$MATCH
}

You will need the BOL anchor in the rule to ensure you're anchoring in theevent that the match text is at the beginning of the string. FTR, a zeroor more (or alternatley or'ing with a null string) defeats the purpose,since if there IS something immediatley preceeding the text, it'll still match.

However, since formail -r will never extract more than one
return address, Holger's recommendation to use ^^ at each end will work, and
since ^^ will not match a hyphen or a period, it's preferable.

This holds true if you wish to use the regexp specifically for the envelopeaddress and the envelope address ONLY. As I believe I explained, otheraddresses (if you extract the "From:" for instance) may have additionalcrud around the address.

I *DID* point out that rolling out the \< regexps and adding additionalcharacters to the exclusion would improve the matching, I just didn'texpand that within the example myself, leaving it as an excercise for thereader.


That would result in a rule similar to:

:0:
* SENDER ?? (^|[^-a-zA-Z0-9_.])($useraddrsexpression)($|[^-a-zA-Z0-9_.])
test.match

(so shoot me if I think continuing to include the EOL explicitly in theregexp simply makes it clearer)

The astute reader will recognize that the suggestion expansion of the \<and \> macros with the addition of '.' and '-' characters makes them theSAME as the subexpression used in the ^TO_ macro. Coincidence?

This for instance will ensure that a firstname-lastname(_at_)domain orfirstname(_dot_)lastname(_at_)domain doesn't match on lastname(_at_)domain, and thatuser(_at_)domain(_dot_)com(_dot_)net or user(_at_)domain(_dot_)net-com(_dot_)com doesn't match when you'relooking for user(_at_)domain(_dot_)com

My apologies if my example wasn't optimized for the limitations of theoriginal expression - my intent was to offer a rule which could be usedsuccessfully on a wider variety of input data.


---
 Sean B. Straw / Professional Software Engineering

 Procmail disclaimer: <http://www.professional.org/procmail/disclaimer.html>
 Please DO NOT carbon me on list replies.  I'll get my copy from the list.

_______________________________________________
procmail mailing list
procmail(_at_)lists(_dot_)RWTH-Aachen(_dot_)DE
http://MailMan.RWTH-Aachen.DE/mailman/listinfo/procmail

<Prev in Thread]	Current Thread	[Next in Thread>
If VARIABLE = (this\|orthis\|orthat), Timothy J. Luoma Re: If VARIABLE = (this\|orthis\|orthat), Professional Software Engineering Re: If VARIABLE = (this\|orthis\|orthat), Holger Wahlen Re: If VARIABLE = (this\|orthis\|orthat), David W. Tamkin Message not available Re: If VARIABLE = (this\|orthis\|orthat), Professional Software Engineering <= Re: If VARIABLE = (this\|orthis\|orthat), Professional Software Engineering Re: If VARIABLE = (this\|orthis\|orthat), David W. Tamkin Message not available Re: If VARIABLE = (this\|orthis\|orthat), Professional Software Engineering Re: If VARIABLE = (this\|orthis\|orthat), David W. Tamkin Re: If VARIABLE = (this\|orthis\|orthat), Professional Software Engineering Re: If VARIABLE = (this\|orthis\|orthat), David W. Tamkin Re: If VARIABLE = (this\|orthis\|orthat) --2, Tim Luoma