Re: Symbol matching

In article 
<Pine(_dot_)LNX(_dot_)4(_dot_)21(_dot_)0105041552380(_dot_)31987-100000(_at_)bigears(_dot_)ncst(_dot_)ernet(_dot_)in>
philip(_at_)konark(_dot_)ncst(_dot_)ernet(_dot_)in writes:

However, looking through the source, I see no evidence of this
happening.  Could anyone provide pointers on where this would be done?


There is wordcount_sub() function in mknmz.

sub wordcount_sub ($$\%) {
    my ($text, $weight, $word_count) = @_;

    # Count frequencies of words in a current document.
    # Handle symbols as follows.
    #
    # tcp/ip      ->  tcp/ip,     tcp,      ip
    # (tcp/ip)    ->  (tcp/ip),   tcp/ip,   tcp, ip
    # ((tcpi/ip)) ->  ((tcp/ip)), (tcp/ip), tcp
    #
    # Don't do processing for nested symbols.
    # NOTE: When -K is specified, all symbols are already removed.

-- 
NOKUBI Takatsugu
E-mail: knok(_at_)daionet(_dot_)gr(_dot_)jp
        knok(_at_)namazu(_dot_)org / knok(_at_)debian(_dot_)org

<Prev in Thread]	Current Thread	[Next in Thread>
Symbol matching, Philip S Tellis RE: Symbol matching, DEVOS BASTIEN RE: Symbol matching, Philip S Tellis RE: Symbol matching, DEVOS BASTIEN RE: Symbol matching, Philip S Tellis Re: Symbol matching, NOKUBI Takatsugu <= Re: Symbol matching, Philip S Tellis Re: Symbol matching, NOKUBI Takatsugu Re: Symbol matching, Philip S Tellis Re: Symbol matching, NOKUBI Takatsugu Re: Symbol matching, Philip S Tellis Re: Symbol matching, NOKUBI Takatsugu Re: Symbol matching, Philip S Tellis

Previous by Date:	RE: Symbol matching, Philip S Tellis
Next by Date:	Re: Symbol matching, Philip S Tellis
Previous by Thread:	RE: Symbol matching, Philip S Tellis
Next by Thread:	Re: Symbol matching, Philip S Tellis
Indexes:	[Date] [Thread] [Top] [All Lists]