In article
<Pine(_dot_)LNX(_dot_)4(_dot_)21(_dot_)0105041552380(_dot_)31987-100000(_at_)bigears(_dot_)ncst(_dot_)ernet(_dot_)in>
philip(_at_)konark(_dot_)ncst(_dot_)ernet(_dot_)in writes:
> However, looking through the source, I see no evidence of this
> happening. Could anyone provide pointers on where this would be done?
There is a wordcount_sub() function in mknmz:
sub wordcount_sub ($$\%) {
    my ($text, $weight, $word_count) = @_;
    # Count frequencies of words in the current document.
    # Handle symbols as follows.
    #
    # tcp/ip     -> tcp/ip, tcp, ip
    # (tcp/ip)   -> (tcp/ip), tcp/ip, tcp, ip
    # ((tcp/ip)) -> ((tcp/ip)), (tcp/ip), tcp
    #
    # Don't do processing for nested symbols.
    # NOTE: When -K is specified, all symbols are already removed.
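
To make the first two rules in that comment concrete, here is a rough standalone sketch. It is not the mknmz code: expand_symbols(), count_words() and the $SYM character class are names and choices I made up for illustration, and the hash is passed as an explicit reference instead of through the ($$\%) prototype. For a nested token such as ((tcp/ip)) it simply peels one layer and stops, so it does not reproduce the third line of the comment.

```perl
use strict;
use warnings;

# Assumption for this sketch: "symbol" means any ASCII punctuation character.
my $SYM = qr/[\x21-\x2f\x3a-\x40\x5b-\x60\x7b-\x7e]/;

# Hypothetical helper (not in mknmz): list the index terms for one token.
sub expand_symbols {
    my ($word) = @_;
    my @terms = ($word);

    # Peel one layer of surrounding symbols: "(tcp/ip)" -> "tcp/ip".
    if ($word =~ /^$SYM(.+)$SYM$/) {
        $word = $1;
        push @terms, $word;
        # Simplification: if the inner part is still wrapped, stop here.
        return @terms if $word =~ /^$SYM/;
    }

    # Split on inner symbols: "tcp/ip" -> "tcp", "ip".
    my @parts = grep { length } split /$SYM+/, $word;
    push @terms, @parts if @parts > 1;
    return @terms;
}

# Add $weight to the count of every term produced from $text.
sub count_words {
    my ($text, $weight, $word_count) = @_;
    for my $token (split ' ', $text) {
        $word_count->{$_} += $weight for expand_symbols($token);
    }
}

my %count;
count_words('tcp/ip (tcp/ip)', 1, \%count);
print "$_ => $count{$_}\n" for sort keys %count;
```

Because every expanded term gets the same weight added, a query for "tcp" alone would still hit a document that only contains "(tcp/ip)".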
--
NOKUBI Takatsugu
E-mail: knok(_at_)daionet(_dot_)gr(_dot_)jp
knok(_at_)namazu(_dot_)org / knok(_at_)debian(_dot_)org