Some time ago, there was some mail about WebGlimpse.
I am looking for a convenient search engine to index information
across a dozen different web sites.
I have already installed HtDig on a test Linux machine to learn
how to tune it. HtDig has quite a lot of installations, including
some well-known sites such as NASA and Mercedes-Benz.
I'd like to know whether any of you has experience with both
search engines, HtDig and WebGlimpse.
Which one has the lower CPU overhead for indexing?
Which one needs less disk space for its index?
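
If nobody has figures at hand, both numbers could be measured on a
test machine. Below is a rough Python sketch of how that might be done,
assuming a Unix-like system such as Linux. The function names are my
own, and the indexer command and index directory are placeholders to
be filled in by whoever runs it; nothing here is specific to HtDig or
WebGlimpse.

```python
import os
import subprocess


def dir_size_bytes(path):
    """Sum the sizes of all regular files below path (symlinks skipped)."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full = os.path.join(root, name)
            if os.path.isfile(full) and not os.path.islink(full):
                total += os.path.getsize(full)
    return total


def cpu_seconds(command):
    """Run command (a list of strings) and return the CPU seconds it used.

    The value is user + system time of child processes, read from
    os.times() before and after the run. On non-Unix systems these
    fields may always be zero.
    """
    before = os.times()
    subprocess.run(command, check=True)
    after = os.times()
    return ((after.children_user - before.children_user)
            + (after.children_system - before.children_system))


def measure(command, index_dir):
    """Return (cpu_seconds, index_size_in_bytes) for one indexing run."""
    used = cpu_seconds(command)
    return used, dir_size_bytes(index_dir)
```

To use it, call measure() once per engine with that engine's indexing
command line (as a list of strings) and the directory where it writes
its index, over the same set of pages, and compare the two results.
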
Some HtDig features (so far):
- customization of indexing depth
- list of URLs to include
- list of URLs to exclude
- all languages supported
- fine tuning of word weights
Thanks for the answers,