[Xapian-discuss] auto stopwords

Olly Betts olly at survex.com
Wed Dec 15 11:02:10 GMT 2004


On Wed, Dec 15, 2004 at 09:53:55AM +0000, James Aylett wrote:
> On Wed, Dec 15, 2004 at 09:43:55AM +0000, Sam Liddicott wrote:
> > As well as within-document-frequency and within-index-frequency is there 
> > any benefit in keeping the not-in-document-frequency, or the number of 
> > documents that do NOT contain a given term?
> 
> Isn't this Database::get_doccount() -  TermIterator::get_termfreq() ?
> 
> (Or Database::get_doccount() - Database::get_termfreq(tname) .)

Yes.  And even if you wanted to, you can't sanely store it precomputed
since every stored value will need to be updated every time a document
is added.

Cheers,
    Olly



More information about the Xapian-discuss mailing list