[Xapian-discuss] auto stopwords

Sam Liddicott sam at liddicott.com
Wed Dec 15 09:43:55 GMT 2004


As well as within-document-frequency and within-index-frequency is there 
any benefit in keeping the not-in-document-frequency, or the number of 
documents that do NOT contain a given term?

It would provider an at-a-glance view of how many documents a term might 
select, it could be good for auto-stopword selection by showing how 
useless a term is as a document selector.

Just in idea

Sam



More information about the Xapian-discuss mailing list