[Xapian-discuss] Grouping document paragraphs

Yannick Warnier ywarnier at beeznest.org
Tue Jan 15 00:04:38 GMT 2008


On Monday 14 January 2008 at 23:00 +0000, James Aylett wrote:
> On Mon, Jan 14, 2008 at 11:51:10PM +0100, Yannick Warnier wrote:
> 
> > I want to index documents, but split into paragraph types (title,
> > abstract, main content, etc) and then I want to retrieve every document
> > that contains a combination of "trains" AND "wheels" in any combination
> > of the paragraphs of a document, how could that work?
> 
> You could assign a value to each paragraph-as-Xapian::Document which
> identifies the source file, and then use that value number for
> Xapian::Enquire::set_collapse_key().

I didn't know that one, will investigate, thank you.
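
For reference, a minimal sketch of what collapsing on a value does, as I
understand it, modelled in plain Python rather than through the Xapian
bindings. The Hit record, the collapse_by_source() function and the sample
data are all hypothetical; in Xapian itself the value would be stored on each
paragraph document with Xapian::Document::add_value() and the collapsing
would be done by Xapian::Enquire::set_collapse_key().

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One matching paragraph (hypothetical stand-in for a Xapian match)."""
    source: str    # value identifying the source file: the collapse key
    kind: str      # paragraph type, e.g. "title" or "abstract"
    weight: float  # relevance weight of this paragraph


def collapse_by_source(hits):
    """Keep only the best-weighted hit per source file, best first."""
    best = {}
    for hit in sorted(hits, key=lambda h: h.weight, reverse=True):
        # The first hit seen for a source is its highest-weighted one.
        best.setdefault(hit.source, hit)
    return list(best.values())


# Made-up sample data: two paragraphs of one article, one of another.
hits = [
    Hit("article-1.txt", "title", 0.9),
    Hit("article-1.txt", "abstract", 0.4),
    Hit("article-2.txt", "body", 0.7),
]

for hit in collapse_by_source(hits):
    print(hit.source, hit.kind, hit.weight)
```

With this, each source file shows up at most once in the results, represented
by its best-matching paragraph.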

> Why do you want to index the paragraphs separately? Are you going to
> want to search for them separately in some other context?

Yes, I would like to be able to search in an article base, but only
inside titles, for example (or only on abstract sections).
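
To make the two kinds of search concrete, here is a rough sketch in plain
Python of the behaviour I am after. It is only a model of the intent, not
Xapian code; the paragraph records and the matching_sources() function are
made up for illustration.

```python
from collections import defaultdict

# Hypothetical paragraph records: (source file, paragraph type, terms).
paragraphs = [
    ("article-1.txt", "title", {"trains"}),
    ("article-1.txt", "body", {"wheels", "rails"}),
    ("article-2.txt", "title", {"trains", "wheels"}),
    ("article-3.txt", "body", {"trains"}),
]


def matching_sources(paragraphs, wanted, kinds=None):
    """Return the sources whose paragraphs together contain all wanted terms.

    If kinds is given, only paragraphs of those types are considered.
    """
    seen = defaultdict(set)
    for source, kind, terms in paragraphs:
        if kinds is None or kind in kinds:
            seen[source] |= terms
    return sorted(s for s, terms in seen.items() if wanted <= terms)


# Terms may be spread over any combination of paragraphs of a document.
print(matching_sources(paragraphs, {"trains", "wheels"}))
# Same query, but restricted to titles only.
print(matching_sources(paragraphs, {"trains", "wheels"}, kinds={"title"}))
```

The first call matches article-1.txt and article-2.txt, the second only
article-2.txt, since article-1.txt has "wheels" in its body but not in its
title.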

Thanks

Y.



