[Xapian-discuss] Text extractor

Sebastjan Trepca trepca at gmail.com
Tue Aug 9 21:29:44 BST 2005


Hi!

I was wondering if you have any tips about extracting postings/terms
from an article. Right now I have this lame extractor which just just
splits the article with a space into terms and adds them to a
document, but of course terms like "blah," can be problematic.
Well, if that's even the right way to do this :) 

I'm working with Python bindings,btw.

Thanks, 
Sebastjan



More information about the Xapian-discuss mailing list