[Xapian-discuss] xapian performance
Fernando Nemec
fernando.nemec at folha.com.br
Wed Nov 22 20:55:21 GMT 2006
Hi Olly,
> Could you compare the speed of phrase searches with this patch:
Certainly. I use Query::get_description for each query I did along the
time to get the result set. I just made three different queries: one
term, 2 words phrase and 6 words phrase.
Do you think its better to have a large set of queries or this will do
fine?
This was made *without* experimental phrase optimization patch:
<!--Xapian::Query(lula)-->
0m0.412s
<!--Xapian::Query((presidente PHRASE 2 lula))-->
1m5.062s
<!--Xapian::Query((governo PHRASE 6 do PHRASE 6 estado PHRASE 6 de PHRASE 6 sao PHRASE 6 paulo))-->
1m14.193s
That was made *with* phrase optimization patch:
<!--Xapian::Query(lula)-->
0m0.379s
<!--Xapian::Query((presidente PHRASE 2 lula))-->
0m58.514s
<!--Xapian::Query((governo PHRASE 6 do PHRASE 6 estado PHRASE 6 de PHRASE 6 sao PHRASE 6 paulo))-->
1m2.503s
Thanks for you help Olly. If there's anything else I can do to help to
fix this issue, please let me know.
Nemec
Wednesday, November 22, 2006, 5:19:45 PM, you wrote:
> On Tue, Nov 21, 2006 at 07:16:52PM -0200, Fernando Nemec wrote:
>> After so many patches I opt to get a fresh new source copy from svn.
>> As far as I see you committed almost all patches you produced in the
>> last days.
> So far I've only committed the changes to use "my_fls" instead of the
> floating point log calculation. The changes to open positionlists
> lazily aren't in yet (I was waiting to check that the latest patch
> fixed the slowdown for 2 term phrases).
>> Sadly I didn't figure out any new improvement. I made a simple list
>> with a variety of queries and all of them return in more or less the
>> same time (a few tens of seconds).
> The "my_fls" changes should reduce CPU use, so you won't see much
> improvement if you're heavily I/O bound (which you must be if a search
> takes tens of seconds).
>> Is there any information I can supply to you to help to find what's
>> going on phrase searches?
> Could you compare the speed of phrase searches with this patch:
>> > http://www.oligarchy.co.uk/xapian/patches/xapian-experimental-phrase-optimisation-v2.patch
> with not using it (either on SVN trunk or 0.9.9). Ideally it should
> speed up phrases with 3 or more terms, but should be just as fast for
> 2 term phrases.
> I'm going to look at creating a simple patch to count the number of
> blocks read from each table during the query, which should help to get a
> handle on how much I/O we're actually doing in an easily repeatable way.
> Cheers,
> Olly
--
[]s
Fernando Nemec
fernando.nemec at folha.com.br
http://www.folha.com.br/
More information about the Xapian-discuss
mailing list