[Xapian-discuss] [ NUMBER OF SAMPLE ]

Boris Meyer boris.meyer at rom.fr
Wed Jul 21 09:51:15 BST 2004


Hello Richard,

Richard Boulton wrote:

> Boris Meyer wrote:
> 
>> I'm currently testing Xapian with Omega, and everything is running
>> perfectly.
>>
>> But most of the time the "sample" returned with a result (the extract
>> taken from the document during indexing) doesn't contain the terms of
>> the user's query. There are many positive results, but no highlighting,
>> because nothing in the sample matches.
>>
>> Is it possible to extract a larger sample during the indexing process?
> 
> 
> I assume you're using omindex.

Absolutely.

> There is currently no command line option which allows you to change
> the sample size, but it is trivial to tweak the source so that a larger
> sample is produced.  Look at around line 423 of omindex.cc, and change
> the number 300 in the lines:

I had looked at exactly this line in omindex.cc and thought the same; I 
will try it this way.

>     if (sample.empty()) {
>         sample = truncate_to_word(dump, 300);
>     } else {
>         sample = truncate_to_word(sample, 300);
>     }
> 
> to a larger value.  This number is the maximum size in bytes of the 
> sample produced.
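
To make the role of the number 300 concrete, here is a minimal sketch of
what a word-boundary truncation helper could look like. It is not the real
truncate_to_word() from omindex.cc; the name truncate_at_word_sketch and
its exact behaviour are assumptions for illustration only.

```cpp
#include <cstddef>
#include <string>

// Hypothetical stand-in for omindex's truncate_to_word(); NOT the real code.
// Returns at most max_bytes bytes of text, cut back to the last whitespace
// so that the sample does not end in the middle of a word.
std::string truncate_at_word_sketch(const std::string& text,
                                    std::size_t max_bytes) {
    // Short texts are kept whole.
    if (text.size() <= max_bytes) return text;

    // Find the last whitespace at or before byte position max_bytes.
    std::size_t cut = text.find_last_of(" \t\n", max_bytes);
    if (cut == std::string::npos) {
        // No whitespace in range: fall back to a hard cut.
        // (This may split a multi-byte character in UTF-8 text.)
        return text.substr(0, max_bytes);
    }

    // Drop any run of whitespace just before the cut point.
    std::size_t end = text.find_last_not_of(" \t\n", cut);
    if (end == std::string::npos) return std::string();
    return text.substr(0, end + 1);
}
```

With this sketch, truncate_at_word_sketch("hello brave new world", 13)
gives "hello brave": the limit is an upper bound in bytes, and the result
is usually a little shorter because it is cut back to a whole word.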

I'll recompile with a larger value, but ideally I would like a 100% 
reliable way to obtain a meaningful sample. For that I need to find out 
how to get the number of characters in a document during the indexing 
process.
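
A rough idea of how the limit could be derived from the document itself,
assuming (as the quoted snippet suggests) that dump is a std::string
holding the extracted text. The helper name sample_limit_sketch and the
hard_cap parameter are hypothetical and do not exist in omindex.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>

// Hypothetical helper: pick the sample limit from the document itself.
// `dump` is assumed to be a std::string holding the extracted text;
// `hard_cap` is an assumed upper bound so stored samples stay bounded.
std::size_t sample_limit_sketch(const std::string& dump,
                                std::size_t hard_cap) {
    // std::string::size() is the length in bytes, not in characters,
    // which matches the "maximum size in bytes" meaning of the limit.
    return std::min(dump.size(), hard_cap);
}
```

Passing a very large hard_cap would keep the whole text as the sample, at
the cost of a bigger database, so some cap is probably still wanted.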

> Hope this helps,

Many thanks for your answer and suggestion. I will post the results I obtain.

-- 
Regards, Boris.
+---------------------------+----------------------+
| Boris Meyer               | Tel : 04 93 92 88 88 |
| Administration / Internet | Fax : 04 93 92 18 93 |
| Developpement             | Web : http://rom.fr  |
+---------------------------+----------------------+
| 19, bd Carabacel          | - - - - - x - - - -  |
| 06000 Nice                | - - - - - x - - - -  |
+---------------------------+----------------------+
| boris.meyer at rom.fr        | http://www.rom.fr    |
+---------------------------+----------------------+


