[Xapian-discuss] [ NUMBER OF SAMPLE ]
Boris Meyer
boris.meyer at rom.fr
Wed Jul 21 09:51:15 BST 2004
Hello Richard,
Richard Boulton wrote:
> Boris Meyer wrote:
>
>> I'm currently testing Xapian with Omega, all is running perfectly.
>>
>> But most of the time the "sample" returned with a result (extracted
>> from the document during indexing) doesn't match the user's query.
>> There are many positive results, but no highlighting, because the
>> sample contains no match.
>>
>> Is it possible to extract a larger sample during the indexing process?
>
>
> I assume you're using omindex.
Absolutely.
> There is currently no command line option which allows you to change
> the sample size, but it is trivial to tweak the source so that a larger
> sample is produced. Look at around line 423 of omindex.cc, and change
> the number 300 in the lines:
I had looked at exactly that line in omindex.cc and thought the same; I
will try it that way.
> if (sample.empty()) {
>     sample = truncate_to_word(dump, 300);
> } else {
>     sample = truncate_to_word(sample, 300);
> }
>
> to a larger value. This number is the maximum size in bytes of the
> sample produced.
I'll recompile with a larger value, but ideally I would like a fully
reliable way to obtain a meaningful sample. For that I need to find out
how to get the number of characters in a document during indexing.
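As a rough illustration of a configurable limit in place of the hard-coded 300, here is a minimal sketch of word-boundary truncation. The function truncate_at_word and its behaviour are my own assumptions for the example; this is not the truncate_to_word from omindex.cc, whose exact behaviour I have not checked.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper (NOT omindex's truncate_to_word): return at most
// max_bytes bytes of text, backing up to the last space so that the
// sample does not end in the middle of a word.
std::string truncate_at_word(const std::string& text, std::size_t max_bytes) {
    // Short documents are returned whole.
    if (text.size() <= max_bytes) return text;

    // Find the last space at or before position max_bytes. A space exactly
    // at max_bytes means the cut already falls on a word boundary.
    std::size_t cut = text.rfind(' ', max_bytes);

    // No space found: the text starts with one very long word, so fall
    // back to a hard cut at max_bytes.
    if (cut == std::string::npos) cut = max_bytes;

    return text.substr(0, cut);
}
```

With something like this, the limit becomes an ordinary variable. Where its value should come from (a new command line option, or the document length capped at some maximum) is still an open question for me.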
> Hope this helps,
Many thanks for your answer and suggestion. I will post the results.
--
Regards, Boris.
+---------------------------+----------------------+
| Boris Meyer | Tel : 04 93 92 88 88 |
| Administration / Internet | Fax : 04 93 92 18 93 |
| Developpement | Web : http://rom.fr |
+---------------------------+----------------------+
| 19, bd Carabacel | - - - - - x - - - - |
| 06000 Nice | - - - - - x - - - - |
+---------------------------+----------------------+
| boris.meyer at rom.fr | http://www.rom.fr |
+---------------------------+----------------------+