[Xapian-discuss] Search::Xapian add_database'd search results are odd?

Eric Parusel eparusel at creativens.com
Tue Dec 21 21:32:13 GMT 2004


Sorry if this is the wrong forum to discuss Search::Xapian issues --
this just seems like the best place.

Anyway, I've been testing $db->add_database() when searching, and the
docids I get back look incorrect, almost as though they're double what
they should be (numerically).

The docids that exist should be between roughly 950,000 and 1,000,000,
not around 1,900,000.

$xapiandirbase . '-11'
and
$xapiandirbase . '-10' both exist.


quick example:
===================================================================
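use Search::Xapian qw(:ops);    # exports OP_AND

# $xapiandirbase is the common path prefix of both databases.
# The die() has to sit outside the new() call: inside it, '-11' || die(...)
# is always true and the die() can never run.
my $db = Search::Xapian::Database->new($xapiandirbase . '-11')
    or die("Error.\n");
my $db10 = Search::Xapian::Database->new($xapiandirbase . '-10')
    or die("Error.\n");
$db->add_database($db10);    # no "my" here: $db is already declared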

my $query = Search::Xapian::Query->new(OP_AND, 'word', 'word2');
print "Query: " . $query . "\n";
my $enq = $db->enquire($query);

my @matches = $enq->matches(0,1000000);
foreach my $match ( @matches ) {
    print $match->get_docid . ',';
}
===================================================================


If I don't use add_database (or if add_database is wrapped in eval and
fails because I point it at a xapian db that doesn't exist) and query
either xapian db "-10" or "-11" on its own, I get docids in
approximately the 900000-1000000 range.

If I use add_database() in either order (-10 then -11, or vice versa),
I get ids that are seemingly doubled.

-10 and -11 have unique marker terms in them (M10 and M11 respectively),
so if I use one of the markers as a keyword, I get the same number of
results whether I add the 2nd database or not. Only the docids change:
they come back approximately doubled.
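
To put a number on "approximately doubled", here is a minimal sketch in
plain Perl (no Xapian needed). The ids are copied by hand from the two
attached runs, and pairing them by numeric order is an assumption, not
something the search itself reports. If the ids were exactly doubled,
every offset would be 0.

```perl
use strict;
use warnings;

# Ids copied by hand from the attached output (assumed to refer to the
# same five documents; the pairing by numeric order is a guess).
my @single   = (975786, 975788, 976495, 976496, 976603);       # no add_database
my @combined = (1951571, 1951575, 1952989, 1952991, 1953205);  # with add_database

# Offset of each combined id from exactly twice the single-database id.
my @offsets = map { $combined[$_] - 2 * $single[$_] } 0 .. $#single;

print join(',', @offsets), "\n";
```

For these five pairs the offset is the same every time, which suggests a
fixed remapping of the ids rather than corrupted results.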

I'm running Search::Xapian 0.8.4.0 and xapian 0.8.4:
# rpm -qa | grep xapian
xapian-core-0.8.4-1
xapian-core-libs-0.8.4-1
xapian-core-debuginfo-0.8.4-1
xapian-core-devel-0.8.4-1

Thanks for any help you can offer,
Eric
-------------- next part --------------
xapian-read-test.pl in original form, reading domain.com-2004-11, no add_database:

# ./xapian-read-test.pl /data1/xapian/domain.com-2004 M11 warren shannon
Query: Xapian::Query((M11 AND warren AND shannon))
Parsing query 'Xapian::Query((M11 AND warren AND shannon))'
611 results found
{975786,975788,976495,976496,976603,976604,976605,976653,976654,976655,976656,977247,978602,978603,978604,982193,982194,982195,982196,982237,982883,983505,983521,983522,983538,983539,983540,984358,984366,984367,984768,984769,984782,984924,987743,988632,988633,988634,988637,988638,988639,990152,990864,990865,990866,993950,993959,996317,996318,996319,996429,996430,996431,996503,996504,996505,996594,996595,996596,996981,1000204,1000205,1000215,1000286,1000287,1000288,1000340,1000382,1000433,1000434,1000435,1000867,1000868,1000869,977244,977245,977246,990160,975793,982927,982928,982929,994283,994284,994285,983865,983866,983867,988662,988663,988664,983762,983763,983764,977138,994079,977164,977165,977166,977167,984994,984995,984996,1001216,1001217,1001218,996426,984305,984306,984307,984352,984353,984354,996397,1000197,982933,982934,982935,1000211,984361,984362,984363,978573,989961,989962,989963,989322,994093,997031,984341,984342,983550,983551,983552,983553,989326,989327,989328,978580,983565,984826,984827,984828,994825,994826,994827,984990,984991,984992,1001031,983883,983884,983885,997075,997076,997077,984372,984373,984374,978618,996955,996956,996957,975801,975932,978553,978554,978555,982512,984902,984903,1000267,1000268,1000269,976616,976617,983819,983820,983821,989511,989512,989513,996964,996965,996966,984315,984843,984844,984845,987788,987789,987790,989466,989467,989468,977965,1000377,978000,978001,978002,978715,978716,978717,978718,978719,978720,978721,978722,978723,984864,984865,984866,977925,982923,982924,982925,995436,995437,995438,988400,988401,988402,990791,990792,976607,976608,976609,983004,995442,996973,996974,996975,984927,984928,984931,991434,991435,991436,984926,989351,989352,989353,983104,984929,984919,977517,978703,978704,978705,988537,988538,988539,993738,993739,984870,984871,984872,988783,988784,988785,989340,989341,989342,982930,982931,982932,984950,984951,984471,996214,996215,996216,996223,996224,996225,996219,996220,996221,996279,996280,996281,993758,9
93759,993760,993955,993956,993957,977858,977859,977860,994206,990079,990080,990081,990082,1000912,984887,984888,984889,1000929,983298,984487,984488,984489,989313,989314,989315,977864,977865,977866,1001133,975809,975810,975811,984777,984778,984779,978812,996978,975830,977936,995465,975828,978003,978004,978005,978817,988677,978825,973971,988680,975779,993839,993840,993841,983027,983028,983029,983030,977872,993860,993862,978828,990616,975822,978031,978874,978875,978876,978837,978838,978839,987814,987815,987816,988682,977206,983609,988689,988690,988691,977526,1000880,1000881,1000882,978801,983617,983618,983619,983620,990166,990167,990168,978808,978809,978810,987874,987875,987876,990170,990171,990172,978822,978823,978824,983640,990656,975974,983592,983593,983594,983595,983683,983684,983685,1000233,1000234,1000235,995500,995501,995502,995503,988613,988614,988615,996038,978831,978832,978833,987725,987726,990173,990174,990175,984878,984879,984880,984881,999531,988081,988082,988083,975829,989589,989590,989591,975676,975677,975678,975790,975791,975792,990176,996688,976048,976049,976050,989443,989444,989445,978853,987745,987746,987747,1000221,1000222,1000223,982164,982165,982166,983622,983623,983624,984144,1000650,985013,985014,985015,990408,995177,998011,984632,985018,985019,985020,982167,975883,991023,979094,985021,985022,985023,982187,982188,982189,978305,988266,982213,982214,982215,978007,978008,978009,1001188,1001189,1001190,978527,978528,978529,994592,994593,994594,990562,985184,986102,978532,978533,978534,988549,978624,978625,978626,988563,982299,982300,982301,987888,987889,989400,989401,989402,994539,994540,994541,1000225,1000226,1000227,988672,988673,988674,991809,991810,991811,976096,976097,976098,976103,976104,976105,976107,976108,976109,984382,984383,984384,975910,975911,975912,1000900,975918,975919,975920,975938,975939,975940,977326,990031,990032,990033,1001446,987948,990083,990084,990085,990182,990183,990184,990189,990190,990191,990199,990200,990201,1001255,99020
7,990208,990209,990090,990091,990092,990194,990195,990196,990692,990216,983571,983572,983573,1000361,1000362,976759,976760,976761,993181,995821,995822,995824,975453,987530,981892,995888,995889,995890,996415,996416,996417,978841,978842,978843,996442,996443,996444,978863,978864,978865,995893,995894,995895,995903,995904,995905}
Elapsed time: 0.0667 seconds.


# vi xapian-read-test.pl
xapian-read-test.pl modified to add_database domain.com-2004-10:

# ./xapian-read-test.pl /data1/xapian/domain.com-2004 M11 warren shannon
Query: Xapian::Query((M11 AND warren AND shannon))
Parsing query 'Xapian::Query((M11 AND warren AND shannon))'
611 results found
{1951571,1951575,1951585,1952989,1952991,1953205,1953207,1953209,1953305,1953307,1953309,1953311,1954487,1954489,1954491,1954493,1957203,1957205,1957207,1964385,1964387,1964389,1964391,1964473,1965765,1965853,1965855,1965857,1967009,1967041,1967043,1967075,1967077,1967079,1968715,1968731,1968733,1969535,1969537,1969563,1969847,1975485,1977263,1977265,1977267,1977273,1977275,1977277,1980303,1980319,1981727,1981729,1981731,1987899,1987917,1992633,1992635,1992637,1992857,1992859,1992861,1993005,1993007,1993009,1993187,1993189,1993191,1993961,2000407,2000409,2000429,2000571,2000573,2000575,2000679,2000763,2000865,2000867,2000869,2001733,2001735,2001737,1988565,1988567,1988569,1967729,1967731,1967733,1977323,1977325,1977327,1967523,1967525,1967527,1954275,1988157,1954327,1954329,1954331,1954333,1969987,1969989,1969991,2002431,2002433,2002435,1992851,1968609,1968611,1968613,1968703,1968705,1968707,1992793,2000393,1965865,1965867,1965869,2000421,1968721,1968723,1968725,1957145,1979921,1979923,1979925,1978643,1988185,1994061,1968681,1968683,1967099,1967101,1967103,1967105,1978651,1978653,1978655,1957159,1967129,1969651,1969653,1969655,1989649,1989651,1989653,1969979,1969981,1969983,2002061,1967765,1967767,1967769,1994149,1994151,1994153,1968743,1968745,1968747,1957235,1993909,1993911,1993913,1951601,1951863,1957105,1957107,1957109,1965023,1969803,1969805,2000533,2000535,2000537,1953231,1953233,1967637,1967639,1967641,1979021,1979023,1979025,1993927,1993929,1993931,1968629,1969685,1969687,1969689,1975575,1975577,1975579,1978931,1978933,1978935,1955929,2000753,1955999,1956001,1956003,1957429,1957431,1957433,1957435,1957437,1957439,1957441,1957443,1957445,1969727,1969729,1969731,1955849,1965845,1965847,1965849,1990871,1990873,1990875,1976799,1976801,1976803,1981581,1981583,1953213,1953215,1953217,1966007,1990883,1993945,1993947,1993949,1969853,1969855,1969861,1982867,1982869,1982871,1969851,1978701,1978703,1978705,1966207,1969857,1969837,1955033,1957405,1957407,1957409,1977073
,1977075,1977077,1987475,1987477,1969739,1969741,1969743,1977565,1977567,1977569,1978679,1978681,1978683,1965859,1965861,1965863,1969899,1969901,1968941,1992427,1992429,1992431,1992445,1992447,1992449,1992437,1992439,1992441,1992557,1992559,1992561,1987515,1987517,1987519,1987909,1987911,1987913,1955715,1955717,1955719,1988411,1980157,1980159,1980161,1980163,2001823,1969773,1969775,1969777,2001857,1966595,1968973,1968975,1968977,1978625,1978627,1978629,1955727,1955729,1955731,2002265,1951617,1951619,1951621,1969553,1969555,1969557,1957623,1993955,1951659,1955871,1990929,1951655,1956005,1956007,1956009,1957633,1977353,1957649,1947941,1977359,1951557,1987677,1987679,1987681,1966053,1966055,1966057,1966059,1955743,1987719,1987723,1957655,1981231,1951643,1956061,1957747,1957749,1957751,1957673,1957675,1957677,1975627,1975629,1975631,1977363,1954411,1967217,1977377,1977379,1977381,1955051,2001759,2001761,2001763,1957601,1967233,1967235,1967237,1967239,1980331,1980333,1980335,1957615,1957617,1957619,1975747,1975749,1975751,1980339,1980341,1980343,1957643,1957645,1957647,1967279,1981311,1951947,1967183,1967185,1967187,1967189,1967365,1967367,1967369,2000465,2000467,2000469,1990999,1991001,1991003,1991005,1977225,1977227,1977229,1992075,1957661,1957663,1957665,1975449,1975451,1980345,1980347,1980349,1969755,1969757,1969759,1969761,1999061,1976161,1976163,1976165,1951657,1979177,1979179,1979181,1951351,1951353,1951355,1951579,1951581,1951583,1980351,1993375,1952095,1952097,1952099,1978885,1978887,1978889,1957705,1975489,1975491,1975493,2000441,2000443,2000445,1964327,1964329,1964331,1967243,1967245,1967247,1968287,2001299,1970025,1970027,1970029,1980815,1990353,1996021,1969263,1970035,1970037,1970039,1964333,1951765,1982045,1958187,1970041,1970043,1970045,1964373,1964375,1964377,1956609,1976531,1964425,1964427,1964429,1956013,1956015,1956017,2002375,2002377,2002379,1957053,1957055,1957057,1989183,1989185,1989187,1981123,1970367,1972203,1957063,1957065,1957067,1977097,1957247
,1957249,1957251,1977125,1964597,1964599,1964601,1975775,1975777,1978799,1978801,1978803,1989077,1989079,1989081,2000449,2000451,2000453,1977343,1977345,1977347,1983617,1983619,1983621,1952191,1952193,1952195,1952205,1952207,1952209,1952213,1952215,1952217,1968763,1968765,1968767,1951819,1951821,1951823,2001799,1951835,1951837,1951839,1951875,1951877,1951879,1954651,1980061,1980063,1980065,2002891,1975895,1980165,1980167,1980169,1980363,1980365,1980367,1980377,1980379,1980381,1980397,1980399,1980401,2002509,1980413,1980415,1980417,1980179,1980181,1980183,1980387,1980389,1980391,1981383,1980431,1967141,1967143,1967145,2000721,2000723,1953517,1953519,1953521,1986361,1991641,1991643,1991647,1950905,1975059,1963783,1991775,1991777,1991779,1992829,1992831,1992833,1957681,1957683,1957685,1992883,1992885,1992887,1957725,1957727,1957729,1991785,1991787,1991789,1991805,1991807,1991809}
Elapsed time: 0.0847 seconds.
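
The ids in the second run look consistent with docids being interleaved
across the combined databases rather than corrupted. The sketch below
assumes that scheme: with N databases, the database at 0-based position
i stores its local docid d under the combined docid (d - 1) * N + i + 1.
The helper name split_docid is hypothetical, not part of Search::Xapian,
and the three sample ids are taken from the output above.

```perl
use strict;
use warnings;

# Assumption: combined docid = (local docid - 1) * N + db position + 1,
# where the db position is 0 for the database opened first.
sub split_docid {
    my ($combined, $n_dbs) = @_;
    my $db_index = ($combined - 1) % $n_dbs;             # which database
    my $local    = int(($combined - 1) / $n_dbs) + 1;    # docid inside it
    return ($db_index, $local);
}

for my $id (1951571, 1951575, 1952989) {
    my ($db_index, $local) = split_docid($id, 2);
    print "$id -> database $db_index, docid $local\n";
}
```

Under that assumption the three ids map back to 975786, 975788 and
976495 in the first database opened (-11 here), all of which appear in
the run without add_database.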

