[Xapian-discuss] using Xapian as backend for google

Felix Antonius Wilhelm Ostmann ostmann at websuche.de
Thu Dec 7 09:02:03 GMT 2006


We want to build the next google ... ok, not so big  ;)  currently we 
are testing a liddle bit with xapian and it is amazing! respect!

know i must figure out how we can use xapian in the best way. generating 
many flint-indexes so we can generate it fast on many machines and merge 
it. the frontend will be a webserver with apache and mod_perl ... is it 
the best way to run xapian-tcpsrv on other maschines as backend? i think 
so ... or is another webserver with mod_perl and perl-bindings the ideal 
solution? My question: can someone tell me something about building the 
backend for the next google? :) what is important? Raid0 VS Raid1, SCSI 
VS SATA, many smaller backends VS some big backends? What would be the 
bottleneck (i think DISC I/O)? Is the xapian-tcpsrv the best way? Can 
anyone tell me something about such an project?

One other questions: "similar results from one domain".
How can we arrive that goal? The MatchDecider watch over the values with 
the domainname and accept only two documents from one domain? Is that 
the way?

Thanks for your time :)
And sorry for my poor englisch :(

MfG
Felix Antonius Wilhelm Ostmann




More information about the Xapian-discuss mailing list