[Xapian-discuss] using Xapian as backend for google
Felix Antonius Wilhelm Ostmann
ostmann at websuche.de
Thu Dec 7 09:02:03 GMT 2006
We want to build the next google ... ok, not so big ;) currently we
are testing a liddle bit with xapian and it is amazing! respect!
know i must figure out how we can use xapian in the best way. generating
many flint-indexes so we can generate it fast on many machines and merge
it. the frontend will be a webserver with apache and mod_perl ... is it
the best way to run xapian-tcpsrv on other maschines as backend? i think
so ... or is another webserver with mod_perl and perl-bindings the ideal
solution? My question: can someone tell me something about building the
backend for the next google? :) what is important? Raid0 VS Raid1, SCSI
VS SATA, many smaller backends VS some big backends? What would be the
bottleneck (i think DISC I/O)? Is the xapian-tcpsrv the best way? Can
anyone tell me something about such an project?
One other questions: "similar results from one domain".
How can we arrive that goal? The MatchDecider watch over the values with
the domainname and accept only two documents from one domain? Is that
the way?
Thanks for your time :)
And sorry for my poor englisch :(
MfG
Felix Antonius Wilhelm Ostmann
More information about the Xapian-discuss
mailing list