[Xapian-discuss] Xapian::Queryparser / Encoding Problem (Utf8)

Olly Betts olly at survex.com
Tue Aug 16 18:49:54 BST 2005


On Wed, Aug 10, 2005 at 04:41:41PM +0200, R. Mattes wrote:
> On Wed, 2005-08-10 at 15:29 +0100, Richard Boulton wrote:
> > The query parser itself shouldn't need too much work - you'll probably
> > need to look at the accent normalising code (see accentnormalisingitor.h
> > and symboltab.h).
> 
> Well, looks like this will be my next task on the stack ...

I've already done this - Gmane is using a patched version of the
QueryParser on utf-8 data (without any stemming).

As I've said before, anyone who wants the patch is welcome to it.  I
can't just apply it to SVN as is, though, because it would break things
for anyone using iso-8859-1 queries or stemming.  It also currently adds
a dependency on glib, which we probably don't want.
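
To illustrate why the two encodings can't simply share one byte-at-a-time
code path, here is a rough sketch in plain C++ (standard library only).
It is not the Gmane patch and not Xapian code: decode_utf8() and
count_code_points() are hypothetical names made up for this example, and
the decoder is deliberately simplistic (it does not reject overlong or
out-of-range sequences).

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Hypothetical helper (not part of Xapian): decode the UTF-8 sequence
// starting at s[pos], advance pos past it, and return the code point.
// A malformed or truncated sequence is consumed as one raw byte.
unsigned decode_utf8(const std::string& s, std::size_t& pos) {
    unsigned char c = static_cast<unsigned char>(s[pos++]);
    if (c < 0x80) return c;                       // plain ASCII
    int extra;
    unsigned cp;
    if ((c & 0xE0) == 0xC0)      { extra = 1; cp = c & 0x1F; }
    else if ((c & 0xF0) == 0xE0) { extra = 2; cp = c & 0x0F; }
    else if ((c & 0xF8) == 0xF0) { extra = 3; cp = c & 0x07; }
    else return c;                                // invalid lead byte
    if (pos + extra > s.size()) return c;         // truncated sequence
    for (int i = 0; i < extra; ++i) {
        unsigned char cc = static_cast<unsigned char>(s[pos + i]);
        if ((cc & 0xC0) != 0x80) return c;        // not a continuation byte
        cp = (cp << 6) | (cc & 0x3F);
    }
    pos += extra;
    return cp;
}

// Hypothetical helper: number of characters seen when s is read as UTF-8.
std::size_t count_code_points(const std::string& s) {
    std::size_t n = 0;
    for (std::size_t pos = 0; pos < s.size(); ++n) decode_utf8(s, pos);
    return n;
}

// Small self-check: e-acute is the single byte 0xE9 in iso-8859-1 but
// the two bytes 0xC3 0xA9 in utf-8.
void demo() {
    const std::string utf8 = "caf\xC3\xA9";
    assert(utf8.size() == 5);                 // five bytes ...
    assert(count_code_points(utf8) == 4);     // ... but four characters
}
```

Code that looks at one byte at a time and assumes iso-8859-1 would see
0xC3 and 0xA9 as two separate accented characters here, while code that
decodes utf-8 would misread a lone iso-8859-1 0xE9 as the start of a
longer sequence.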

Cheers,
    Olly


