[Xapian-discuss] queryparser thinks ø is o

James Aylett james-xapian at tartarus.org
Tue Sep 13 08:22:55 BST 2005


On Tue, Sep 13, 2005 at 05:08:08AM +0100, Olly Betts wrote:

> The transliteration should also really be language dependent - in German
> ä -> ae, but that's not appropriate in Swedish I believe.  But
> language dependent normalisation is what the stemming algorithms do!  So
> I think this really should get folded into the stemming algorithms in
> languages where it makes sense (and languages where it doesn't wouldn't
> do anything).

Is this something we can fix up when we move to UTF-8 Snowball
stemmers? (At 1.0 or whenever.)

J

-- 
/--------------------------------------------------------------------------\
  James Aylett                                                  xapian.org
  james at tartarus.org                               uncertaintydivision.org



More information about the Xapian-discuss mailing list