[Xapian-discuss] UTF8 support plans (without stemming)
James Aylett
james-xapian at tartarus.org
Wed Jun 29 09:21:47 BST 2005
On Wed, Jun 29, 2005 at 04:19:57AM +0100, Olly Betts wrote:
> This is using a patched version of the QueryParser. Currently I'm using
> glib's unicode routines, but I wonder if we really want to add a
> dependency on glib when we only use a very tiny part of it.
>
> I already have C code for handling utf-8. I'm going to see what else is
> around for unicode versions of "isalpha" etc.
IBM's ICU is probably a better choice than glib. It also supplies a lot
of other useful Unicode-handling features, so it's a reasonable thing
for people to be depending on anyway if they're doing Xapian + Unicode
work.
<http://icu.sourceforge.net/>
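
To illustrate why a Unicode-aware "isalpha" matters when splitting text
into terms, here is a minimal Python sketch. It is not ICU and not the
patched QueryParser: Python's unicodedata module stands in for a Unicode
character database, and the helper names (is_ascii_alpha,
is_unicode_alpha, split_terms) are made up for this example.

```python
import unicodedata


def is_ascii_alpha(ch):
    """Roughly what a byte-oriented isalpha() accepts in the "C" locale."""
    return ch.isascii() and ch.isalpha()


def is_unicode_alpha(ch):
    """True if the code point is in a Unicode letter category (Lu, Ll, ...)."""
    return unicodedata.category(ch).startswith("L")


def split_terms(text, is_alpha):
    """Split text into maximal runs of characters accepted by is_alpha."""
    terms, current = [], []
    for ch in text:
        if is_alpha(ch):
            current.append(ch)
        elif current:
            # A non-letter ends the current term.
            terms.append("".join(current))
            current = []
    if current:
        terms.append("".join(current))
    return terms


# Precomposed characters written as escapes so the sample is unambiguous.
sample = "na\u00efve caf\u00e9 search"
ascii_terms = split_terms(sample, is_ascii_alpha)
unicode_terms = split_terms(sample, is_unicode_alpha)
print(ascii_terms)
print(unicode_terms)
```

With the ASCII-only test, the accented words are broken apart at the
non-ASCII letters; with the Unicode test they survive as whole terms.
In C, ICU's u_isalpha() from unicode/uchar.h plays the role that
is_unicode_alpha plays here.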
J
--
/--------------------------------------------------------------------------\
James Aylett xapian.org
james at tartarus.org uncertaintydivision.org