[Xapian-discuss] UTF8 support plans (without stemming)

Olly Betts olly at survex.com
Thu Apr 28 13:40:44 BST 2005


On Thu, Apr 28, 2005 at 01:09:42PM +0100, James Aylett wrote:
> There may (I can't remember) be some practical issues about
> putting NUL bytes in there

Not really.  In a term name, quartz internally encodes each zero byte
using 2 bytes, so the maximum term length is reduced, but that's the
only issue.  The "new quartz" won't have even that restriction.

Cheers,
    Olly



More information about the Xapian-discuss mailing list