[Snowball-discuss] More patches

Richard Boulton richard at lemurconsulting.com
Fri Feb 16 11:35:09 GMT 2007


Olly Betts wrote:
> I've just discovered that this patch incorrectly converted romanian1
> files to utf-8, but they were already in utf-8 (the "make check" rule
> didn't catch this because romanian1 isn't built into libstemmer by
> default).  Sorry about that.
> 
> This patch reverts those files to their original state:
> 
> http://oligarchy.co.uk/xapian/patches/snowball-fix-overencoding-of-romanian1.patch

I've reverted the files.

> A related issue - there are a small number of examples in the hungarian
> vocabulary which contain upper case ASCII letters.  Would it make sense
> to just change these to lower case for consistency with the other test
> vocabularies?

I think it would make sense to change these to lower case, so I've done 
so.  It doesn't change the output.txt file at all (as expected).

-- 
Richard



More information about the Snowball-discuss mailing list