[Snowball-discuss] More patches
Richard Boulton
richard at lemurconsulting.com
Fri Feb 16 11:35:09 GMT 2007
Olly Betts wrote:
> I've just discovered that this patch incorrectly converted romanian1
> files to utf-8, but they were already in utf-8 (the "make check" rule
> didn't catch this because romanian1 isn't built into libstemmer by
> default). Sorry about that.
>
> This patch reverts those files to their original state:
>
> http://oligarchy.co.uk/xapian/patches/snowball-fix-overencoding-of-romanian1.patch
I've reverted the files.
> A related issue - there are a small number of examples in the hungarian
> vocabulary which contain upper case ASCII letters. Would it make sense
> to just change these to lower case for consistency with the other test
> vocabularies?
I think it would make sense to change these to lower case, so I've done
so. It doesn't change the output.txt file at all (as expected).
--
Richard
More information about the Snowball-discuss
mailing list