[Snowball-discuss] Croatian stemmer help

Martin Porter martin at porterloo.wanadoo.co.uk
Fri Aug 1 15:40:30 BST 2008





>For example: magareta -> magare
>
>So if I get it good this is something like:
>non-vowel ['eta'] <- 'e'
>or
>['eta'] non-vowel <- 'e'

Yes that right. In fact it's the second of the two, since you would scan the
string backwards, and the test for non vowel comes after the test for 'eta'
(but although scanning backwards, literal strings are not written out
backwards.)

An advantage of Snowball is that the code it generates is really pretty fast.

I might be able to help you a bit. It will depend on how busy I am with
other things.

M.





More information about the Snowball-discuss mailing list