[Snowball-discuss] Error in the vocabulary for Italian stemmer?

Peter Stahl pemistahl at googlemail.com
Fri Jun 18 15:26:37 BST 2010


Hi Martin,

thanks, my port of the Romanian stemmer is now working properly.  Do you know the Natural Language Toolkit (http://www.nltk.org)? It's a collection of Python modules which can be used for natural language processing. Everyone is allowed to contribute something. As NLTK has been helping me a lot during my studies, I wanted to give something back and port your stemmers to Python. I started a discussion about it here: http://groups.google.com/group/nltk-dev/browse_thread/thread/6098341c1ac4b7a3

If NLTK's people don't mind, then I don't mind to make the stemmers available to you, too. I'm going to ask them about it.


Have a nice weekend,
Peter



Am 17.06.2010 um 09:43 schrieb Martin Porter:

> 
> Peter,
> 
> This time you have found something ...
> 
> R1 in Step 4 of the Romanian algorithm definition should be RV. (RV is used
> in the corresponding routine vowel_suffix of the snowball script.) I will
> correct the page on the website soon.
> 
> Sorry about that. I guess it proves that the Romanian algorithm has not
> previously been recoded following the algorithm definition.
> 
> I'm interested that you seem to be recoding all the algorithms in python. If
> they are BSD licensed, might you make them available to us?
> 
> Martin
> 
> At 05:02 PM 6/16/2010 +0200, Peter Stahl wrote:
>> 
>> Hi again,
>> 
>> now I have a different problem with the Romanian stemmer ...
> 
> 
> 




More information about the Snowball-discuss mailing list