[Snowball-discuss] Error in the vocabulary for Italian stemmer?
Peter Stahl
pemistahl at googlemail.com
Fri Jun 18 15:26:37 BST 2010
Hi Martin,
Thanks, my port of the Romanian stemmer is now working properly. Do you know the Natural Language Toolkit (http://www.nltk.org)? It's a collection of Python modules which can be used for natural language processing. Everyone is allowed to contribute something. As NLTK has been helping me a lot during my studies, I wanted to give something back and port your stemmers to Python. I started a discussion about it here: http://groups.google.com/group/nltk-dev/browse_thread/thread/6098341c1ac4b7a3
If the NLTK people don't mind, I'd be happy to make the stemmers available to you, too. I'm going to ask them about it.
Have a nice weekend,
Peter
On 17.06.2010 at 09:43, Martin Porter wrote:
>
> Peter,
>
> This time you have found something ...
>
> R1 in Step 4 of the Romanian algorithm definition should be RV. (RV is used
> in the corresponding routine vowel_suffix of the snowball script.) I will
> correct the page on the website soon.
>
> Sorry about that. I guess it proves that the Romanian algorithm has not
> previously been recoded following the algorithm definition.
>
> I'm interested that you seem to be recoding all the algorithms in python. If
> they are BSD licensed, might you make them available to us?
>
> Martin
>
> At 05:02 PM 6/16/2010 +0200, Peter Stahl wrote:
>>
>> Hi again,
>>
>> now I have a different problem with the Romanian stemmer ...
>
>
>