[Snowball-discuss] Portuguese stemmer: error in output.txt file?

Martin Porter martin.porter at grapeshot.co.uk
Fri Sep 17 11:14:14 BST 2004


Portuguese stemmer error.

The problem was in the Snowball script, which was failing to restore the
cursor to the beginning of the word at one significant point. It just needed
an 'and' inserting into the script. So, we have a slightly altered Snowball
script in place, a new file output.txt (and diffs.txt), with the words 

    insuficiência, deficiência, deficiências, eficiência, impaciência etc

now correctly stemmed, and new C and java versions.

Thanks to Frederick Brault for finding this,

Martin






More information about the Snowball-discuss mailing list