[Snowball-discuss] Portuguese stemmer: error in output.txt file?
Martin Porter
martin.porter at grapeshot.co.uk
Fri Sep 17 11:14:14 BST 2004
Portuguese stemmer error.
The problem was in the Snowball script, which was failing to restore the
cursor to the beginning of the word at one significant point. It just needed
an 'and' inserting into the script. So, we have a slightly altered Snowball
script in place, a new file output.txt (and diffs.txt), with the words
insuficiência, deficiência, deficiências, eficiência, impaciência etc.
now correctly stemmed, and new C and Java versions.
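
For anyone curious why a missing 'and' matters: in Snowball, 'C1 and C2' runs C1, puts the cursor back where it started, and then runs C2, whereas plain sequencing leaves the cursor wherever C1 moved it. The following is only a toy model of that behaviour in Python, not the Portuguese script itself (this message does not quote the changed line). The names State, goto_first_vowel, starts_with, seq, and_ and run are all made up for the illustration, and the test word is written without its accent to keep the sketch ASCII-only.

```python
class State:
    """Toy stand-in for Snowball's string-plus-cursor state (hypothetical)."""
    def __init__(self, word):
        self.word = word
        self.cursor = 0


def goto_first_vowel(s):
    """Move the cursor forward to the first vowel; fail if there is none."""
    for i in range(s.cursor, len(s.word)):
        if s.word[i] in "aeiou":
            s.cursor = i
            return True
    return False


def starts_with(prefix):
    """Build a command that tests for `prefix` at the current cursor."""
    def command(s):
        return s.word.startswith(prefix, s.cursor)
    return command


def seq(c1, c2):
    """Plain sequencing: c2 starts wherever c1 left the cursor."""
    def command(s):
        return c1(s) and c2(s)
    return command


def and_(c1, c2):
    """Model of Snowball's 'and': restore the cursor before running c2."""
    def command(s):
        start = s.cursor
        if not c1(s):
            return False
        s.cursor = start  # the step that was effectively missing
        return c2(s)
    return command


def run(command, word):
    """Run a command on a fresh state and return its success signal."""
    return command(State(word))


# Without the restore, the second test starts at the first vowel and fails.
print(run(seq(goto_first_vowel, starts_with("def")), "deficiencia"))
# With the restore, the second test starts at the beginning of the word.
print(run(and_(goto_first_vowel, starts_with("def")), "deficiencia"))
```

The first call reports failure because the prefix test begins after the cursor has already moved into the word; the second succeeds because the cursor is back at position 0 when the prefix test runs.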
Thanks to Frederick Brault for finding this,
Martin