[Snowball-discuss] no words ending on alism in Porter diffs txt file

Ward Bekker (TTY) ward at tty.nl
Fri Oct 21 19:58:11 BST 2011


Hi,

Two questions:

1) While coverage testing the Erlang implementation of the Porter algorithm using the "Vocabulary + stemmed equivalent" file, I noticed that there a no words included that end on "alism". Is this on purpose?

See http://snowball.tartarus.org/algorithms/porter/diffs.txt

2) In the Vocabulary + stemmed equivalent" file I noticed that eg. "terribly" is stemmed to "terribli". In the Erlang version this is stemmed to "terribl", which maches the way "terrible" is stemmed. That looks useful to the untrained eye. Is this a side effect of the abli  →  able replaced by bli  →  ble change?

Regards,

Ward Bekker

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.tartarus.org/mailman/private/snowball-discuss/attachments/20111021/9bb48ace/attachment.htm>


More information about the Snowball-discuss mailing list