[Snowball-discuss] Changes to Porter2

Martin Porter martin_porter@softhome.net
Thu, 22 Nov 2001 03:06:26 -0700


I have made some changes to the porter2 algorithm.

The documentation errors noticed by Andrew Aksyonoff have been corrected.

-s removal has been changed. You now need a vowel somewhere before the
letter before the s. So 'gas', 'this', 'has', 'was' keep the s, 'dogs',
'cats', 'woos', 'kiwis' lose the s. Usefully, the s is not removed from
non-words like 'cvs', 'spss', 'lms' etc.

In general there is a problem identifying plurals of words ending Xs, where
X is vowel other than e. As you know, porter2 leaves -us alone but removes s
after a,i,o. This works fairly well. 

I have added a few more exceptions in following suggestions from Steve Tolkin.


_______________________________________________
Snowball-discuss mailing list
Snowball-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/snowball-discuss

_____________________________________________________________________
VirusChecked by the Incepta Group plc
_____________________________________________________________________