[Snowball-discuss] probable bug in English stemmer

Andrew Aksyonoff shodan at shodan.ru
Sun Feb 6 23:09:30 GMT 2011


Hello Martin,

Sunday, February 6, 2011, 6:19:44 PM, you wrote:
MP> It is very nice to hear from you again after so many years!
MP> Yes, the snowball-discuss group goes on as ever, though now
MP> I am 66 I'm slowing down somewhat. Richard Boulton of course is
MP> still very young.

Oh wow... time flies like an arrow.  For some reason I thought
you were younger, maybe in your fifties.

MP> In detail, Porter2 stems
MP> exceptionalism to exceptional (step 2)
MP> exceptional to exception (step 3)
MP> exception to except (step 4)
MP> see http://snowball.tartarus.org/algorithms/english/stemmer.html

Many thanks for the clarification.  It's all clear now.  I indeed
did not handle -tional and -ational in Step 3 of my implementation,
and that made me mistake the (valid) Step 3 reduction of -tional for
a (buggy) double reduction in Step 4.
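
For illustration, here is a minimal Python sketch that traces the chain
quoted above. It is not the full Porter2 algorithm: it assumes only the
suffix rules involved here (-alism in Step 2, -ational/-tional in Step 3,
-ion after s or t in Step 4), and it ignores the special handling of "y"
and the R1 exceptions for some prefixes. All function names are made up
for this example.

```python
VOWELS = set("aeiouy")  # simplification: "y" is always treated as a vowel


def region_start(word, start=0):
    """Index just past the first non-vowel that follows a vowel,
    searching from `start`; len(word) if there is no such position."""
    for i in range(start + 1, len(word)):
        if word[i] not in VOWELS and word[i - 1] in VOWELS:
            return i + 1
    return len(word)


def r1_r2(word):
    """Start offsets of the regions R1 and R2 (R2 is computed inside R1)."""
    r1 = region_start(word)
    r2 = region_start(word, r1)
    return r1, r2


def step2(word):
    # Only one Step 2 rule is modelled: -alism -> -al, if the suffix is in R1.
    r1, _ = r1_r2(word)
    if word.endswith("alism") and len(word) - 5 >= r1:
        return word[:-5] + "al"
    return word


def step3(word):
    # The longest matching suffix is chosen first, so -ational is tested
    # before -tional; if it matches but is not in R1, nothing is done.
    r1, _ = r1_r2(word)
    if word.endswith("ational"):
        return word[:-7] + "ate" if len(word) - 7 >= r1 else word
    if word.endswith("tional") and len(word) - 6 >= r1:
        return word[:-6] + "tion"
    return word


def step4(word):
    # Only one Step 4 rule is modelled: delete -ion in R2 after "s" or "t".
    _, r2 = r1_r2(word)
    if (word.endswith("ion") and len(word) > 3
            and len(word) - 3 >= r2 and word[-4] in "st"):
        return word[:-3]
    return word


def trace(word):
    """Return the word followed by its form after each modelled step."""
    forms = [word]
    for step in (step2, step3, step4):
        forms.append(step(forms[-1]))
    return forms


print(" -> ".join(trace("exceptionalism")))
# prints: exceptionalism -> exceptional -> exception -> except
```

With the Step 3 rule missing, the intermediate form "exceptional" would
reach Step 4 unchanged, which is the situation described above.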

Just out of curiosity, I wonder if that change was made after 2001.
That is, whether I introduced the bug back then, or just now when
doing the update.

MP> Similarly the other words you mention. I hope I've got this right:
MP> like you, I am now a bit rusty on the English stemmer.

I'm getting up to speed again lately. :)


-- 
Best regards,
 Andrew                            mailto:shodan at shodan.ru



