[Snowball-discuss] Porter2 problem

Olly Betts olly at survex.com
Thu May 25 19:36:24 BST 2006


On Thu, May 25, 2006 at 12:14:54PM -0600, Neil wrote:
> Just have a question about the porter 2 algorithm, maybe someone has run
> into this before (I'd be surprised if they hadn't).
> 
> When I stem the word sun I get sun (no surprise there)
> But sunny becomes sunni

This isn't a bug - it's how the stemming algorithms work.  The stem
isn't necessarily a word itself (though it generally looks mostly like
one.)  What really matters is that words the stemmer should conflate
produce the same stem, and words it shouldn't conflate produce different
stems.

Cheers,
    Olly



More information about the Snowball-discuss mailing list