[Snowball-discuss] Porter stemming not dealing well with -gist -gists
    Martin Porter 
    martin.f.porter at gmail.com
       
    Wed Apr 17 06:47:21 BST 2013
    
    
  
Hi Marc,
The "Porter stemmer" is now frozen in time, but this could be
something to add to the Snowball "English stemmer". It's connected to
the fact that -ist is not, in general, removed. -ologist -> olog
could, however, be added.
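
To make the proposed rule concrete, here is a minimal sketch in Python of what "-ologist -> olog" would do if applied on its own. The function name strip_ologist is hypothetical, and this is not code from the Snowball English stemmer: a real rule would presumably be restricted to a region of the word such as R2, which this sketch leaves out.

```python
def strip_ologist(word: str) -> str:
    """Hypothetical extra rule: rewrite -ologist / -ologists as -olog.

    Illustration only. It is not part of the Porter stemmer or of the
    Snowball English stemmer, and it ignores the R1/R2 region checks
    that a real Snowball rule would normally apply.
    """
    lowered = word.lower()
    # Try the longer (plural) suffix first so "-ologists" is caught whole.
    for suffix in ("ologists", "ologist"):
        if lowered.endswith(suffix):
            return lowered[: -len(suffix)] + "olog"
    # Other words, including ordinary -ist words, are left unchanged.
    return lowered


if __name__ == "__main__":
    for w in ("Oncologist", "oncologists", "artist", "gist"):
        print(w, "->", strip_ologist(w))
```

With this rule "oncologist" and "oncologists" both become "oncolog", while words such as "artist" are untouched, in keeping with -ist not being removed in general.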
I've put your suggestion on the snowball-discuss list,
Thanks, Martin
On Tue, Apr 16, 2013 at 6:29 PM, Marc Schipperheijn
<m.schipperheyn at gmail.com> wrote:
> Hi,
>
> The Oncologist reviewed the results and decided oncology was not for him.
>
> In this example, the stemming filter doesn't correctly identify the stem of Oncologists as oncol. Since there is a whole class of -gists in the world, particularly in the medical world, this seems an omission.
>
> Perhaps a next version of the algorithm can deal with this.
>
> Kind regards,
> Marc
    
    