[Snowball-discuss] newbie does snowball remove stop words?

giselle.muller9 giselle.muller9 at gmail.com
Tue Mar 24 21:47:54 GMT 2015


WHO ARE YOU? GET ME OF YOUR MAILING LIST AND DATABASE.  I HAVE NEVER SUBSCRIBE 

I DO NOT WANT TO HEAR FROM ANY OF YOU FROM THIS OR ANY OTHER ORGANISATION 

I WILL TAKE ACTION UNLESS THESE EMAIL CEASE FROM ANYONE ASSOCIATED WITH THIS OR ANY OTHER ORGANISATION WHO BLATANTLY CONTINUE TO SEND EMAILS TO ME


Sent from Samsung tablet


-------- Original message --------
From: Oerd Cukalla <rosaccu at gmail.com> 
Date: 25/03/2015  01:22  (GMT+10:00) 
To: Andrew Davidson <adavidson2 at apple.com>, Olly Betts <olly at survex.com> 
Cc: snowball-discuss at lists.tartarus.org 
Subject: Re: [Snowball-discuss] newbie does snowball remove stop words? 

Hi Andrew,

you may want to implement a lookup table populated with the stopwords and only stem a word in input if it is not in the stopwords table.

It should be quite easy to implement in Java, but let me know if you need assistance.

Have a nice day, 
    Oerd


On Tue, 17 Mar 2015 21:36 Andrew Davidson <adavidson2 at apple.com> wrote:
Hi Olly

I imagine removing stop words is a fairly common requirement. Any idea how people implement stop word removal with  snowball? 

The reason I originally thought snowball provided stop word removal was because of the following links  http://snowball.tartarus.org/algorithms/english/stop.txt (from http://snowball.tartarus.org/algorithms/english/stemmer.html)

It seems to suggest there is some stop word support

Thanks

Andy

On Mar 16, 2015, at 10:20 PM, Olly Betts <olly at survex.com> wrote:

On Mon, Mar 16, 2015 at 06:31:09PM -0700, Andrew Davidson wrote:
today I downloaded the java version of snowball and compiled it. I ran
a couple of little example through it. It does not appear to remove
stop works. Is this a bug? 

That's not a bug - snowball is a stemmer, not a stopword remover.

Cheers,
   Olly

_______________________________________________
Snowball-discuss mailing list
Snowball-discuss at lists.tartarus.org
http://lists.tartarus.org/mailman/listinfo/snowball-discuss
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.tartarus.org/mailman/private/snowball-discuss/attachments/20150325/e58f3672/attachment.html>


More information about the Snowball-discuss mailing list