[Snowball-discuss] Java implementation of the Portuguese stemming algorithm
Pedro Oliveira
pcoliv at student.dei.uc.pt
Wed Nov 5 12:04:32 GMT 2008
I've just downloaded the stemmer from the webpage and it's working. The
<br> tags are used to create a formatted JavaDoc description of the
Class (it is very useful when using an IDE).
Since the filename was edited to portuguese_java.txt, it is better to
delete the "public" statement in the class declaration, i.e.,
public class PortuguesePorterStemmer{
goes to
class PortuguesePorterStemmer{
This way users only have to rename the portuguese_java.txt to
portuguese_java.java and use it. Otherwise, users have to rename the
file to PortuguesePorterStemmer.java, since a public class must have a
file with the same name.
Regards,
Pedro Oliveira
Martin Porter wrote:
> Pedro,
>
> I've put your stemmer in place at
>
> http://snowball.tartarus.org/otherlangs/index.html
>
> The stemmer as I received it was included in the body of the email, and I
> have carefully edited it out of the email. Please check that I have done
> this correctly. The only change I knowingly made was to remove tab character
> at end-of-line. I have left in some "<br>" strings in the first comment. Can
> they be removed?
>
> Since you are planning a revised version soon, cosmetic changes can be
> ignored, I think.
>
> Many thanks,
>
> Martin
>
>
> At 19:00 30/10/2008 +0000, Pedro Oliveira wrote:
>
>> Ok :)
>> In the attachment of this email is my Java implementation. In the future
>> I plan to improve its performance. When I do that, I'll send you another
>> version.
>> There is a link to my toolkit in the file, but, if there is no
>> inconvenient, a link can also be provided in the "affiliation" or
>> "notes" section.
>> . . . . . . .
>>
>
>
More information about the Snowball-discuss
mailing list