[Snowball-discuss] Re: Error in Spanish stemming algorithm

Martin Porter martin_porter@softhome.net
Sat, 17 Aug 2002 05:36:50 -0600


Mabel,

Could you send or copy any future Snowball related emails to

    snowball-discuss@lists.sourceforge.net

please?=20

To answer your specific question, the definitions of R1 and RV are
independent of each other, and start with the whole word. Another way of
looking at RV is to see four cases:

    VC...V|...
    CC...V|...
    VV...C|...
    CV.|...  =20

If the word begins VC (vowel consonant) RV ends after the next V, if it
begins CC RV ends after the next V, if it begins VV RV ends after the next
C, and if it begins CV RV ends after the third letter. Or RV is the whole
word if none of these patterns apply.

Apart from that I can't really help with your Visual Basic implementation
until you give me an example word which it stems differently from the
Snowball algorithm. If you will do that, I can send you back a picture of
how the regions are defined and how the word is treated by the different st=
eps.

It is possible that the given Spanish algorithm is incorrect: we found
errors in the Russian algorithm definition when it came to be recoded in
another language. But I can't see anything immediately wrong after a furthe=
r
cursory inspection.

Martin

>On Fri, 2002-08-16 at 00:35, MABEL PENIA GARCIA wrote:
>Hola, programe el algoritmo en visual basic para espa=F1ol y no reduce
>igual como los ejemplos de palabras que dan, especialmente en Step 1:
>Standard suffix removal hay un lio con R2 o R1, estaba pensando tambien
>si estoy fallando en la definicion de R1.
>Una pregunta: para calcular R1 se parte de Rv o de la palabra en si.
>Pero igual no funciona para todos los casos.
>Por favor me dicen si existen modificaciones o donde puedo buscar.
>Gracias por todo. Cualquier cosita sera bien recibido.
>Chau.



-------------------------------------------------------
This sf.net email is sponsored by: OSDN - Tired of that same old
cell phone?  Get a new here for FREE!
https://www.inphonic.com/r.asp?r=3Dsourceforge1&refcode1=3Dvs3390
_______________________________________________
Snowball-discuss mailing list
Snowball-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/snowball-discuss