[Snowball-discuss] The Danish stemmer
Martin Porter
martin at porterloo.wanadoo.co.uk
Wed Jul 15 09:18:38 BST 2009
>
>I'm trying to implement the Danish stemmer . . . . My question
>refers to the last 1% since I can't seem to find any bugs in my
>implementation.
>
>Of the first 1000 words I get these errors
>Word/My stemmer/diffs.txt
>adelig/ad/ade
> . . . .
Michael, hi.
The reason "elig" is not removed in step 3 is that R1 is defined so that it
is preceded by at least 3 letters, and so "elig" does not lie in R1,
a d e l i g
| |R1--->
I'm sorry this is not more clear: for the definition of R1, there is a
cross-reference to the German stemmer, where it is defined.
The other cases you note have a similar explanation (I guess).
I'll try and clarify the definition.
Martin
More information about the Snowball-discuss
mailing list