[Snowball-discuss] Mismatch between vocab.txt and output.txt

Olly Betts olly@survex.com
Mon Oct 14 01:17:02 2002


I've found mismatches for french and finnish stemmers.

The first disagreement for finnish is that the stemmer produces
"aachenin" but output.txt contains "aachen".

And the first for french is "abaisai" when output.txt contains "abaiss".

I can generate full lists if they're useful, but I assume you have
testing scripts of your own...

Are the stemmers wrong, or being miscompiled, or is output.txt just
out of date for these two?

Cheers,
    Olly