Bengt, The vocabulary does not help define the algorithm but illustrates its use, see the last paragraph in section 1 of http://snowball.tartarus.org/texts/introduction.html Nonsense words and mis-spellings are helpful in this, and should not be removed: they reflect a common enough feature of real text. Martin