[Snowball-discuss] Stemming 'communing' and 'communed'

Michael Edwards mbedwards at gmail.com
Thu Mar 29 23:39:27 BST 2007


On 3/29/07, Martin Porter <martin.porter at grapeshot.co.uk> wrote:
> You are right: the spec is not clear on this, and I will have to alter
> it (I'll try and do so in the next few days). One way of looking at it
> is that in commun- gener-, the first vowel is treated as a consonant,
> and then cXmmun gXner become short words, and another way of looking at
> it is to say that a short word is to be defined as a something ending
> with a short syllable entirely outside R1.
>
> So perhaps R1, R2 should be defined first, then shortness in terms of
> R1.

The "if there is a short syllable immediately preceding R1" makes
sense and seems to be easy to implement. Thanks for the clarification
on this.

> (Although the spec makes no use of R1 in defining 'short', the snowball
> script uses R1 to determine whether something is short, so there is a
> connection.)

Yeah, I saw that code, but due to my unfamiliarity with snowball's
syntax I wasn't quite sure what it was doing.

Best regards,
Michael



More information about the Snowball-discuss mailing list