[Snowball-discuss] Irregular Plurals and Ablaut Plurals

Nathan Wilson Nathan.Wilson at PPC.com
Fri Aug 13 20:14:12 BST 2010


I am using CIS Documentum 6.5 SP3, which uses Snowball for its stemming logic. I have noticed that Snowball does not handle irregular or ablaut plurals.
Can these cases be handled in the exception1 routine of the English stemmer?

'men'       (<-'man')
'women'     (<-'woman')
'children'  (<-'child')
'oxen'      (<-'ox')
'ran'       (<-'run')

There are more, and care should be taken not to bloat the exception1 routine, but including the more common cases may be useful.
Is this where irregular and ablaut plurals can be handled? If not, is there another place to handle such stemming?
If these cases can be handled here, is there any idea of when, or whether, this will be included?
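
To make the request concrete, the idea is roughly a lookup table consulted before the normal stemming rules run. Below is a minimal Python sketch of that idea. It is not Snowball code and not how exception1 is actually written; IRREGULAR_FORMS, fallback_stem and stem_with_exceptions are hypothetical names, and fallback_stem is only a stand-in for the real English stemmer.

```python
# Hypothetical exception table: irregular form -> desired stem.
# The five pairs are the examples listed in this message; the table and
# function names are illustrative only and are not part of Snowball.
IRREGULAR_FORMS = {
    "men": "man",
    "women": "woman",
    "children": "child",
    "oxen": "ox",
    "ran": "run",
}


def fallback_stem(word):
    """Stand-in for the real stemmer: it only strips a trailing 's'."""
    if len(word) > 3 and word.endswith("s") and not word.endswith("ss"):
        return word[:-1]
    return word


def stem_with_exceptions(word, stem=fallback_stem):
    """Consult the exception table first, then defer to the stemmer."""
    lowered = word.lower()
    if lowered in IRREGULAR_FORMS:
        return IRREGULAR_FORMS[lowered]
    return stem(lowered)
```

Keeping the table small and limited to common words is what would stop the exception list from growing out of hand; anything not listed simply falls through to the ordinary rules.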

================================================================
Nathan S Wilson
Taxonomy Analyst
Project Performance Corporation
Part of the AEA group

1760 Old Meadow Rd., Floor, McLean, Virginia 22102
ph: 703.748.7254  cell: 260.312.2811  fax: 703.748.7001
web: www.ppc.com  email: nathan.wilson at ppc.com
================================================================
