[Snowball-discuss] Fwd: Modified Stemming To Generate Valid Words

Yash Mittal yashmittal2009 at gmail.com
Sun May 14 18:14:45 BST 2017


Greetings,

I hope you are doing well. Over the past year or so, I have been working on
modifying Dr. Porter's algorithm so that the stems it produces are valid
English words. You can check the GitHub repository here
<https://github.com/ymittal/text-analyzer/tree/master/accuracy>. The README
describes my motivation and the problem. As of now, my modified stemmer
outputs valid words for around 89% of the word list Dr. Porter used for his
algorithm.
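
For reference, a valid-word rate like the one mentioned above could be
measured roughly as follows. This is only a minimal sketch: the names
`valid_word_rate`, `toy_stem`, and `toy_dictionary` are hypothetical and
chosen for illustration. They are not taken from the repository, and
`toy_stem` is a stand-in, not the Porter algorithm or the modified stemmer.

```python
def valid_word_rate(words, stem, dictionary):
    """Return the fraction of input words whose stem is found in `dictionary`."""
    words = list(words)
    if not words:
        return 0.0
    valid = sum(1 for w in words if stem(w) in dictionary)
    return valid / len(words)


def toy_stem(word):
    # Hypothetical stand-in stemmer: strips a few common suffixes.
    # This is NOT the Porter algorithm; it only makes the sketch runnable.
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


# Assumed tiny dictionary and word list, for illustration only.
toy_dictionary = {"jump", "walk", "cat", "run"}
sample = ["jumping", "walked", "cats", "making"]

rate = valid_word_rate(sample, toy_stem, toy_dictionary)
print(f"{rate:.0%} of stems are valid words")
```

Here "making" is stemmed to "mak", which is not in the toy dictionary, so it
counts against the rate. A real measurement would swap in the actual stemmer
and a full English word list.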

I was wondering if you have any suggestions on how I can improve this
project. Additionally, would you be interested in adding this project to
the list of projects <http://snowballstem.org/projects.html> on your
website? Please let me know what you think.

Thank you for your time and consideration,

Yash Mittal
Bucknell University Class of 2019
Presidential Fellow
Computer Science & Engineering Major

