[Snowball-discuss] Fwd: How to add Tamil Support to stemmer?

Shrinivasan T tshrinivasan at gmail.com
Wed Mar 27 13:07:26 GMT 2013


On Wed, Mar 27, 2013 at 6:31 PM, Richard Boulton <richard at tartarus.org> wrote:
> On 27 March 2013 12:03, Shrinivasan T <tshrinivasan at gmail.com> wrote:
>> The patch that adds Tamil support to the stemmer is here:
>> https://github.com/rdamodharan/tamil-stemmer/blob/master/snowball-tamil.patch
>>
>> We apply the patch and compile the stemmer to make it work with the Tamil language.
>>
>> How can we get the patch added to the upstream stemmer?
>
> "rdamodharan" has actually done exactly what's needed for this, by
> submitting a pull request on github to our repository;
> https://github.com/snowballstem/snowball/pull/2  Unfortunately, I
> haven't had a chance to look at this so far; I will make sure to make
> time to do so over the next few days.
>
> I have no way of evaluating the results of this stemmer, but am
> willing to take the word of Tamil speakers as to whether the algorithm
> is of use.  There may be some changes to the code that should be made
> to improve performance, as Martin mentioned.  One thing that would be
> of great use is a sample dataset, similar to that in
> https://github.com/snowballstem/snowball-data/blob/master/english/voc.txt,
> together with a sample file containing the corresponding expected
> output.
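
For anyone preparing such a dataset, here is a minimal sketch of how a word
list and its expected stems could be checked against a stemmer. The stemmer
itself is passed in as a plain function; `toy_stem` below is a hypothetical
stand-in, not the Tamil algorithm or any Snowball API, and the sample words
are made up for illustration.

```python
from typing import Callable, List, Tuple


def compare_stems(
    words: List[str],
    expected: List[str],
    stem: Callable[[str], str],
) -> List[Tuple[str, str, str]]:
    """Return (word, expected_stem, actual_stem) for every mismatch."""
    if len(words) != len(expected):
        raise ValueError("word list and expected-output list differ in length")
    mismatches = []
    for word, want in zip(words, expected):
        got = stem(word)
        if got != want:
            mismatches.append((word, want, got))
    return mismatches


def toy_stem(word: str) -> str:
    # Hypothetical stand-in stemmer: strips one trailing "s".
    return word[:-1] if word.endswith("s") else word


if __name__ == "__main__":
    # In practice the two lists would be read line by line from the
    # vocabulary file and the expected-output file.
    sample_words = ["cats", "dog", "bus"]
    sample_expected = ["cat", "dog", "bus"]
    for word, want, got in compare_stems(sample_words, sample_expected, toy_stem):
        print(f"{word}: expected {want!r}, got {got!r}")
```

An empty result means every word stemmed as expected; any reported line is
either a bug in the stemmer or an error in the expected-output file.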

Thanks for the update.

I have forwarded this to damo.

I hope he may work on this further.

-- 
Regards,
T.Shrinivasan


My Life with GNU/Linux : http://goinggnu.wordpress.com
Free/Open Source Jobs : http://fossjobs.in



