[Snowball-discuss] Fwd: How to add Tamil Support to stemmer?
Shrinivasan T
tshrinivasan at gmail.com
Wed Mar 27 13:07:26 GMT 2013
On Wed, Mar 27, 2013 at 6:31 PM, Richard Boulton <richard at tartarus.org> wrote:
> On 27 March 2013 12:03, Shrinivasan T <tshrinivasan at gmail.com> wrote:
>> The patch for stemmer for tamil language is here.
>> https://github.com/rdamodharan/tamil-stemmer/blob/master/snowball-tamil.patch
>>
>> We apply the patch and compile stemmer to make it work with tamil language.
>>
>> How to add the patch to the upstream stemmer?
>
> "rdamodharan" has actually done exactly what's needed for this, by
> submitting a pull request on github to our repository;
> https://github.com/snowballstem/snowball/pull/2 Unfortunately, I
> haven't had a chance to look at this so far; I will make sure to make
> time to do so over the next few days.
>
> I have no way of evaluating the results of this stemmer, but am
> willing to take the word of Tamil speakers as to whether the algorithm
> is of use. There may be some changes to the code that should be made
> to improve performance, as Martin mentioned. One thing that would be
> of great use is a sample dataset, similar to that in
> https://github.com/snowballstem/snowball-data/blob/master/english/voc.txt,
> together with a sample file containing the corresponding expected
> output.
Thanks for the update.
I sent this to damo.
Hope me may work on this further.
--
Regards,
T.Shrinivasan
My Life with GNU/Linux : http://goinggnu.wordpress.com
Free/Open Source Jobs : http://fossjobs.in
Get CollabNet Subversion Edge : http://www.collab.net/svnedge
More information about the Snowball-discuss
mailing list