<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<style type="text/css" style="display:none;"> P {margin-top:0;margin-bottom:0;} </style>
</head>
<body dir="ltr">
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Thank you, Martin, for your suggestion. I will ask the question in the Annif forum too.</div>
<div id="appendonsend"></div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" color="#000000" style="font-size:11pt"><b>From:</b> Martin Porter &lt;martin.f.porter@gmail.com&gt;<br>
<b>Sent:</b> Monday, August 9, 2021 2:54 PM<br>
<b>To:</b> Jeet Biswas &lt;j.biswas@wti-frankfurt.com&gt;<br>
<b>Cc:</b> snowball-discuss@lists.tartarus.org &lt;snowball-discuss@lists.tartarus.org&gt;<br>
<b>Subject:</b> Re: [Snowball-discuss] Stemming different spellings</font>
<div> </div>
</div>
<div class="BodyFragment"><font size="2"><span style="font-size:11pt">
<div class="PlainText">Jeet,<br>
<br>
I think your question should really be directed to the annif.org<br>
people, but I can give quick answers to 1 and 2.<br>
<br>
For 1, see <a href="https://lists.tartarus.org/pipermail/snowball-discuss/2013-July/001478.html">
https://lists.tartarus.org/pipermail/snowball-discuss/2013-July/001478.html</a><br>
This question has often come up, and 2013 was not the first time, but<br>
I think the note summarises (or summarizes) the issue fairly well.<br>
<br>
For 2, Snowball is for word stemming, not word respelling.<br>
color/colour is like gaol/jail: one word that can be spelled in two<br>
ways. Such normalisation would therefore have to be done outside<br>
Snowball. Perhaps Annif does this.<br>
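<br>
As a rough illustration of doing the normalisation outside the stemmer,<br>
here is a minimal Python sketch. The variant table, the function names and<br>
the tokenising regex are all hypothetical, made up for this example; this<br>
is not how Snowball or Annif is known to handle it.<br>
<pre>
```python
import re

# Hypothetical variant table: maps alternative spellings to one canonical form.
# The entries are illustrative; a real list would come from a curated lexicon.
SPELLING_VARIANTS = {
    "colour": "color",
    "colours": "colors",
    "gaol": "jail",
    "gaols": "jails",
}

# Assumption: tokens are plain runs of ASCII letters.
WORD_RE = re.compile(r"[A-Za-z]+")


def normalise_spelling(word):
    """Return the canonical spelling of a word, lower-cased."""
    lowered = word.lower()
    return SPELLING_VARIANTS.get(lowered, lowered)


def preprocess(text, stem=None):
    """Tokenise text, normalise spellings, then optionally stem each token.

    `stem` is any callable that takes a word and returns its stem; it is
    applied only after the spelling has been normalised.
    """
    tokens = [normalise_spelling(t) for t in WORD_RE.findall(text)]
    if stem is None:
        return tokens
    return [stem(t) for t in tokens]


if __name__ == "__main__":
    # prints ['color', 'and', 'color', 'jail', 'and', 'jail']
    print(preprocess("Colour and color; gaol and jail"))
```
</pre>
The stemmer is passed in as a plain callable so that the respelling step<br>
stays independent of it; with the Python snowballstemmer package that<br>
callable could be the stemWord method of an English stemmer object.<br>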
<br>
Martin<br>
</div>
</span></font></div>
</body>
</html>