[Snowball-discuss] Stop word lists
Oleg Bartunov
oleg@sai.msu.su
Tue Oct 8 21:10:02 2002
On Tue, 8 Oct 2002, Martin Porter wrote:
>
> Oleg,
>
> Thanks for the Russian list, although I'm not sure whether I know enough
> about Russian to make a sensible comparison with my list and Alex's list. We
> will see ...
>
> You might be able to give me some feedback on the list I put up in
> ...russian/stop.txt
Martin, how did you get a ranked list?
Also, I noticed several misprints in your list of Russian stop words
(each corrected line follows the original):
опят | again
опять | again
себья | oneself
себя | oneself
может | usually with 'быт` as `maybe`
может | usually with `быть' as `maybe'
етого | genitive form of `this'
этого | genitive form of `this'
какои | which
какой | which
другои | another
другой | another
мноого | lots
много | lots
этои | oblique form of `ета', fem. `this'
этой | oblique form of `эта', fem. `this'
такои | such a one
такой | such a one
Also, it's worth mentioning that the Russian character 'ё' has been
translated to 'е' in this list.
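For anyone matching text against the list, the same folding of 'ё' into 'е'
would presumably have to be applied to the input words first. A rough sketch
in Python, where the helper name fold_yo is made up for illustration and is
not part of Snowball:

```python
# Hypothetical helper (not part of Snowball): fold the Russian letter
# 'ё' into 'е' so that input words match a stop list that has already
# had the same replacement applied.
YO_TO_E = str.maketrans({"ё": "е", "Ё": "Е"})


def fold_yo(word: str) -> str:
    """Return the word with every 'ё'/'Ё' replaced by 'е'/'Е'."""
    return word.translate(YO_TO_E)


if __name__ == "__main__":
    print(fold_yo("ещё"))
```

Words without 'ё' pass through unchanged, so it should be safe to apply this
to every token before the stop word lookup.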
>
> Martin
>
>
Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83