[Snowball-discuss] Snowball: ISO-8859 to UTF-8

Martin Porter martin at porterloo.wanadoo.co.uk
Thu May 19 10:12:22 BST 2011


Tiago,

Richard Boulton set up the libstemmer approach, so he can best advise, but
the ISO-8859 source for Portuguese (and several of the other languages) is
used both for ISO-8859 and utf-8 Unicode compilations, since the codes --
a-acute etc -- are the same in both. You just alter the compilation options
in the snowball compiler.

Martin


>I downloaded snowball and was checking the sources, there was only a
>ISO-8859 and MS-DOS source there.






More information about the Snowball-discuss mailing list