[Snowball-discuss] bug in the java version: "cannot be cast to org.tartarus.snowball.SnowballStemmer"

Luangsay Sourygna luangsay at gmail.com
Tue Mar 15 20:14:17 GMT 2016


Hi,

I am quite new to Snowball so I hope that this "bug email" is really
pertinent.

It seems like the java version of Snowball has a small bug.
When I try to execute snowball with a 3rd party stemmer, I get this
exception:
Exception in thread "main" java.lang.ClassCastException:
org.tartarus.snowball.ext.DutchKraaijPohlmannStemmer cannot be cast to
org.tartarus.snowball.SnowballStemmer
        at org.tartarus.snowball.TestApp.main(TestApp.java:30)

As a workaround, I do a quick sed on the generated java class. In my case:
sed -i
"s/org.tartarus.snowball.SnowballProgram/org.tartarus.snowball.SnowballStemmer/"
java/org/tartarus/snowball/ext/DutchKraaijPohlmannStemmer.java

(by default my generated class
extends org.tartarus.snowball.SnowballProgram. And it seems like it should
extend org.tartarus.snowball.SnowballStemmer instead).

As a reference, please find all the steps I use to compile my snowball
stemmer:
cd /opt/
git clone https://github.com/snowballstem/snowball.git
cd snowball/
make
vim dutchKraaij-Pohlmann.sbl    # copy algorithm from
http://snowballstem.org/algorithms/kraaij_pohlmann/stemmer.html
mkdir java/org/tartarus/snowball/ext
./snowball dutchKraaij-Pohlmann.sbl -j -o
java/org/tartarus/snowball/ext/DutchKraaijPohlmannStemmer -n
DutchKraaijPohlmannStemmer
sed -i
"s/org.tartarus.snowball.SnowballProgram/org.tartarus.snowball.SnowballStemmer/"
java/org/tartarus/snowball/ext/DutchKraaijPohlmannStemmer.java
cd java
javac org/tartarus/snowball/*.java org/tartarus/snowball/ext/*.java
# Test:
java org.tartarus.snowball.TestApp DutchKraaijPohlmann <(echo meisje
boerderij rekenning adverteer adverteren gadverteerd artikelen artikeltje
openbaar zaken gelezen gebroken)


Sourygna
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.tartarus.org/mailman/private/snowball-discuss/attachments/20160315/c2a5b506/attachment.html>


More information about the Snowball-discuss mailing list