snowball-stemmers-weka-package
==============================

Weka package for the snowball stemmers (http://snowball.tartarus.org/).

**OBSOLETE**

This package is now maintained by the Weka maintainer. The code is available
directly from the Weka subversion repository. Here is a direct link to the
package's location in that repo:

https://svn.cms.waikato.ac.nz/svn/weka/trunk/packages/external/snowball-stemmers/

Releases
--------

* [1.0.1](https://github.com/fracpete/snowball-stemmers-weka-package/releases/download/v1.0.1/snowball-stemmers-1.0.1.zip)
* [1.0.0](https://github.com/fracpete/snowball-stemmers-weka-package/releases/download/v1.0.0/snowball-stemmers-1.0.0.zip)

How to use packages
-------------------

For more information on how to install the package, see:

https://waikato.github.io/weka-wiki/packages/manager/

Maven
-----

Add one of the following dependencies to your `pom.xml` to include the package.

This dependency automatically pulls in Weka:

```xml
<dependency>
  <groupId>com.github.fracpete</groupId>
  <artifactId>snowball-stemmers-weka-package</artifactId>
  <version>1.0.1</version>
</dependency>
```

Use the following dependency to exclude the Weka dependencies:

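```xml
<dependency>
  <groupId>com.github.fracpete</groupId>
  <artifactId>snowball-stemmers-weka-package</artifactId>
  <version>1.0.1</version>
  <exclusions>
    <exclusion>
      <groupId>nz.ac.waikato.cms.weka</groupId>
      <artifactId>weka-dev</artifactId>
    </exclusion>
    <exclusion>
      <groupId>org.pentaho.pentaho-commons</groupId>
      <artifactId>pentaho-package-manager</artifactId>
    </exclusion>
  </exclusions>
</dependency>
```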

Usage
-----

* Using the stemmers independently of Weka:

```java
import org.tartarus.snowball.SnowballStemmer;
import org.tartarus.snowball.ext.porterStemmer;

// Snowball stemmers are stateful: set the word, stem it in place, then read the result
SnowballStemmer stemmer = new porterStemmer();
stemmer.setCurrent("referred");
stemmer.stem();
System.out.println(stemmer.getCurrent());
```

* Using the stemmers as a Weka stemmer:

```java
import weka.core.stemmers.SnowballStemmer;

// The Weka wrapper selects the Snowball algorithm by name
SnowballStemmer stemmer = new SnowballStemmer();
stemmer.setStemmer("porter");
System.out.println(stemmer.stem("referred"));
```

* Using the stemmers as part of the `StringToWordVector` filter:

```java
import weka.filters.unsupervised.attribute.StringToWordVector;
import weka.core.stemmers.SnowballStemmer;

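// Configure the stemmer, then hand it to the filter so that tokens are stemmed
// while the word vector is built
SnowballStemmer stemmer = new SnowballStemmer();
stemmer.setStemmer("porter");
StringToWordVector filter = new StringToWordVector();
filter.setStemmer(stemmer);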
```
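
* Stemming every token of a sentence:

The snippets above stem one word at a time. A minimal sketch of applying a stemmer to a whole sentence might look like the following. `StemTokens` and `stemAll` are hypothetical names and not part of this package. The stemmer is passed in as a plain function so that the block compiles without Weka on the classpath, and the whitespace tokenization and lower-casing are simplifying assumptions rather than what `StringToWordVector` does internally.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.function.UnaryOperator;

// Hypothetical helper for illustration only; not part of this package.
class StemTokens {

  // Splits the text on whitespace, lower-cases each token and applies
  // the supplied stemming function to it.
  static List<String> stemAll(String text, UnaryOperator<String> stem) {
    List<String> result = new ArrayList<>();
    for (String token : text.trim().split("\\s+")) {
      if (!token.isEmpty()) {
        result.add(stem.apply(token.toLowerCase(Locale.ROOT)));
      }
    }
    return result;
  }
}
```

With the Weka stemmer configured as shown earlier, a call could look like `StemTokens.stemAll("some text", stemmer::stem)`, since `stem(String)` takes a word and returns its stem.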