Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/arcusfelis/estem
Stempel (polish stemmer) for Erlang
https://github.com/arcusfelis/estem
Last synced: 3 months ago
JSON representation
Stempel (polish stemmer) for Erlang
- Host: GitHub
- URL: https://github.com/arcusfelis/estem
- Owner: arcusfelis
- License: apache-2.0
- Created: 2015-08-09T15:17:00.000Z (almost 9 years ago)
- Default Branch: master
- Last Pushed: 2015-08-09T15:21:30.000Z (almost 9 years ago)
- Last Synced: 2024-01-22T03:47:53.837Z (6 months ago)
- Language: Java
- Size: 594 KB
- Stars: 1
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Lists
- awesome-polish-nlp - Estem - Erlang wrapper (not port) for Stempel stemmer. (Tools / Stemming)
README
Stempel (polish stemmer) for Erlang
=================License: Apache 2 for the bindings (Stempel has its own license).
Also, these two files are blobs:
- priv/lucene-1.4-final.jar
- priv/stempel-1.0.jarYou can compile them from here http://www.getopt.org/stempel/#distrib
Start node in dev mode
```erlang
erl -pa $(pwd)/ebin deps/*/ebin -sname alice
```Start application
```erlang
application:ensure_all_started(estem).
```Test that java works
```erlang
estem_java:pid().
```Run stemmer
```erlang
estem_java:stem([<<"do">>, <<"wynajęcia">>, <<"przytulne">>, <<"bezczynszowe">>, <<"mieszkaniefgxxdd">>, <<"444">>]).
```Whole example
```erlang
~/erlang/estem|master $ erl -pa $(pwd)/ebin deps/*/ebin -sname alice
Erlang/OTP 17 [erts-6.4] [source] [64-bit] [smp:4:4] [async-threads:10] [hipe] [kernel-poll:false]Eshell V6.4 (abort with ^G)
(alice@theta)1> application:ensure_all_started(estem).
16:58:07.890 [info] Application lager started on node alice@theta
16:58:08.040 [info] Java Info: Estem Node Started at: xxxd_java@theta
16:58:08.040 [notice] Java node started
16:58:08.045 [info] Application estem started on node alice@theta
{ok,[syntax_tools,compiler,goldrush,lager,estem]}
(alice@theta)2>
(alice@theta)2> estem_java:pid().
<10787.1.0>
(alice@theta)3>
(alice@theta)3> estem_java:stem([<<"do">>, <<"wynajęcia">>, <<"przytulne">>, <<"bezczynszowe">>, <<"mieszkaniefgxxdd">>, <<"444">>]).
[<<"do">>,
<<119,121,110,97,106,25,99,105,97>>,
<<"przytulny">>,<<"bezczynszowy">>,<<"mieszkaniefgxxdda">>,
<<"444">>]
```Credits
=======Credits to Fernando Benavides from Inaka and his sample project for jinterface
https://github.com/elbrujohalcon/jinterface-sample/