An open API service indexing awesome lists of open source software.

https://github.com/codelibs/elasticsearch-analysis-kuromoji-unidic

Elasticsearch's Analyzer for Kuromoji with UniDic
https://github.com/codelibs/elasticsearch-analysis-kuromoji-unidic

Last synced: 3 months ago
JSON representation

Elasticsearch's Analyzer for Kuromoji with UniDic

Awesome Lists containing this project

README

        

Elasticsearch Analysis Kuromoji Unidic
=======================

## Overview

Elasticsearch Analysis Kuromoji Plugin provides Tokenizer/CharFilter/TokenFilter for Kuromoji with Unidic.

## Version

[Versions in Maven Repository](http://central.maven.org/maven2/org/codelibs/elasticsearch-analysis-kuromoji-unidic/)

### Issues/Questions

Please file an [issue](https://github.com/codelibs/elasticsearch-analysis-kuromoji-unidic/issues "issue").
(Japanese forum is [here](https://github.com/codelibs/codelibs-ja-forum "here").)

## Installation

$ $ES_HOME/bin/elasticsearch-plugin install org.codelibs:elasticsearch-analysis-kuromoji-unidic:6.4.0

## References

### Analyzer, Tokenizer, TokenFilter, CharFilter

The plugin includes these analyzer and tokenizer, tokenfilter.

| name | type |
|:-----------------------------------------|:-----------:|
| kuromoji\_unidic\_iteration\_mark | charfilter |
| kuromoji\_unidic | analyzer |
| kuromoji\_unidic\_tokenizer | tokenizer |
| kuromoji\_unidic\_baseform | tokenfilter |
| kuromoji\_unidic\_part\_of\_speech | tokenfilter |
| kuromoji\_unidic\_readingform | tokenfilter |
| kuromoji\_unidic\_stemmer | tokenfilter |

### Usage

See [Elasticsearch Kuromoji](https://github.com/elastic/elasticsearch-analysis-kuromoji "elasticsearch-analysis-kuromoji").

### Update Kuromoji Jar File

If you want to replace with the latest Lucene Unidic jar file, download it from https://maven.codelibs.org/org/codelibs/lucene-analyzers-kuromoji-unidic/ and then replace old file in $ES_HOME/plugins/analysis-kuromoji-unidic.

### What is UniDic

See [mecab-unidic](https://ja.osdn.net/projects/unidic/).

## Use Lucene Kuromoji for UniDic

If you want to use Lucene Kuromoji for Unidic in your application other than elasticsearch, you can use lucene-analyzers-kuromoji-unidic jar file, not this plugin.
To use the jar file, put the following settings into your pom.xml.

...


codelibs.org
CodeLibs Repository
https://maven.codelibs.org/


...


org.codelibs
lucene-analyzers-kuromoji-unidic
7.4.0-2_1_2

...

See [CodeLibs Maven Repository](https://maven.codelibs.org/org/codelibs/lucene-analyzers-kuromoji-unidic/).