https://github.com/codelibs/elasticsearch-analysis-kuromoji-unidic
Elasticsearch's Analyzer for Kuromoji with UniDic
https://github.com/codelibs/elasticsearch-analysis-kuromoji-unidic
Last synced: 3 months ago
JSON representation
Elasticsearch's Analyzer for Kuromoji with UniDic
- Host: GitHub
- URL: https://github.com/codelibs/elasticsearch-analysis-kuromoji-unidic
- Owner: codelibs
- License: apache-2.0
- Created: 2018-09-27T21:37:58.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2019-02-21T07:03:02.000Z (over 6 years ago)
- Last Synced: 2025-01-28T10:36:54.903Z (5 months ago)
- Language: Java
- Homepage:
- Size: 27.3 KB
- Stars: 3
- Watchers: 9
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
Elasticsearch Analysis Kuromoji Unidic
=======================## Overview
Elasticsearch Analysis Kuromoji Plugin provides Tokenizer/CharFilter/TokenFilter for Kuromoji with Unidic.
## Version
[Versions in Maven Repository](http://central.maven.org/maven2/org/codelibs/elasticsearch-analysis-kuromoji-unidic/)
### Issues/Questions
Please file an [issue](https://github.com/codelibs/elasticsearch-analysis-kuromoji-unidic/issues "issue").
(Japanese forum is [here](https://github.com/codelibs/codelibs-ja-forum "here").)## Installation
$ $ES_HOME/bin/elasticsearch-plugin install org.codelibs:elasticsearch-analysis-kuromoji-unidic:6.4.0
## References
### Analyzer, Tokenizer, TokenFilter, CharFilter
The plugin includes these analyzer and tokenizer, tokenfilter.
| name | type |
|:-----------------------------------------|:-----------:|
| kuromoji\_unidic\_iteration\_mark | charfilter |
| kuromoji\_unidic | analyzer |
| kuromoji\_unidic\_tokenizer | tokenizer |
| kuromoji\_unidic\_baseform | tokenfilter |
| kuromoji\_unidic\_part\_of\_speech | tokenfilter |
| kuromoji\_unidic\_readingform | tokenfilter |
| kuromoji\_unidic\_stemmer | tokenfilter |### Usage
See [Elasticsearch Kuromoji](https://github.com/elastic/elasticsearch-analysis-kuromoji "elasticsearch-analysis-kuromoji").
### Update Kuromoji Jar File
If you want to replace with the latest Lucene Unidic jar file, download it from https://maven.codelibs.org/org/codelibs/lucene-analyzers-kuromoji-unidic/ and then replace old file in $ES_HOME/plugins/analysis-kuromoji-unidic.
### What is UniDic
See [mecab-unidic](https://ja.osdn.net/projects/unidic/).
## Use Lucene Kuromoji for UniDic
If you want to use Lucene Kuromoji for Unidic in your application other than elasticsearch, you can use lucene-analyzers-kuromoji-unidic jar file, not this plugin.
To use the jar file, put the following settings into your pom.xml....
codelibs.org
CodeLibs Repository
https://maven.codelibs.org/
...
org.codelibs
lucene-analyzers-kuromoji-unidic
7.4.0-2_1_2
...See [CodeLibs Maven Repository](https://maven.codelibs.org/org/codelibs/lucene-analyzers-kuromoji-unidic/).