https://github.com/buganini/elasticsearch-token-flinger-plugin
A smart elasticsearch token filter plugin
- Host: GitHub
- URL: https://github.com/buganini/elasticsearch-token-flinger-plugin
- Owner: buganini
- Created: 2017-10-12T19:45:57.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2017-10-12T19:46:31.000Z (over 8 years ago)
- Last Synced: 2025-05-15T23:12:32.269Z (about 1 year ago)
- Language: Java
- Size: 64.5 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
Token-Flinger tries to identify the type of each token and then applies the appropriate filter to it.
## Current implementation
Type | Filter
-----|-------
Latin | Unchanged
CJK | NGram
Other | NGram
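
The dispatch in the table above can be pictured with a small classifier. This is only a sketch, not the plugin's code: the class `TokenTypeSketch`, the enum `TokenType`, and the rule "every code point must belong to the script family" are assumptions made for illustration, chosen because they agree with the example rows in this README (for instance, `abc5566` contains digits and so falls under Other).

```java
import java.lang.Character.UnicodeScript;

// Hypothetical sketch: names and classification rule are assumptions, not the plugin's API.
class TokenTypeSketch {
    enum TokenType { LATIN, CJK, OTHER }

    static TokenType classify(String token) {
        if (token.isEmpty()) {
            return TokenType.OTHER;
        }
        boolean allLatin = true;
        boolean allCjk = true;
        for (int cp : token.codePoints().toArray()) {
            UnicodeScript script = UnicodeScript.of(cp);
            // Combining marks have script INHERITED; accepting them keeps NFD text Latin.
            if (script != UnicodeScript.LATIN && script != UnicodeScript.INHERITED) {
                allLatin = false;
            }
            if (script != UnicodeScript.HAN && script != UnicodeScript.HIRAGANA
                    && script != UnicodeScript.KATAKANA && script != UnicodeScript.HANGUL) {
                allCjk = false;
            }
        }
        if (allLatin) {
            return TokenType.LATIN;
        }
        return allCjk ? TokenType.CJK : TokenType.OTHER;
    }

    public static void main(String[] args) {
        // Prints the assumed type for a few tokens taken from the examples in this README.
        for (String token : new String[] {"test", "abc5566", "Citroe\u0308n", "한글"}) {
            System.out.println(token + " -> " + classify(token));
        }
    }
}
```

Under this assumption, a Latin token would be passed through unchanged, and CJK or Other tokens would be handed to an n-gram filter.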
## Example (test.py)
Input | Output
------|-------
test abc5566 | test abc abc5 abc55 bc5 bc55 bc556 c55 c556 c5566 556 5566 566
abc123 Citroën (NFC) | abc abc1 abc12 bc1 bc12 bc123 c12 c123 123 Citroën NFC
abc123 Citroën (NFD) | abc abc1 abc12 bc1 bc12 bc123 c12 c123 123 Citroën NFD
test阿茲海默症初めまして한글 | test 阿茲 阿茲海 茲海 茲海默 海默 海默症 默症 初め まし まして して 한글
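
The n-gram outputs in the table above follow a recognizable pattern: for each start position, every gram from a minimum to a maximum length is emitted. The sketch below reproduces that ordering. The class `NGramSketch` is hypothetical, and the sizes used (3 to 5 for alphanumeric tokens, 2 to 3 for the Chinese segment) are inferred from the example rows rather than taken from the plugin's configuration. It does not attempt to reproduce the Japanese segment, whose output is not a plain 2 to 3 gram expansion.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: gram sizes are inferred from the README examples, not from the plugin.
class NGramSketch {
    // Emits, for each start position, all grams of length minGram..maxGram that fit.
    static List<String> ngrams(String token, int minGram, int maxGram) {
        // Work on code points so characters outside the BMP are never split.
        int[] cps = token.codePoints().toArray();
        List<String> out = new ArrayList<>();
        for (int start = 0; start < cps.length; start++) {
            for (int len = minGram; len <= maxGram && start + len <= cps.length; len++) {
                out.add(new String(cps, start, len));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(String.join(" ", ngrams("abc123", 3, 5)));
        System.out.println(String.join(" ", ngrams("阿茲海默症", 2, 3)));
    }
}
```

With these assumed sizes, `ngrams("abc123", 3, 5)` yields the same sequence as the `abc123` portion of the second example row. A token shorter than `minGram` yields no grams in this sketch.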
## Compilation & Installation
```
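cd token-flinger
gradle assemble
# Set VERSION to the built plugin version, and adjust the path to
# elasticsearch-plugin so it points into your Elasticsearch installation.
sudo bin/elasticsearch-plugin install file://`readlink -f build/distributions/token-flinger-${VERSION}.zip`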
```
-----------------------------
This project is sponsored by [Gamela Enterprise Co., Ltd.](https://www.gamela.com.tw)