An open API service indexing awesome lists of open source software.

https://github.com/buganini/elasticsearch-token-flinger-plugin

A smart elasticsearch token filter plugin
https://github.com/buganini/elasticsearch-token-flinger-plugin

Last synced: 10 months ago
JSON representation

A smart elasticsearch token filter plugin

Awesome Lists containing this project

README

          

Token-Flinger try to identify the type of token then apply proper filter on it.

## Current implementation

Type | Filter
-----|-------
Latin | Unchanged
CJK | NGram
Other | NGram

## Example (test.py)

Input | Output
------|-------
test abc5566 | test abc abc5 abc55 bc5 bc55 bc556 c55 c556 c5566 556 5566 566
abc123 Citroën (NFC) | abc abc1 abc12 bc1 bc12 bc123 c12 c123 123 Citroën NFC
abc123 Citroën (NFD) | abc abc1 abc12 bc1 bc12 bc123 c12 c123 123 Citroën NFD
test阿茲海默症初めまして한글 | test 阿茲 阿茲海 茲海 茲海默 海默 海默症 默症 初め まし まして して 한글

## Compilation & Installation

```
cd token-flinger
gradle assemble
sudo bin/elasticsearch-plugin install file://`readlink -f build/distributions/token-flinger-${VERSION}.zip`
```

-----------------------------
This project is sponsored by [Gamela Enterprise Co., Ltd.](https://www.gamela.com.tw)