https://github.com/groonga/groonga-tokenizer-friso
https://github.com/groonga/groonga-tokenizer-friso
Last synced: 11 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/groonga/groonga-tokenizer-friso
- Owner: groonga
- License: lgpl-2.1
- Created: 2014-01-10T18:43:53.000Z (over 12 years ago)
- Default Branch: master
- Last Pushed: 2021-02-15T01:34:33.000Z (over 5 years ago)
- Last Synced: 2025-02-10T11:45:22.363Z (over 1 year ago)
- Language: Shell
- Size: 49.8 KB
- Stars: 1
- Watchers: 5
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Changelog: NEWS
- License: COPYING
- Authors: AUTHORS
Awesome Lists containing this project
README
# Friso tokenizer for Groonga
## Install
Install libgroonga-dev, friso in the advance.
* Groonga http://groonga.org/
* Friso https://github.com/lionsoul2014/friso
Build this tokenizer.
% ./configure --with-friso=PATH_TO_FRISO_DIR --with-friso-ini=PATH_TO_INI_FILE
% make
% sudo make install
Friso is installed into `/usr/lib` and `/usr/include/friso` by default. In this case, you should specify `--with-friso=/usr` option.
## Usage
Register `tokenizers/friso`:
% groonga DB
> register tokenizers/friso
[[0,0.0,0.0],true]
Now, you can use `TokenFriso` that tokenizes the sentence by each word segmentation:
> tokenize TokenFriso "Groonga是一款可嵌入式的全文搜寻引擎"
[
[
0,
0.0,
0.0
],
[
{
"value": "Groonga",
"position": 0
},
{
"value": "是",
"position": 1
},
{
"value": "一款",
"position": 2
},
{
"value": "可",
"position": 3
},
{
"value": "嵌入",
"position": 4
},
{
"value": "式",
"position": 5
},
{
"value": "的",
"position": 6
},
{
"value": "全文",
"position": 7
},
{
"value": "搜寻引擎",
"position": 8
}
]
]
## License
LGPLv2.1. It is same as Groonga.