Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/akabe/ocaml-mecab
MeCab binding for OCaml
https://github.com/akabe/ocaml-mecab
funcional-programming japanese-language machine-learning natural-language-processing ocaml part-of-speech-tagger
Last synced: about 1 month ago
JSON representation
MeCab binding for OCaml
- Host: GitHub
- URL: https://github.com/akabe/ocaml-mecab
- Owner: akabe
- License: mit
- Created: 2017-11-02T13:13:34.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2018-08-03T12:10:38.000Z (over 6 years ago)
- Last Synced: 2024-10-30T19:12:13.886Z (3 months ago)
- Topics: funcional-programming, japanese-language, machine-learning, natural-language-processing, ocaml, part-of-speech-tagger
- Language: OCaml
- Homepage: https://akabe.github.io/ocaml-mecab/
- Size: 87.9 KB
- Stars: 2
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# ocaml-mecab
[![Build Status](https://travis-ci.org/akabe/ocaml-mecab.svg?branch=master)](https://travis-ci.org/akabe/ocaml-mecab)
An OCaml binding of MeCab, a part-of-speech and morphological analyzer.
MeCab is a open-sourced part-of-speech and morphological analyzer independent from languages, dictionaries, and corpuses, developed by a joint project team of
[Graduate School of Informatics in Kyoto University][GSI-KU] and
[NTT Communication Science Laboratories][NTT-CSL].
MeCab is almost faster than [ChaSen][ChaSen], [Juman][Juman], and [KAKASHI][KAKASHI].
The official documentation (written in Japanese) is published at [MeCab: Yet Another Part-of-Speech and Morphological Analyzer][MeCab-ja], and English-translated version is at [MeCab English Documentation][MeCab-en].[GSI-KU]: http://www.i.kyoto-u.ac.jp/en/
[NTT-CSL]: http://www.kecl.ntt.co.jp/rps/english/index_e.html
[ChaSen]: http://chasen-legacy.osdn.jp/
[Juman]: http://nlp.ist.i.kyoto-u.ac.jp/index.php?JUMAN
[KAKASHI]: http://kakasi.namazu.org/index.html.en
[MeCab-ja]: http://taku910.github.io/mecab/
[MeCab-en]: https://github.com/jordwest/mecab-docs-en/blob/master/README.md# Getting started
Install the latest-released version by
```
opam install mecab
```or the latest snapshot by `opam pin add mecab .`.
You can parse a sentense as follows:
```ocaml
# #require "mecab" ;;# let mecab = Mecab.Tagger.create [|""|];;
val mecab : Mecab.Tagger.t =# Mecab.Tagger.sparse_tostr mecab "すもももももももものうち" |> print_endline;;
すもも 名詞,一般,*,*,*,*,すもも,スモモ,スモモ
も 助詞,係助詞,*,*,*,*,も,モ,モ
もも 名詞,一般,*,*,*,*,もも,モモ,モモ
も 助詞,係助詞,*,*,*,*,も,モ,モ
もも 名詞,一般,*,*,*,*,もも,モモ,モモ
の 助詞,連体化,*,*,*,*,の,ノ,ノ
うち 名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ
EOS- : unit = ()
```# Documentation
- API documentation: [https://akabe.github.io/ocaml-mecab/api/](https://akabe.github.io/ocaml-mecab/api/)