An open API service indexing awesome lists of open source software.

https://github.com/howl-anderson/microtagger

一个微型的用于提取 Part-Of-Speech (POS) 的 Python 包 | A micro python library for NLP Tagger of Part-Of-Speech (POS)
https://github.com/howl-anderson/microtagger

hmm-viterbi-algorithm nlp nlp-parsing part-of-speech-tagger

Last synced: 4 months ago
JSON representation

一个微型的用于提取 Part-Of-Speech (POS) 的 Python 包 | A micro python library for NLP Tagger of Part-Of-Speech (POS)

Awesome Lists containing this project

README

        

[中文版本的 README](README.md)
------------------------------

# MicroTagger
A micro python library for NLP Tagger of Part-Of-Speech (POS)

# Installation
```bash
pip install -r requirements.txt
```

# Python version
Only test by using Python3

# Usage
```python
#!/usr/bin/env python

from MicroTagger.hmm import HMMTagger

hmm_tagger = HMMTagger.load()

result = hmm_tagger.predict("知识 就是 力量 。")
print(result)
```

Output:
```text
知识/m 就是/d 力量/n 。/wj
```

# Performance

| ID | Model Name | Accuracy |
|:---|:-----------------------------|:---------|
| 1 | Most Frequent Class Baseline | 0.83703 |
| 2 | HMM | 0.92339 |

# Reference
[Speech and Language Processing > Part-of-Speech Tagging]([](https://web.stanford.edu/~jurafsky/slp3/10.pdf))