Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/kyubyong/koparadigm
KoParadigm: Korean Inflectional Paradigm Generator
https://github.com/kyubyong/koparadigm
inflection korean linguistics morphology nlp paradigm
Last synced: 5 days ago
JSON representation
KoParadigm: Korean Inflectional Paradigm Generator
- Host: GitHub
- URL: https://github.com/kyubyong/koparadigm
- Owner: Kyubyong
- License: apache-2.0
- Created: 2019-06-24T18:26:09.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2022-11-23T09:08:31.000Z (almost 2 years ago)
- Last Synced: 2024-11-01T20:12:28.458Z (13 days ago)
- Topics: inflection, korean, linguistics, morphology, nlp, paradigm
- Language: Python
- Homepage:
- Size: 6.23 MB
- Stars: 55
- Watchers: 5
- Forks: 7
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# KoParadigm: A Korean Conjugation Paradigm Generator
This is the offical repo for our paper: [KoParadigm: A Korean Conjugation Paradigm Generator](https://arxiv.org/abs/2004.13221)
(Inflectional) paradigm means the set of all the inflected forms of a word. For example, English verb "look" has inflected forms like "look", "look-s", "look-ed", and "look-ing", as all of you know.
Paradigms are widely used in corpus linguistics or search engines.
To create the full paradigm set of a language is sometimes tricky. It is particularly so when we deal with a morphologically rich language like Korean.
Inflection of Korean verbs is notorisouly complicated. Typically, a Korean verb can combine with more than 100 endings. What is worse, the combination rules are not simple at all.
They are determined by the sound of the verb/ending, and the part-of-speech of the verb (action / descriptive). That's why so far there's no open sources of Korean paradigm generator, I think.
Here's the first one. With KoParadigm, you can easily get the full paradigm of a Korean verb.
## Dependencies
* python >=3.6
* jamo >=0.4.1
* xlrd == 1.2.0## Installation
```
pip install koparadigm
```## Usage
```
>>> from koparadigm import Paradigm, prettify
>>> p = Paradigm()
>>> verb = "곱" # Note that you must drop the final ending 다
>>> paradigms = p.conjugate(verb) # this returns list of lists
>>> print(paradigms)
[['Action Verb', [('거나', '곱거나'), ('거늘', '곱거늘'), ('거니', '곱거니') ...]]]
>>> prettify(paradigms)
POS = Action Verb
• ending = 거나 form = 곱거나
• ending = 거늘 form = 곱거늘
• ending = 거니 form = 곱거니
...
==================== 2 ====================
POS = Descriptive Verb
• ending = 거나 form = 곱거나
• ending = 거늘 form = 곱거늘
• ending = 거니 form = 곱거니
• ending = 거니와 form = 곱거니와
...```
## References
If you use our software for research, please cite:```
@article{park2020KoParadigm,
author = {Park, Kyubyong },
title={KoParadigm: A Korean Conjugation Paradigm Generator},
journal={arXiv preprint arXiv:2004.13221},
year={2020}
}
```