https://github.com/pythainlp/thai-text-generator
Thai Text Generator
https://github.com/pythainlp/thai-text-generator
thai-language thai-nlp thai-nlp-library
Last synced: 5 months ago
JSON representation
Thai Text Generator
- Host: GitHub
- URL: https://github.com/pythainlp/thai-text-generator
- Owner: PyThaiNLP
- License: apache-2.0
- Archived: true
- Created: 2020-05-23T13:08:43.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2024-12-24T13:45:11.000Z (10 months ago)
- Last Synced: 2025-04-19T22:02:23.152Z (6 months ago)
- Topics: thai-language, thai-nlp, thai-nlp-library
- Language: Python
- Size: 39.1 KB
- Stars: 4
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# TTG : Thai Text Generator
[](https://colab.research.google.com/drive/1X6D8J0sWNi8UgJi7Hk5YL4FqepZ7laxS?usp=sharing]Thai Text Generator
> Moved to https://github.com/PyThaiNLP/pythainlp
## Install
```sh
pip install ttg
```if you want used thai2fit, you can install :
```sh
pip install ttg[thai2fit]
```## Example
```python
from thaitextgenerator import *
a=Unigram()
print(a.gen_sentence(N=10))
b=Bigram()
print(b.gen_sentence(start_seq="คน", N=10))
c=Tigram()
c.gen_sentence(N=10, start_seq=("คน","ดี"))
# Thai2Fit
from thaitextgenerator.thai2fit import gen_sentence
gen_sentence()
```## Docs
### N-Gram
**import**
```python
from thaitextgenerator import *
```
#### Unigramwe support corpus
- TNC : Thai National Corpus [source](https://github.com/korakot/thainlp) (Default)
- TCC : Thai Textbook Corpus [source](https://github.com/korakot/thainlp)
- oscar : Open Super-large Crawled ALMAnaCH coRpus [source](https://www.facebook.com/groups/colab.thailand/permalink/1524070061101680/)```python
Unigram(name = "tnc or tcc or oscar")Unigram().gen_sentence(N:int=3,prob:float=0.001, start_seq:str=None ,output_str:bool = True, duplicate:bool=False)
```#### Bigram
we support corpus
- TNC : Thai National Corpus (Default) [source](http://www.arts.chula.ac.th/ling/tnc/)
```python
Bigram(name = "tnc")Bigram().gen_sentence(N:int=4,prob:float=0.001, start_seq:str=None, output_str:bool = True, duplicate:bool=False)
```#### Tigram
we support corpus
- TNC : Thai National Corpus (Default) [source](http://www.arts.chula.ac.th/ling/tnc/)
```python
Tigram(name = "tnc")Tigram().gen_sentence(N:int=4,prob:float=0.001, start_seq:tuple=None, output_str:bool = True, duplicate:bool=False)
```### Thai2Fit
[source](https://github.com/cstorm125/thai2fit)
```python
# Thai2Fit
from thaitextgenerator.thai2fit import gen_sentence
gen_sentence(N:int=4,prob:float=0.001, start_seq:str=None, output_str:bool = True)
```## Citations
```
Wannaphong Phatthiyaphaibun. (2020, May 24). TTG: Thai Text Generator. Zenodo. http://doi.org/10.5281/zenodo.3855426
```BibTeX entry:
``` bib
@software{wannaphong_phatthiyaphaibun_2020_3855427,
author = {Wannaphong Phatthiyaphaibun},
title = {{TTG: Thai Text Generator}},
month = may,
year = 2020,
publisher = {Zenodo},
doi = {10.5281/zenodo.3855426},
url = {https://doi.org/10.5281/zenodo.3855426}
}
```## License
```
Copyright 2020 Wannaphong PhatthiyaphaibunLicensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License athttp://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
```