Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/offchan42/thai-thesis-classification

Classify each document inside the corpus using Python machine learning module: scikit-learn
https://github.com/offchan42/thai-thesis-classification

nlp python python2 scikit-learn segment thai thai-language thai-thesis-classification

Last synced: 7 days ago
JSON representation

Classify each document inside the corpus using Python machine learning module: scikit-learn

Awesome Lists containing this project

README

        

# Thai-thesis-classification
Classify each document based on its actual class. _Supervised Learning_ :sparkles:

All .ipynb files can be opened using IPython notebook (currently known as Jupyter notebook)

## Follow files in this order:
1. change-file-encoding
2. segment-words
3. classify-docs
4. segment-words-unlabeled
5. predict-new-docs
6. predict-plagiarism-docs