Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/aaaastark/textclassification-nlp-project

Dataset preparation, Feature Engineering, Model Building, Summary Statistics, Data Exploration by Visualizations, and Save File CSV
https://github.com/aaaastark/textclassification-nlp-project

matplotlib nltk numpy pandas python seaborn sklearn textblob wordcloud

Last synced: about 1 month ago
JSON representation

Dataset preparation, Feature Engineering, Model Building, Summary Statistics, Data Exploration by Visualizations, and Save File CSV

Awesome Lists containing this project

README

        

# TextClassification-NLP-Project
Dataset preparation, Feature Engineering, Model Building, Summary Statistics, Data Exploration by Visualizations, and Save File CSV.

## NOTE: Open the GPU (Google Colab)

# 1. Dataset preparation
* Data Set Load (Movies - TV)
* Train, Test and Encoder (Train and Validation)

# 2. Feature Engineering
2.1 Count Vectors as features

2.2 TF-IDF Vectors as features
* Word level
* N-Gram level
* Character level

2.3 Text / NLP based features

# 3. Model Building
3.1 Naive Bayes Classifier (Accuray, Confusion Matrix, Precision, Recall Score, F1 Score)

3.2 Linear Classifier (Accuray, Confusion Matrix, Precision, Recall Score, F1 Score)

3.3 Support Vector Machine (Accuray, Confusion Matrix, Precision, Recall Score, F1 Score)

# 4. Summary Statistics

# 5. Data Exploration by Visualizations

5.1 Word Density

5.2 Rating Text

5.3 Create Word Clouds

* 5.3.1 Word Cloud for Rating (Label) Score is 5, 4, 3, 2, 1

# 6. Save File CSV