https://github.com/sirnicholas1st/nlp_classification
Creating a Steam review classifier with God of War data that was scrapped in a previous project.
https://github.com/sirnicholas1st/nlp_classification
Last synced: 23 days ago
JSON representation
Creating a Steam review classifier with God of War data that was scrapped in a previous project.
- Host: GitHub
- URL: https://github.com/sirnicholas1st/nlp_classification
- Owner: SirNicholas1st
- Created: 2023-05-01T05:47:07.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2023-05-05T16:27:34.000Z (over 2 years ago)
- Last Synced: 2025-03-04T19:39:37.314Z (7 months ago)
- Language: Jupyter Notebook
- Size: 3.18 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# nlp_classification
This project involves training a Spacy NLP model on a dataset of Steam reviews to classify them as positive or negative.## Data Collection
The first step of the project is collecting data from the Steam website. The reviews were scraped in a previous project.## Data Cleaning
The dataset is then cleaned by dropping empty review rows and columns that won't be needed for NLP. The data is checked for NaN values and balanced using an equal number of positive and negative reviews.## Training the Model
The cleaned data is used to train a Spacy NLP model using binary classification. The model is trained to assign binary data to reviews, i.e., recommended or not recommended.## Testing the Model
The saved model is then loaded and tested for a few reviews.## Conclusion
This project showcases how Spacy NLP can be used to train a model to classify Steam reviews as positive or negative. The model can be further fine-tuned to improve accuracy, and the techniques used here can be applied to other NLP classification tasks.