https://github.com/hungreeee/twitter-sentiment-classification

Twitter sentiment classification using multiple Scikit-learn models and PyTorch neural networks.
https://github.com/hungreeee/twitter-sentiment-classification

deep-learning machine-learning nlp pytorch sentiment-classification sklearn

Last synced: 3 months ago
JSON representation

Twitter sentiment classification using multiple Scikit-learn models and PyTorch neural networks.

Host: GitHub
URL: https://github.com/hungreeee/twitter-sentiment-classification
Owner: Hungreeee
Created: 2023-08-29T20:41:28.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2023-10-18T16:24:49.000Z (over 1 year ago)
Last Synced: 2024-11-13T00:07:57.948Z (5 months ago)
Topics: deep-learning, machine-learning, nlp, pytorch, sentiment-classification, sklearn
Language: Jupyter Notebook
Homepage:
Size: 81.3 MB
Stars: 3
Watchers: 1
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Twitter-Sentiment-Classification

Twitter is a great open source of textual data for training specific natural language processing tasks, considering that there are millions of users expressing their own opinions, emotions or experiences regarding various matters every day. For this, Twitter sentiment analysis is chosen to be the task of interest by many researchers or marketers to investigate how people feel towards certain topics, products, brands, etc.

Within the scope of the project, however, we will solely focus on building, assessing and comparing several machine learning models that are trained specifically for sentiment analysis. The complexity of the models varies from as simple as a Logistic Regression model to a higher level such as the GRU. The main point of the project is to understand how different levels of model complexity affect the classification result with the available resources, given that the average sequence length is short and can be easily modelled. The project should show how the model complexity choice reflects the task complexity, and hopefully give us insights into the importance of pre-evaluation of tasks before modelling.

The project will consist of the following stages:

- Data cleaning:
- Removing noises.
- Negation handling.
- Data modelling
- Scikit-learn models:
- TF-IDF encoding.
- Logistic Regression model.
- Naive Bayes model.
- PyTorch models:
- GloVe encoding.
- Feed-forward neural network (FNN).
- Bi-directional Recurrent neural network (RNN).
- Convolutional neural network (CNN).
- Bi-directional Gated Recurrent unit (GRU).
- Model evaluation and comparisons

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/hungreeee/twitter-sentiment-classification

Awesome Lists containing this project

README