Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/allitnils/sentiment_analysis
In this project I have analysed Twitter tweets to predict people's sentiments. Predictions could be sentiment inferred from social media posts, reviews, or comments.
https://github.com/allitnils/sentiment_analysis
Last synced: 11 days ago
JSON representation
In this project I have analysed Twitter tweets to predict people's sentiments. Predictions could be sentiment inferred from social media posts, reviews, or comments.
- Host: GitHub
- URL: https://github.com/allitnils/sentiment_analysis
- Owner: allitnils
- Created: 2020-10-19T02:25:33.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2020-11-08T22:57:46.000Z (about 4 years ago)
- Last Synced: 2023-10-05T13:17:29.764Z (about 1 year ago)
- Language: Jupyter Notebook
- Size: 82.9 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# sentiment_analysis
In this project I will investigate the basics of Natural Language Processing (NLP) and aim to predict whether a sample of tweets are either positive or negative. This will consist of combining machine learning principles with text. I will use mathematics and statistics to get the text in a format that algorithms can understand.
Understanding the problem statement and business case
In this case study, we will analyse Twitter tweets to predict people’s sentiment
For instance:
TWEET: “Good morning everyone! Such a beautiful day!!” -> SENTIMENT ANALYSIS (NLP MODEL) -> SENTIMENT: POSITIVE (Label 0)
TWEET: “The food was awful and the waiters rude.” -> SENTIMENT ANALYSIS (NLP MODEL) -> SENTIMENT: NEGATIVE (Label 1)
The objective of this task is to detect hate speech in tweets. For the sake of simplicity, we say a tweet contains hate speech if it has a derogatory or negative sentiment associated with it.
Content
Full tweet texts are provided with their labels for training data.
Mentioned users will be redacted.
1,600,000 tweets extracted using the twitter api . The tweets have been annotated (0 = negative, 4 = positive) and they can be used to detect sentiment .
Data can be obtained from this kaggle dataset: https://www.kaggle.com/kazanova/sentiment140