{"id":26868748,"url":"https://github.com/githubasr2001/x_sentimentanalysis","last_synced_at":"2025-03-31T05:34:52.743Z","repository":{"id":222762261,"uuid":"758311994","full_name":"githubasr2001/X_SentimentAnalysis","owner":"githubasr2001","description":null,"archived":false,"fork":false,"pushed_at":"2024-02-16T03:30:17.000Z","size":1824,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2024-02-16T04:28:31.040Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/githubasr2001.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null}},"created_at":"2024-02-16T03:27:23.000Z","updated_at":"2024-02-16T04:28:32.903Z","dependencies_parsed_at":"2024-02-16T04:28:32.698Z","dependency_job_id":"a2ce1abc-a786-4a92-9b4d-3b9c4fb39ffa","html_url":"https://github.com/githubasr2001/X_SentimentAnalysis","commit_stats":null,"previous_names":["githubasr2001/x_sentimentanalysis"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/githubasr2001%2FX_SentimentAnalysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/githubasr2001%2FX_SentimentAnalysis/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/githubasr2001%2FX_SentimentAnalysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/githubasr2001%2FX_SentimentAnalysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitH
ub/owners/githubasr2001","download_url":"https://codeload.github.com/githubasr2001/X_SentimentAnalysis/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246423524,"owners_count":20774796,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-03-31T05:34:52.086Z","updated_at":"2025-03-31T05:34:52.726Z","avatar_url":"https://github.com/githubasr2001.png","language":"Jupyter Notebook","readme":"\n# Twitter Sentiment Analysis on Vaccination Tweets\n\n## Project Overview\n\nThis project focuses on analyzing public sentiment towards vaccinations through tweets collected from Twitter. Using Python, NLP techniques, and machine learning, we aim to uncover the general public's opinions, concerns, and attitudes towards vaccination efforts globally.\n\n## Motivation\n\nIn the wake of global health challenges, understanding public sentiment towards vaccinations is crucial for policymakers, health organizations, and the general public. This project seeks to provide insights into these sentiments by analyzing tweets related to vaccinations.\n\n## Data Source\n\nThe dataset `vaccination_all_tweets.csv` contains tweets related to vaccinations, including user information and tweet content. This data serves as the foundation for our sentiment analysis.\n\n## Requirements\n\n- Python 3.x\n- Pandas\n- NumPy\n- NLTK\n- Scikit-learn\n- Matplotlib\n- Seaborn\n- TextBlob\n\n## Installation and Setup\n\nEnsure you have Python installed on your system. 
You can then install the necessary libraries using pip:\n\n```bash\npip install pandas numpy nltk scikit-learn matplotlib seaborn textblob\n```\n\n## Data Preprocessing\n\nThe preprocessing steps include:\n\n- Cleaning text data by removing URLs, mentions, and special characters.\n- Lowercasing all text for consistency.\n- Tokenizing the text and removing stopwords.\n- Applying lemmatization to reduce each word to its dictionary form (lemma).\n\n## Sentiment Analysis\n\nWe used TextBlob to assign a sentiment polarity score to each tweet and labeled each tweet as positive, negative, or neutral based on that score.\n\n## Machine Learning Model\n\nA Logistic Regression model was trained on TF-IDF-vectorized tweet text to predict each tweet's sentiment. We used `CountVectorizer` to convert text data into a matrix of token counts and `TfidfTransformer` to compute TF-IDF values.\n\n## Evaluation\n\nThe model's performance was evaluated using accuracy, precision, recall, and F1-score metrics.\n\n## Visualization\n\nWe visualized the distribution of sentiments and created word clouds to display the most frequent words in positive, negative, and neutral tweets.\n\n## How to Use\n\n1. Clone the repository to your local machine.\n2. Load your own dataset or use the provided `vaccination_all_tweets.csv`.\n3. Run the preprocessing script to clean and prepare the data.\n4. Execute the sentiment analysis notebook to analyze and visualize sentiments.\n5. 
Train the machine learning model using the training script.\n\n## Future Work\n\n- Explore more advanced models like neural networks for better accuracy.\n- Implement other vectorization techniques like Word2Vec or GloVe.\n- Expand the dataset to include tweets in different languages.\n\n \n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgithubasr2001%2Fx_sentimentanalysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgithubasr2001%2Fx_sentimentanalysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgithubasr2001%2Fx_sentimentanalysis/lists"}