https://github.com/mr-chang95/twitter_datawrangling
Twitter Data Wrangling for Udacity's Data Analyst Nanodegree Program
https://github.com/mr-chang95/twitter_datawrangling
data-visualization data-wrangling dogs matplotlib numpy pandas python twitter
Last synced: 3 months ago
JSON representation
Twitter Data Wrangling for Udacity's Data Analyst Nanodegree Program
- Host: GitHub
- URL: https://github.com/mr-chang95/twitter_datawrangling
- Owner: Mr-Chang95
- Created: 2021-12-14T21:05:31.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2022-02-08T21:01:01.000Z (over 4 years ago)
- Last Synced: 2025-01-26T15:34:12.203Z (over 1 year ago)
- Topics: data-visualization, data-wrangling, dogs, matplotlib, numpy, pandas, python, twitter
- Language: Jupyter Notebook
- Homepage:
- Size: 1.13 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Twitter Data Wrangling - WeRateDogs

Date First Uploaded: 12/14/21
## Project Overview
In this project, I gathered data from the archives of a Twitter account called [WeRateDogs](https://twitter.com/dog_rates) which rates owned dogs in their tweets and adds a humorous comment with it. My primary goal for this project is to practice the data analysis process, specifically the wrangling data phase. Here are some steps I took:
~~~~~
- Gathered retweet and like counts for all the tweets from the Twitter API using the access library tweepy
- Read the data into a pandas dataframe in the Jupyter Notebook.
- Downloaded a tsv file about the tweets programmatically using the requests and BeautifulSoup libraries in python.
- Assessed quality and tidiness issues in the gathered datasets and cleaned all of them using pandas functions.
- Analyzed the cleaned tables and created visualizations using the matplotlib library.
~~~~~
## Packages
~~~~~~
- pandas
- numpy
- matplotlib
- requests
- tweepy
- json
~~~~~
## Licensing, Authors, Acknowledgements
Special thanks to WeRateDogs for allowing me to work with their data. I would also like to thank Udacity for giving me this oppurtunity.