https://github.com/mr-chang95/twitter_datawrangling

Twitter Data Wrangling for Udacity's Data Analyst Nanodegree Program
data-visualization data-wrangling dogs matplotlib numpy pandas python twitter

# Twitter Data Wrangling - WeRateDogs

Date First Uploaded: 12/14/21

## Project Overview
In this project, I gathered data from the archives of [WeRateDogs](https://twitter.com/dog_rates), a Twitter account that rates people's dogs and pairs each rating with a humorous comment. My primary goal was to practice the data analysis process, specifically the data wrangling phase. Here are the steps I took:
~~~~~
- Gathered retweet and like counts for all the tweets from the Twitter API using the tweepy library.
- Read the data into a pandas DataFrame in a Jupyter Notebook.
- Downloaded a TSV file about the tweets programmatically using the requests and BeautifulSoup libraries in Python.
- Assessed quality and tidiness issues in the gathered datasets and cleaned all of them using pandas functions.
- Analyzed the cleaned tables and created visualizations using the matplotlib library.
~~~~~
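
As a rough illustration of the first two steps, here is a minimal sketch of pulling retweet and like counts out of saved API responses. It is not the notebook's actual code. It assumes the tweets were stored as one JSON object per line, and the sample data and the `parse_tweet_counts` helper are hypothetical. The field names `id`, `retweet_count`, and `favorite_count` follow the Twitter API v1.1 tweet object.

```python
import json

# Hypothetical sample data: one JSON-encoded tweet per line, as it might be
# saved after querying the API. Only the fields used below are included.
SAMPLE_LINES = [
    '{"id": 1001, "retweet_count": 12, "favorite_count": 40}',
    '{"id": 1002, "retweet_count": 3, "favorite_count": 9}',
    '',  # blank lines are skipped
]


def parse_tweet_counts(lines):
    """Return one dict per tweet with its ID, retweet count, and like count."""
    records = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        tweet = json.loads(line)
        records.append({
            "tweet_id": tweet["id"],
            "retweet_count": tweet["retweet_count"],
            "favorite_count": tweet["favorite_count"],
        })
    return records


records = parse_tweet_counts(SAMPLE_LINES)
print(records)
```

A list of dicts like `records` can be passed directly to `pandas.DataFrame(records)` to get one row per tweet, ready to merge with the other tables on `tweet_id`.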

## Packages
~~~~~
- pandas
- numpy
- matplotlib
- requests
- tweepy
- json
~~~~~
## Licensing, Authors, Acknowledgements
Special thanks to WeRateDogs for allowing me to work with their data. I would also like to thank Udacity for giving me this opportunity.