Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/comsavvy/weratedogs-wrangling-and-visualization-analysis
Analysis on WeRateDogs tweets
https://github.com/comsavvy/weratedogs-wrangling-and-visualization-analysis
exploratory-data-analysis ipython-notebook python visualization weratedogs wrangling-data
Last synced: about 1 month ago
JSON representation
Analysis on WeRateDogs tweets
- Host: GitHub
- URL: https://github.com/comsavvy/weratedogs-wrangling-and-visualization-analysis
- Owner: comsavvy
- Created: 2022-08-15T15:31:38.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2022-09-13T12:22:09.000Z (over 2 years ago)
- Last Synced: 2024-10-12T14:43:47.311Z (2 months ago)
- Topics: exploratory-data-analysis, ipython-notebook, python, visualization, weratedogs, wrangling-data
- Language: HTML
- Homepage:
- Size: 4.33 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# **Data Wrangling and Visualization on WeRateDogs Tweets Information**
## **Introduction**
WeRateDogs is a Twitter account that rates people's dogs with a humorous comment about the dog. These ratings almost always have a denominator of 10. The numerators, though almost always greater than 10, e.g., 11/10, 12/10, 13/10, etc., Why? Because "they're good dogs".WeRateDogs has over 4 million followers and has received international media coverage.
Since real-world datasets rarely come clean, I will be using Python and its libraries to gather data from a variety of sources and in a variety of formats, assess its quality and tidiness, and then clean it. This is called **data wrangling**!
## **Data description**
**Twitter archive**: The WeRateDogs Twitter archive contains basic tweet data for all 5000+ of their tweets, but not everything. One column in the archive does contain each tweet's text where ratings, dog name, and dog stage (i.e., doggo, floofer, pupper, and puppo) were extracted. To make this Twitter archive enhanced, out of the 5000+ tweets, only those with ratings were filtered (2366).
**Image prediction**: Every image in the WeRateDogs Twitter archive was classified into different breeds using a neural network algorithm. The results: a table full of image predictions (the top three only) alongside each tweet ID, image URL, and the image number that corresponded to the most confident prediction (numbered 1 to 4, since tweets can have up to four images).
**Additional tweet data**: Additional tweet information will be acquired from Twitter to address and bring out various insights on WeRateDog tweets, such as the number of likes and retweets, etc.
## Analysis
Kindly click [here](https://comsavvy.github.io/WeRateDogs-Wrangling-and-Visualization-Analysis/) to see the analysis in action.
Enjoy!