Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/callmemaverick/game-of-thrones-investigating-episodes
Data Science project to analyze the duration of Game of Thrones episodes
https://github.com/callmemaverick/game-of-thrones-investigating-episodes
data-science data-visualization matplotlib pandas-python python
Last synced: about 7 hours ago
JSON representation
Data Science project to analyze the duration of Game of Thrones episodes
- Host: GitHub
- URL: https://github.com/callmemaverick/game-of-thrones-investigating-episodes
- Owner: CallMeMaverick
- Created: 2023-08-31T13:05:14.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2023-08-31T13:38:53.000Z (about 1 year ago)
- Last Synced: 2023-09-01T06:27:16.787Z (about 1 year ago)
- Topics: data-science, data-visualization, matplotlib, pandas-python, python
- Language: Python
- Homepage:
- Size: 12.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# ⚔️ Game of Thrones; Investigating Episodes ⚔️
#### After DataCamp Introductory courses
[DataCamp Project](https://app.datacamp.com/workspace/w/08ac9c7a-ede5-40d4-9e0d-35f1b49a5361)Little yet fascinating Data Science project where data about Game of Thrones episodes is being analyzed and visualized.
## 📚 Description
In this project, I have applied the skills I gained in "Introduction to Python" and "Intermediate Python" at DataCamp Career Track. The goal was to explore, analyze, and visualize Game of Thrones data (I'm a big fan), particularly data about episodes and their duration. Pandas, Matplotlit, and other entry-level material from the "Introduction to Python" course had been used.
This project has been aimed at consolidating my knowledge and getting a grasp of solving real-world data science problems.
#### Data has been taken from [data.world](https://data.world/rezaghari/game-of-thrones)
## 🚀 Project Task
#### 1. Overall Episode Duration by Season
* Load data and inspect it
* Create a DataFrame with Durations
* Kinda "clean" data and get a convenient data format to work with
* Visualize data#### 2. Visualizing Each Episode's Duration by Season
* Create a list containing sub-lists corresponding to a season
* Flatten durations (check code for details)
* Define colors for a scatter plot
* Create a scatter plot## 🤝 Contributing
Any contribution is welcome, so please feel free to collaborate on this project!