Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/fabioba/netflix-analytics
Analyze personal Netflix usage
https://github.com/fabioba/netflix-analytics
airflow astronomer bigquery google-cloud-platform netflix tableau
Last synced: 10 days ago
JSON representation
Analyze personal Netflix usage
- Host: GitHub
- URL: https://github.com/fabioba/netflix-analytics
- Owner: fabioba
- Created: 2025-01-06T13:54:50.000Z (23 days ago)
- Default Branch: main
- Last Pushed: 2025-01-18T21:23:19.000Z (10 days ago)
- Last Synced: 2025-01-18T22:24:44.482Z (10 days ago)
- Topics: airflow, astronomer, bigquery, google-cloud-platform, netflix, tableau
- Language: Jupyter Notebook
- Homepage:
- Size: 1.08 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# netflix-analytics
Analyze personal Netflix usage- [Goal](#goal)
- [Business Questions](#business-questions)
- [Technical Guide](#technical-guide)
- [Data Flow](#data-flow)
- [Data Source](#data-source)
- [Data Storage](#data-storage)
- [Visualization](#visualization)## Goal
This project aims to analyze Netflix usage from the first day of subscription (April 2024) until the end of the year.### Business Questions
- what are the most watched movie category?
- how often did I watch movies per week?
- what is the most engaged movie genre?## Technical Guide
In order to reach the goal these are the steps to follow to develop the system:
1. ingest data from Netflix
2. for each movie, extract the details (via the [TMDB](https://www.themoviedb.org/) API)
3. create the star schema### Data Flow
![img](documentation/img/data_flow_adj.png)#### Data Sources
- Netflix data: the dataset extracted at the end of 2024 is related to the movies streamed on the platform (title, date of stream)
- TMDB API: extract movie details (release date, genre, ...)#### Data Storage
- Google Cloud Storage: store the RAW data (data lake)
- Google BigQuery: load raw data and transform (star schema)```
- DIM_MOVIE: ID_MOVIE (PK)
- DIM_GENRE: ID_GENRE (PK)
- FCT_STREAMING: FCT_STREAMING_ID (PK)
- BRIDGE_MOVIE_GENRE: ID_MOVIE (FK), ID_GENRE (FK)
```![img](documentation/img/physical_model.png)
#### Visualization
The outcome of the star schema is a layer that is read from [this](https://public.tableau.com/app/profile/fabi5755/viz/NetflixAnalytics_17372167014910/Dashboard1#1) dashboard on Tableau.
![img](documentation/viz/dashboard.png)