Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/yashika-malhotra/data-exploration-and-visualization-for-streaming-platform

Data Analysis and Visualization for streaming platform to provide insights and recommendations to improve their userbase.
https://github.com/yashika-malhotra/data-exploration-and-visualization-for-streaming-platform

colab-notebook data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: about 1 month ago
JSON representation

Data Analysis and Visualization for streaming platform to provide insights and recommendations to improve their userbase.

Awesome Lists containing this project

README

        

# Data Exploration and Visualization for Streaming Platform

![NetflixIntroGIF (2)](https://github.com/yashika-malhotra/Netflix-Data-Exploration-and-Visualization/assets/154385962/2b675d0e-d401-4a8f-a2d0-1974fcae42ca)

## Streaming Platform Data: Cleaning, Analysis and Visualization
Data Analysis and Visualization on Netflix Data to provide insights and recommendations to improve their userbase.

Column | #Description |
--- | ---
Show_id | Unique ID for every Movie / Tv Show |
Type | Identifier - A Movie or TV Show |
Title | Title of the Movie / Tv Show |
Director | Director of the Movie |
Cast | Actors involved in the movie/show |
Country | Country where the movie/show was produced |
Date_added | Date it was added on Netflix |
Release_year | Actual Release year of the movie/show |
Rating | TV Rating of the movie/show |
Duration | Total Duration - in minutes or number of seasons |
listed_in | Genre |
description | The summary description |

Netflix is one of the most popular media and video streaming platforms. They have over 10000 movies or tv shows available on their platform, as of mid-2021, they have over 222M Subscribers globally. This tabular dataset consists of listings of all the movies and tv shows available on Netflix, along with details such as - cast, directors, ratings, release year, duration, etc.

Aim while exploring this dataset is to analyze the data and generate insights that could help in deciding which type of shows or movies to produce and how they can grow the business in different countries.

We are interested in increasing the revenue of Netflix, hence our main objective is to figure out which all shows and movies performed the best.

## Performed following Tasks
1. Data Cleaning
2. Analysis
3. Visualization

- Observations on the shape of data, data types of all the attributes, conversion of categorical attributes to 'category', missing value detection, statistical summary

- Non-Graphical Analysis: Value counts and unique attributes ​
- Visual Analysis - Univariate, Bivariate after pre-processing of the data
- For continuous variable(s): Distplot, countplot, histogram for univariate analysis
- For categorical variable(s): Boxplot
- For correlation: Heatmaps, Pairplots
- Missing Value & Outlier check

- Insights based on Non-Graphical and Visual Analysis

. Comments on the range of attributes

. Comments on the distribution of the variables and relationship between them

. Comments for each univariate and bivariate plot

- Business Insights & Recommendations : Includes patterns observed in the data along with what can infer from it

### Description about files in repository Backhand Index Pointing Down Light Skin Tone:

**Netflix_Business_Case_Study.ipynb** - Colaboratory notebook containing the code for analysis