{"id":23879687,"url":"https://github.com/sayed-ashfaq/netflix_datacleaning","last_synced_at":"2025-08-25T16:15:40.930Z","repository":{"id":269238840,"uuid":"906822329","full_name":"sayed-ashfaq/Netflix_DataCleaning","owner":"sayed-ashfaq","description":" Netflix data analysis highlights significant null values, requiring cleaning and visualization to uncover viewer trends and regional insights. Improving data quality, personalizing recommendations, and targeting untapped markets can drive growth and profitability.","archived":false,"fork":false,"pushed_at":"2024-12-22T02:29:06.000Z","size":2308,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-03T23:34:39.479Z","etag":null,"topics":["matplotlib-pyplot","numpy-library","pandas-library","python","seaborn-plots"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/sayed-ashfaq.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-12-22T02:21:06.000Z","updated_at":"2024-12-22T02:32:23.000Z","dependencies_parsed_at":null,"dependency_job_id":"a3d3016b-169e-4019-bdad-d555bdb2ad80","html_url":"https://github.com/sayed-ashfaq/Netflix_DataCleaning","commit_stats":null,"previous_names":["sayed-ashfaq/netflix_datacleaning"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sayed-ashfaq%2FNetflix_DataCleaning","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sayed-ashfaq%2FNetflix_DataCleaning/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sayed-ashfaq%2FNetflix_DataCleaning/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sayed-ashfaq%2FNetflix_DataCleaning/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/sayed-ashfaq","download_url":"https://codeload.github.com/sayed-ashfaq/Netflix_DataCleaning/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":240250355,"owners_count":19771778,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["matplotlib-pyplot","numpy-library","pandas-library","python","seaborn-plots"],"created_at":"2025-01-03T23:32:14.427Z","updated_at":"2025-02-22T23:43:14.899Z","avatar_url":"https://github.com/sayed-ashfaq.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# Netflix Dataset Analysis\n\n## About Netflix\nNetflix is one of the world's most popular media and video streaming platforms. As of mid-2021, Netflix offers over **10,000 movies and TV shows** and has amassed over **222 million subscribers globally**.  \n\nThis dataset contains a detailed listing of all movies and TV shows available on Netflix, including attributes like cast, directors, ratings, release year, duration, and more. By analyzing this dataset, we aim to generate insights that can help Netflix make data-driven decisions for future content production and business expansion.\n\n---\n\n## Business Problem\nThe primary goal is to analyze the dataset and derive actionable insights that could help Netflix:\n1. Decide which type of shows/movies to produce.\n2. Strategize growth opportunities in different countries.\n\n---\n\n### **Dataset Details**\nThe dataset consists of a comprehensive list of TV shows and movies available on Netflix. Below are the attributes included:\n\n- **`Show_id`**: Unique ID for every Movie/TV Show.\n- **`Type`**: Identifier specifying whether it's a Movie or TV Show.\n- **`Title`**: Title of the Movie/TV Show.\n- **`Director`**: Director of the Movie.\n- **`Cast`**: Actors involved in the Movie/Show.\n- **`Country`**: Country where the Movie/Show was produced.\n- **`Date_added`**: Date the content was added to Netflix.\n- **`Release_year`**: The year the content was originally released.\n- **`Rating`**: TV Rating of the content (e.g., PG, R, TV-MA).\n- **`Duration`**: Total duration, either in minutes (for movies) or the number of seasons (for TV shows).\n- **`Listed_in`**: Genre(s) the content belongs to.\n- **`Description`**: A brief summary of the content.\n\n---\n\n## Objective\nThe analysis will focus on:\n- Understanding the distribution of content across genres, ratings, and countries.\n- Identifying trends in content addition and production.\n- Generating actionable insights to guide Netflix in producing shows/movies and expanding its presence in different regions.\n\n---\n\n## Tools and Libraries\nFor analysis, you can use:\n- **Python Libraries**:\n  - `Pandas` for data manipulation.\n  - `Matplotlib` and `Seaborn` for visualization.\n  - `NumPy` for numerical operations.\n  - `Scikit-learn` for machine learning models, if applicable.\n\n---\n\n## Future Scope\nThis project can be extended by:\n- Building recommendation systems based on user preferences.\n- Predicting trends in viewership for future Netflix productions.\n- Exploring the correlation between ratings, genres, and regional success.\n\n---\n\n## License\nThis project is open for academic and non-commercial purposes. Ensure to credit the dataset source where applicable.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsayed-ashfaq%2Fnetflix_datacleaning","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsayed-ashfaq%2Fnetflix_datacleaning","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsayed-ashfaq%2Fnetflix_datacleaning/lists"}