{"id":19427866,"url":"https://github.com/gappeah/youtube-market-data-analysis","last_synced_at":"2025-02-25T05:22:32.792Z","repository":{"id":233880160,"uuid":"787630379","full_name":"gappeah/YouTube-Market-Data-Analysis","owner":"gappeah","description":"A project that pulls data from Kaggle, explores and analyses it in Excel, cleans and tests it in SQL, and visualizes it in Power BI.","archived":false,"fork":false,"pushed_at":"2024-10-19T15:19:55.000Z","size":15,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-02-07T10:44:00.230Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/gappeah.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-04-16T22:16:19.000Z","updated_at":"2024-10-19T15:19:58.000Z","dependencies_parsed_at":"2024-11-10T18:01:21.267Z","dependency_job_id":null,"html_url":"https://github.com/gappeah/YouTube-Market-Data-Analysis","commit_stats":null,"previous_names":["gappeah/animo.io","gappeah/excel-to-power-bi-portfolio-project-full-end-to-end-data-project","gappeah/youtube-market-data-analysis","gappeah/nucleic-acid-converter-and-secondary-structure-predictor"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gappeah%2FYouTube-Market-Data-Analysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gappeah%2FYouTube-Market-Data-Analysis/tags","release
s_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gappeah%2FYouTube-Market-Data-Analysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gappeah%2FYouTube-Market-Data-Analysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/gappeah","download_url":"https://codeload.github.com/gappeah/YouTube-Market-Data-Analysis/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":240607635,"owners_count":19828272,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-10T14:13:07.236Z","updated_at":"2025-02-25T05:22:32.760Z","avatar_url":"https://github.com/gappeah.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# YouTube Market Data Analysis\n\n## Project Overview\nThis project is a comprehensive data analysis workflow that demonstrates how to transform raw data from Excel into actionable insights using SQL and Power BI. The dataset used contains information about top-performing YouTube channels in the UK, with data sourced from a publicly available platform.\n\nThe workflow consists of data extraction, cleaning, transformation, and visualization, designed to simulate a real-world marketing analysis scenario for a client aiming to identify YouTube influencers for marketing campaigns.\n\n## Table of Contents\n1. [Project Architecture](#project-architecture)\n2. [Dataset Description](#dataset-description)\n3. [Steps Involved](#steps-involved)\n4. [Data Cleaning](#data-cleaning)\n5. 
[Data Visualization](#data-visualization)\n6. [Recommendations](#recommendations)\n7. [How to Run](#how-to-run)\n8. [Contributing](#contributing)\n9. [License](#license)\n\n## Project Architecture\nThe project involves three key technologies:\n- **Excel**: The raw data is sourced from an Excel file containing YouTube channel information.\n- **SQL**: SQL Server is used to clean, transform, and perform quality checks on the data.\n- **Power BI**: A dashboard is created to visualize insights, including top YouTubers by subscriber count, views, and video uploads.\n\n## Dataset Description\nThe dataset contains the following fields:\n- Channel name\n- Subscriber count\n- Views\n- Videos uploaded\n\nThe dataset was sourced from a data platform and extracted via a Python script using the YouTube API.\n\n## Steps Involved\n\n1. **Data Extraction**: \n   - The initial data is downloaded from an external source in Excel format.\n   - A Python script is used to pull supplementary data from the YouTube API.\n\n2. **Data Cleaning**:\n   - Unnecessary columns are removed.\n   - Data is cleaned to ensure consistency and accuracy.\n\n3. **Data Transformation**:\n   - Data types for each column are validated.\n   - SQL queries are used to transform the dataset into the required format for analysis.\n\n4. **Data Quality Checks**:\n   - Row and column counts are verified.\n   - Data types are validated.\n   - Duplicate records are checked.\n\n5. 
**Data Visualization**:\n   - The cleaned data is imported into Power BI.\n   - A dashboard is created with visualizations such as tables, tree maps, and bar charts to show top YouTubers by subscriber count, video uploads, and views.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgappeah%2Fyoutube-market-data-analysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgappeah%2Fyoutube-market-data-analysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgappeah%2Fyoutube-market-data-analysis/lists"}