{"id":26264555,"url":"https://github.com/saya304/data-cleaning-and-exploratory-data-analysis","last_synced_at":"2026-03-16T08:36:51.402Z","repository":{"id":280348439,"uuid":"941695497","full_name":"saya304/Data-Cleaning-and-Exploratory-Data-Analysis","owner":"saya304","description":"Data Cleaning and Exploratory Data Analysis in Snowflake","archived":false,"fork":false,"pushed_at":"2025-03-03T20:34:18.000Z","size":63,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-03-03T21:21:31.073Z","etag":null,"topics":["data-cleansing","exploratory-data-analysis","snowflake","sql"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/saya304.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2025-03-02T21:57:15.000Z","updated_at":"2025-03-03T20:34:21.000Z","dependencies_parsed_at":"2025-03-03T21:21:31.629Z","dependency_job_id":null,"html_url":"https://github.com/saya304/Data-Cleaning-and-Exploratory-Data-Analysis","commit_stats":null,"previous_names":["saya304/data-cleaning-eda-project","saya304/data-cleaning-and-exploratory-data-analysis"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/saya304%2FData-Cleaning-and-Exploratory-Data-Analysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/saya304%2FData-Cleaning-and-Exploratory-Data-Analysis/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/saya304%2FData-Cleaning-and-Exploratory-Data-Analysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/saya304%2FData-Cleaning-and-Exploratory-Data-Analysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/saya304","download_url":"https://codeload.github.com/saya304/Data-Cleaning-and-Exploratory-Data-Analysis/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243510103,"owners_count":20302295,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-cleansing","exploratory-data-analysis","snowflake","sql"],"created_at":"2025-03-14T02:16:26.586Z","updated_at":"2025-12-28T09:20:52.788Z","avatar_url":"https://github.com/saya304.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# **Data Cleaning \u0026 Exploratory Data Analysis in Snowflake**\n\n## 📌 Project Overview  \nThis project focuses on **data cleaning, transformation, and exploratory data analysis (EDA)** using SQL within the **Snowflake** environment. The goal is to efficiently clean and analyze raw data to extract meaningful insights.  \n\n---\n\n## 📂 Files in This Repository  \n- 📄 [`data cleaning.sql`](https://github.com/saya304/Data-Cleaning-and-Exploratory-Data-Analysis/blob/main/data%20cleaning.sql) – SQL scripts for **cleaning and transforming raw data**  \n- 📄 [`exploratory data analysis.sql`](https://github.com/saya304/Data-Cleaning-and-Exploratory-Data-Analysis/blob/main/exploratory%20data%20analysis.sql) – SQL queries for **data exploration \u0026 insights**    \n\n---\n\n## 🛠️ Tools \u0026 Technologies Used  \n- **Snowflake** – Cloud-based data warehousing  \n- **SQL** – Data cleaning, transformations, and analytics  \n\n---\n\n## 📊 Key Steps \u0026 Techniques  \n\n### ✅ 1. Data Cleaning [`data cleaning.sql`](https://github.com/saya304/Data-Cleaning-and-Exploratory-Data-Analysis/blob/main/data%20cleaning.sql)  \n- Removing duplicates and NULL values  \n- Standardizing data formats  \n- Handling inconsistent entries  \n\n### 📈 2. Exploratory Data Analysis [`exploratory data analysis.sql`](https://github.com/saya304/Data-Cleaning-and-Exploratory-Data-Analysis/blob/main/exploratory%20data%20analysis.sql)  \n- Analyzing trends and patterns  \n- Aggregating and summarizing key metrics  \n- Using window functions for deeper insights  \n\n---\n\n## 📌 Future Improvements  \n- 🔹 Automate data pipeline using Snowflake tasks  \n- 🔹 Integrate with visualization tools (Tableau, Power BI)  \n- 🔹 Expand analysis with additional datasets  \n\n---\n\n## 📩 Contact \u0026 Credits  \nThis project was initially inspired by the **portfolio project done by YouTuber [Alex The Analyst](https://www.youtube.com/c/AlexTheAnalyst) for MySQL**, but has been implemented using **Snowflake** instead.  \n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsaya304%2Fdata-cleaning-and-exploratory-data-analysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsaya304%2Fdata-cleaning-and-exploratory-data-analysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsaya304%2Fdata-cleaning-and-exploratory-data-analysis/lists"}