https://github.com/saya304/data-cleaning-and-exploratory-data-analysis
Data Cleaning and Exploratory Data Analysis in Snowflake
https://github.com/saya304/data-cleaning-and-exploratory-data-analysis
data-cleansing exploratory-data-analysis snowflake sql
Last synced: 3 months ago
JSON representation
Data Cleaning and Exploratory Data Analysis in Snowflake
- Host: GitHub
- URL: https://github.com/saya304/data-cleaning-and-exploratory-data-analysis
- Owner: saya304
- Created: 2025-03-02T21:57:15.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-03-03T20:34:18.000Z (over 1 year ago)
- Last Synced: 2025-03-03T21:21:31.073Z (over 1 year ago)
- Topics: data-cleansing, exploratory-data-analysis, snowflake, sql
- Homepage:
- Size: 61.5 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# **Data Cleaning & Exploratory Data Analysis in Snowflake**
## 📌 Project Overview
This project focuses on **data cleaning, transformation, and exploratory data analysis (EDA)** using SQL within the **Snowflake** environment. The goal is to efficiently clean and analyze raw data to extract meaningful insights.
---
## 📂 Files in This Repository
- 📄 [`data cleaning.sql`](https://github.com/saya304/Data-Cleaning-and-Exploratory-Data-Analysis/blob/main/data%20cleaning.sql) – SQL scripts for **cleaning and transforming raw data**
- 📄 [`exploratory data analysis.sql`](https://github.com/saya304/Data-Cleaning-and-Exploratory-Data-Analysis/blob/main/exploratory%20data%20analysis.sql) – SQL queries for **data exploration & insights**
---
## 🛠️ Tools & Technologies Used
- **Snowflake** – Cloud-based data warehousing
- **SQL** – Data cleaning, transformations, and analytics
---
## 📊 Key Steps & Techniques
### ✅ 1. Data Cleaning [`data cleaning.sql`](https://github.com/saya304/Data-Cleaning-and-Exploratory-Data-Analysis/blob/main/data%20cleaning.sql)
- Removing duplicates and NULL values
- Standardizing data formats
- Handling inconsistent entries
### 📈 2. Exploratory Data Analysis [`exploratory data analysis.sql`](https://github.com/saya304/Data-Cleaning-and-Exploratory-Data-Analysis/blob/main/exploratory%20data%20analysis.sql)
- Analyzing trends and patterns
- Aggregating and summarizing key metrics
- Using window functions for deeper insights
---
## 📌 Future Improvements
- 🔹 Automate data pipeline using Snowflake tasks
- 🔹 Integrate with visualization tools (Tableau, Power BI)
- 🔹 Expand analysis with additional datasets
---
## 📩 Contact & Credits
This project was initially inspired by the **portfolio project done by YouTuber [Alex The Analyst](https://www.youtube.com/c/AlexTheAnalyst) for MySQL**, but has been implemented using **Snowflake** instead.