An open API service indexing awesome lists of open source software.

https://github.com/saya304/data-cleaning-and-exploratory-data-analysis

Data Cleaning and Exploratory Data Analysis in Snowflake
https://github.com/saya304/data-cleaning-and-exploratory-data-analysis

data-cleansing exploratory-data-analysis snowflake sql

Last synced: 3 months ago
JSON representation

Data Cleaning and Exploratory Data Analysis in Snowflake

Awesome Lists containing this project

README

          

# **Data Cleaning & Exploratory Data Analysis in Snowflake**

## 📌 Project Overview
This project focuses on **data cleaning, transformation, and exploratory data analysis (EDA)** using SQL within the **Snowflake** environment. The goal is to efficiently clean and analyze raw data to extract meaningful insights.

---

## 📂 Files in This Repository
- 📄 [`data cleaning.sql`](https://github.com/saya304/Data-Cleaning-and-Exploratory-Data-Analysis/blob/main/data%20cleaning.sql) – SQL scripts for **cleaning and transforming raw data**
- 📄 [`exploratory data analysis.sql`](https://github.com/saya304/Data-Cleaning-and-Exploratory-Data-Analysis/blob/main/exploratory%20data%20analysis.sql) – SQL queries for **data exploration & insights**

---

## 🛠️ Tools & Technologies Used
- **Snowflake** – Cloud-based data warehousing
- **SQL** – Data cleaning, transformations, and analytics

---

## 📊 Key Steps & Techniques

### ✅ 1. Data Cleaning [`data cleaning.sql`](https://github.com/saya304/Data-Cleaning-and-Exploratory-Data-Analysis/blob/main/data%20cleaning.sql)
- Removing duplicates and NULL values
- Standardizing data formats
- Handling inconsistent entries

### 📈 2. Exploratory Data Analysis [`exploratory data analysis.sql`](https://github.com/saya304/Data-Cleaning-and-Exploratory-Data-Analysis/blob/main/exploratory%20data%20analysis.sql)
- Analyzing trends and patterns
- Aggregating and summarizing key metrics
- Using window functions for deeper insights

---

## 📌 Future Improvements
- 🔹 Automate data pipeline using Snowflake tasks
- 🔹 Integrate with visualization tools (Tableau, Power BI)
- 🔹 Expand analysis with additional datasets

---

## 📩 Contact & Credits
This project was initially inspired by the **portfolio project done by YouTuber [Alex The Analyst](https://www.youtube.com/c/AlexTheAnalyst) for MySQL**, but has been implemented using **Snowflake** instead.