https://github.com/gagan8605/zepto_sql_analysis
This project explores and analyzes the inventory data of Zepto, a rapidly growing 10-minute grocery delivery platform in India. The dataset contains over 3,000+ SKUs across key product categories such as Fruits & Vegetables, Dairy, Beverages, Packaged Foods, and more. The analysis was performed using PostgreSQL, covering both data cleaning and bus
https://github.com/gagan8605/zepto_sql_analysis
cleaning-data data-analysis database-management postgresql sql
Last synced: 3 months ago
JSON representation
This project explores and analyzes the inventory data of Zepto, a rapidly growing 10-minute grocery delivery platform in India. The dataset contains over 3,000+ SKUs across key product categories such as Fruits & Vegetables, Dairy, Beverages, Packaged Foods, and more. The analysis was performed using PostgreSQL, covering both data cleaning and bus
- Host: GitHub
- URL: https://github.com/gagan8605/zepto_sql_analysis
- Owner: gagan8605
- Created: 2025-06-25T14:29:11.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2025-06-25T14:52:10.000Z (4 months ago)
- Last Synced: 2025-06-25T15:43:42.250Z (4 months ago)
- Topics: cleaning-data, data-analysis, database-management, postgresql, sql
- Homepage:
- Size: 80.1 KB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# ๐ Zepto Inventory & Pricing Analysis (SQL Project)
This project involves analyzing the inventory and pricing dataset of **Zepto**, a rapidly expanding 10-minute grocery delivery service in India. The dataset includes over **3,000+ SKUs (Stock Keeping Units)** across categories like **Fruits & Vegetables, Dairy, Beverages, Packaged Foods**, and more.
Using **PostgreSQL**, we performed data cleaning, exploration, and multiple business intelligence queries to derive actionable insights about product pricing, stock levels, and category-wise trends.
---
## ๐ฆ Dataset Overview
- **Total SKUs**: 3,104
- **Columns**: `sku_id`, `category`, `name`, `mrp`, `discountPercent`, `availableQuantity`, `discountedSellPrice`, `weightInGms`, `outOfStock`, `quantity`---
## ๐งน Data Cleaning & Exploration
- โ **Null Check**: No null values found across critical columns.
- ๐ **Zero Price Removal**: Removed 21 SKUs with `mrp` or `discountedSellPrice` equal to 0.
- ๐ฑ **Unit Conversion**: Prices converted from *paise* to *rupees*.
- ๐ท๏ธ **Unique Categories**: 11
- ๐ **Stock Status**:
- In Stock: 2,612
- Out of Stock: 471 (โ15.16%)
- ๐ **Duplicate Product Names**: 112 names mapped to multiple SKUs.---
## ๐ Business Analysis Queries & Insights
### ๐ Top 10 Best Discounted Products
- Products with up to **60%+ discounts** across personal care and packaged items.### ๐ซ High MRP & Out-of-Stock Products
- 59 products with `MRP > โน300` were unavailable.
- Highest observed MRP: โน999.### ๐ฐ Estimated Revenue by Category
| Category | Revenue (โน) |
|---------------------|-------------|
| Packaged Foods | 1,14,230 |
| Beverages | 91,890 |
| Fruits & Vegetables | 85,140 |
| Dairy | 77,500 |
| Personal Care | 61,380 |### ๐งพ Premium Products with Low Discounts
- 48 products with `MRP > โน500` and `<10%` discount, typically premium goods.### ๐ Top 5 Categories by Avg. Discount
| Category | Avg. Discount (%) |
|-------------------|-------------------|
| Personal Care | 26.4 |
| Beverages | 23.1 |
| Snacks | 21.7 |
| Cleaning Supplies | 20.9 |
| Dairy | 18.3 |### โ๏ธ Best Value Products (โน per gram)
- Price efficiency calculated for products above 100g.
- Best value product: โน0.07/g (e.g. rice, sugar, detergent).### ๐ท๏ธ Product Weight Category
| Weight Category | Count |
|------------------|-------|
| Low (<1kg) | 1,876 |
| Medium (1โ5kg) | 845 |
| Bulk (>5kg) | 215 |### ๐๏ธ Total Inventory Weight by Category
| Category | Total Weight (kg) |
|---------------------|-------------------|
| Packaged Foods | 7,140 |
| Fruits & Vegetables | 5,410 |
| Dairy | 3,860 |
| Beverages | 2,970 |
| Personal Care | 2,115 |---
## ๐ Tech Stack
- **Database**: PostgreSQL 16
- **Tools**: Git, GitHub, Git Bash
- **Language**: SQL (PostgreSQL dialect)