An open API service indexing awesome lists of open source software.

https://github.com/gagan8605/zepto_sql_analysis

This project explores and analyzes the inventory data of Zepto, a rapidly growing 10-minute grocery delivery platform in India. The dataset contains over 3,000+ SKUs across key product categories such as Fruits & Vegetables, Dairy, Beverages, Packaged Foods, and more. The analysis was performed using PostgreSQL, covering both data cleaning and bus
https://github.com/gagan8605/zepto_sql_analysis

cleaning-data data-analysis database-management postgresql sql

Last synced: 3 months ago
JSON representation

This project explores and analyzes the inventory data of Zepto, a rapidly growing 10-minute grocery delivery platform in India. The dataset contains over 3,000+ SKUs across key product categories such as Fruits & Vegetables, Dairy, Beverages, Packaged Foods, and more. The analysis was performed using PostgreSQL, covering both data cleaning and bus

Awesome Lists containing this project

README

          

# ๐Ÿ›’ Zepto Inventory & Pricing Analysis (SQL Project)

This project involves analyzing the inventory and pricing dataset of **Zepto**, a rapidly expanding 10-minute grocery delivery service in India. The dataset includes over **3,000+ SKUs (Stock Keeping Units)** across categories like **Fruits & Vegetables, Dairy, Beverages, Packaged Foods**, and more.

Using **PostgreSQL**, we performed data cleaning, exploration, and multiple business intelligence queries to derive actionable insights about product pricing, stock levels, and category-wise trends.

---

## ๐Ÿ“ฆ Dataset Overview

- **Total SKUs**: 3,104
- **Columns**: `sku_id`, `category`, `name`, `mrp`, `discountPercent`, `availableQuantity`, `discountedSellPrice`, `weightInGms`, `outOfStock`, `quantity`

---

## ๐Ÿงน Data Cleaning & Exploration

- โœ… **Null Check**: No null values found across critical columns.
- ๐Ÿ”„ **Zero Price Removal**: Removed 21 SKUs with `mrp` or `discountedSellPrice` equal to 0.
- ๐Ÿ’ฑ **Unit Conversion**: Prices converted from *paise* to *rupees*.
- ๐Ÿท๏ธ **Unique Categories**: 11
- ๐Ÿ“‰ **Stock Status**:
- In Stock: 2,612
- Out of Stock: 471 (โ‰ˆ15.16%)
- ๐Ÿ” **Duplicate Product Names**: 112 names mapped to multiple SKUs.

---

## ๐Ÿ“Š Business Analysis Queries & Insights

### ๐Ÿ”Ÿ Top 10 Best Discounted Products
- Products with up to **60%+ discounts** across personal care and packaged items.

### ๐Ÿšซ High MRP & Out-of-Stock Products
- 59 products with `MRP > โ‚น300` were unavailable.
- Highest observed MRP: โ‚น999.

### ๐Ÿ’ฐ Estimated Revenue by Category

| Category | Revenue (โ‚น) |
|---------------------|-------------|
| Packaged Foods | 1,14,230 |
| Beverages | 91,890 |
| Fruits & Vegetables | 85,140 |
| Dairy | 77,500 |
| Personal Care | 61,380 |

### ๐Ÿงพ Premium Products with Low Discounts
- 48 products with `MRP > โ‚น500` and `<10%` discount, typically premium goods.

### ๐Ÿ“‰ Top 5 Categories by Avg. Discount

| Category | Avg. Discount (%) |
|-------------------|-------------------|
| Personal Care | 26.4 |
| Beverages | 23.1 |
| Snacks | 21.7 |
| Cleaning Supplies | 20.9 |
| Dairy | 18.3 |

### โš–๏ธ Best Value Products (โ‚น per gram)
- Price efficiency calculated for products above 100g.
- Best value product: โ‚น0.07/g (e.g. rice, sugar, detergent).

### ๐Ÿท๏ธ Product Weight Category

| Weight Category | Count |
|------------------|-------|
| Low (<1kg) | 1,876 |
| Medium (1โ€“5kg) | 845 |
| Bulk (>5kg) | 215 |

### ๐Ÿ‹๏ธ Total Inventory Weight by Category

| Category | Total Weight (kg) |
|---------------------|-------------------|
| Packaged Foods | 7,140 |
| Fruits & Vegetables | 5,410 |
| Dairy | 3,860 |
| Beverages | 2,970 |
| Personal Care | 2,115 |

---

## ๐Ÿ›  Tech Stack

- **Database**: PostgreSQL 16
- **Tools**: Git, GitHub, Git Bash
- **Language**: SQL (PostgreSQL dialect)