{"id":28816745,"url":"https://github.com/amlanmohanty1/zepto-sql-data-analysis-project","last_synced_at":"2026-04-24T20:34:34.294Z","repository":{"id":299125147,"uuid":"1002137498","full_name":"amlanmohanty1/zepto-SQL-data-analysis-project","owner":"amlanmohanty1","description":"Complete Data Analysis on Zepto Inventory data using SQL","archived":false,"fork":false,"pushed_at":"2025-06-14T19:54:06.000Z","size":0,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-06-14T20:42:57.011Z","etag":null,"topics":["data-analysis","database","inventory-management","postgresql","sql","zepto"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/amlanmohanty1.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-06-14T19:49:04.000Z","updated_at":"2025-06-14T19:54:09.000Z","dependencies_parsed_at":"2025-06-14T20:42:58.297Z","dependency_job_id":"a34a20ae-95aa-424c-abcb-8fc35cf0fdef","html_url":"https://github.com/amlanmohanty1/zepto-SQL-data-analysis-project","commit_stats":null,"previous_names":["amlanmohanty1/zepto-sql-data-analysis-project"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/amlanmohanty1/zepto-SQL-data-analysis-project","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amlanmohanty1%2Fzepto-SQL-data-analysis-project","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amlanmohanty1%2Fzepto-SQL-data-analysis-project/ta
gs","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amlanmohanty1%2Fzepto-SQL-data-analysis-project/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amlanmohanty1%2Fzepto-SQL-data-analysis-project/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/amlanmohanty1","download_url":"https://codeload.github.com/amlanmohanty1/zepto-SQL-data-analysis-project/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amlanmohanty1%2Fzepto-SQL-data-analysis-project/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":260595743,"owners_count":23033789,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-analysis","database","inventory-management","postgresql","sql","zepto"],"created_at":"2025-06-18T17:06:46.369Z","updated_at":"2026-04-24T20:34:34.289Z","avatar_url":"https://github.com/amlanmohanty1.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# 🛒 Zepto E-commerce SQL Data Analyst Portfolio Project\nThis is a complete, real-world data analyst portfolio project based on an e-commerce inventory dataset scraped from [Zepto](https://www.zeptonow.com/) — one of India’s fastest-growing quick-commerce startups. 
This project simulates real analyst workflows, from raw data exploration to business-focused data analysis.\n\nThis project is perfect for:\n- 📊 Data Analyst aspirants who want to build a strong **Portfolio Project** for interviews and LinkedIn\n- 📚 Anyone learning SQL hands-on\n- 💼 Preparing for interviews in retail, e-commerce, or product analytics\n\n# **🎥 Watch this [YouTube video](https://www.youtube.com/watch?v=x8dfQkKTyP0\u0026list=PLAx-M6Di0SisFJ1rv5M_FRHUlGA5rtUf_\u0026index=2) to implement the full project from scratch:**  \n[![SQL Data Analyst Portfolio Project using Zepto Inventory Dataset](https://github.com/user-attachments/assets/a1895ada-15e4-4f98-aa0d-597a4092c845)](https://www.youtube.com/watch?v=x8dfQkKTyP0\u0026list=PLAx-M6Di0SisFJ1rv5M_FRHUlGA5rtUf_\u0026index=2)\n🔗 *Link to Video:* [Watch on YouTube](https://www.youtube.com/watch?v=x8dfQkKTyP0\u0026list=PLAx-M6Di0SisFJ1rv5M_FRHUlGA5rtUf_\u0026index=2)\n\n## 📌 Project Overview\n\nThe goal is to simulate how actual data analysts in the e-commerce or retail industries work behind the scenes to use SQL to:\n\n✅ Set up a messy, real-world e-commerce inventory **database**\n\n✅ Perform **Exploratory Data Analysis (EDA)** to explore product categories, availability, and pricing inconsistencies\n\n✅ Implement **Data Cleaning** to handle null values, remove invalid entries, and convert pricing from paise to rupees\n\n✅ Write **business-driven SQL queries** to derive insights around **pricing, inventory, stock availability, revenue** and more\n\n## 📁 Dataset Overview\nThe dataset was sourced from [Kaggle](https://www.kaggle.com/datasets/palvinder2006/zepto-inventory-dataset/data?select=zepto_v2.csv) and was originally scraped from Zepto’s official product listings. It mimics what you’d typically encounter in a real-world e-commerce inventory system.\n\nEach row represents a unique SKU (Stock Keeping Unit) for a product. 
Duplicate product names exist because the same product may appear multiple times in different package sizes, weights, discounts, or categories to improve visibility – exactly how real catalog data looks.\n\n🧾 Columns:\n- **sku_id:** Unique identifier for each product entry (Synthetic Primary Key)\n\n- **name:** Product name as it appears on the app\n\n- **category:** Product category like Fruits, Snacks, Beverages, etc.\n\n- **mrp:** Maximum Retail Price (originally in paise, converted to ₹)\n\n- **discountPercent:** Discount applied on MRP\n\n- **discountedSellingPrice:** Final price after discount (also converted to ₹)\n\n- **availableQuantity:** Units available in inventory\n\n- **weightInGms:** Product weight in grams\n\n- **outOfStock:** Boolean flag indicating stock availability\n\n- **quantity:** Number of units per package (mixed with grams for loose produce)\n\n## 🔧 Project Workflow\n\nHere’s a step-by-step breakdown of what we do in this project:\n\n### 1. Database \u0026 Table Creation\nWe start by creating a SQL table with appropriate data types:\n\n```sql\nCREATE TABLE zepto (\n  sku_id SERIAL PRIMARY KEY,\n  category VARCHAR(120),\n  name VARCHAR(150) NOT NULL,\n  mrp NUMERIC(8,2),\n  discountPercent NUMERIC(5,2),\n  availableQuantity INTEGER,\n  discountedSellingPrice NUMERIC(8,2),\n  weightInGms INTEGER,\n  outOfStock BOOLEAN,\n  quantity INTEGER\n);\n```\n\n### 2. Data Import\n- Loaded the CSV using pgAdmin's import feature.\n\n - If you're not able to use the import feature, run this command from psql instead (`\\copy` is a psql meta-command, so it must stay on a single line):\n```sql\n\\copy zepto(category,name,mrp,discountPercent,availableQuantity,discountedSellingPrice,weightInGms,outOfStock,quantity) FROM 'data/zepto_v2.csv' WITH (FORMAT csv, HEADER true, DELIMITER ',', QUOTE '\"', ENCODING 'UTF8');\n```\n- Faced encoding issues (UTF-8 error), which were fixed by re-saving the CSV file in CSV UTF-8 format.\n\n### 3. 
🔍 Data Exploration\n- Counted the total number of records in the dataset\n\n- Viewed a sample of the dataset to understand structure and content\n\n- Checked for null values across all columns\n\n- Identified distinct product categories available in the dataset\n\n- Compared in-stock vs out-of-stock product counts\n\n- Detected products that appear multiple times, representing different SKUs\n\n### 4. 🧹 Data Cleaning\n- Identified and removed rows where MRP or discounted selling price was zero\n\n- Converted mrp and discountedSellingPrice from paise to rupees for consistency and readability\n  \n### 5. 📊 Business Insights\n- Found top 10 best-value products based on discount percentage\n\n- Identified high-MRP products that are currently out of stock\n\n- Estimated potential revenue for each product category\n\n- Filtered expensive products (MRP \u003e ₹500) with minimal discount\n\n- Ranked top 5 categories offering highest average discounts\n\n- Calculated price per gram to identify value-for-money products\n\n- Grouped products based on weight into Low, Medium, and Bulk categories\n\n- Measured total inventory weight per product category\n\n\n## 🛠️ How to Use This Project\n\n1. **Clone the repository**\n   ```bash\n   git clone https://github.com/amlanmohanty1/zepto-SQL-data-analysis-project.git\n   cd zepto-SQL-data-analysis-project\n   ```\n2. **Open zepto_SQL_data_analysis.sql**\n\n    This file contains:\n\n      - Table creation\n\n      - Data exploration\n\n      - Data cleaning\n\n      - SQL Business analysis\n  \n3. **Load the dataset into pgAdmin or any other PostgreSQL client**\n\n      - Create a database and run the SQL file\n\n      - Import the dataset (convert to UTF-8 if necessary)\n\n4. **Follow along with the YouTube video for a full walkthrough. 
👨‍💼**\n\n## 📜 License\n\nMIT — feel free to fork, star, and use in your portfolio.\n\n## 👨‍💻 About the Author\nHey, I’m Amlan Mohanty — a Data Analyst \u0026 Content Creator.\nI break down complex data topics into simple, practical content that actually helps you land a job.\n\n ### 🚀 Stay Connected \u0026 Join the Data Drool Community\nIf you enjoyed this project and want to keep learning and growing as a data analyst, let’s stay in touch! I regularly share content around SQL, data analytics, portfolio projects, job tips, and more.\n\n🎥 YouTube: [Amlan Mohanty](https://www.youtube.com/@amlanmohanty1)\n- Beginner-friendly tutorials, real-world projects, job and career advice\n\n📺 Instagram: [data.drool](https://www.instagram.com/data.drool/)\n- Quick SQL tips, data memes, and behind-the-scenes content\n\n💼 LinkedIn: [Amlan Mohanty](https://www.linkedin.com/in/amlanmohanty1/)\n- Let’s connect professionally and grow your data career\n\n\n## 💡 Thanks for checking out the project! Your support means a lot — feel free to star ⭐ this repo or share it with someone learning SQL.🚀\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Famlanmohanty1%2Fzepto-sql-data-analysis-project","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Famlanmohanty1%2Fzepto-sql-data-analysis-project","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Famlanmohanty1%2Fzepto-sql-data-analysis-project/lists"}