{"id":18929070,"url":"https://github.com/tushard48/product-cluster-analysis","last_synced_at":"2026-05-03T13:43:12.213Z","repository":{"id":251162492,"uuid":"836584070","full_name":"TusharD48/Product-Cluster-Analysis","owner":"TusharD48","description":"This project performs clustering analysis on a product dataset to identify and group similar products. The analysis includes data preprocessing, application of various clustering algorithms, and visualization of results to gain insights into product patterns. Key techniques used are K-Means, Mini Batch K-Means, evaluated using metr","archived":false,"fork":false,"pushed_at":"2024-08-02T08:09:00.000Z","size":9111,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-06-01T11:47:10.304Z","etag":null,"topics":["data-visualization","excel","machine-learning","powerbi","streamlit","unsupervised-learning"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/TusharD48.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-08-01T06:47:12.000Z","updated_at":"2024-08-02T08:09:03.000Z","dependencies_parsed_at":"2024-11-08T11:32:09.656Z","dependency_job_id":"58aa8082-d4f1-4120-9c91-c79c2712245a","html_url":"https://github.com/TusharD48/Product-Cluster-Analysis","commit_stats":null,"previous_names":["tushard48/product-cluster-analysis"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/TusharD48/Product-Cluster-Analysis","repository_url":"ht
tps://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TusharD48%2FProduct-Cluster-Analysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TusharD48%2FProduct-Cluster-Analysis/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TusharD48%2FProduct-Cluster-Analysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TusharD48%2FProduct-Cluster-Analysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/TusharD48","download_url":"https://codeload.github.com/TusharD48/Product-Cluster-Analysis/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TusharD48%2FProduct-Cluster-Analysis/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32571456,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-03T06:36:36.687Z","status":"ssl_error","status_checked_at":"2026-05-03T06:36:09.306Z","response_time":103,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-visualization","excel","machine-learning","powerbi","streamlit","unsupervised-learning"],"created_at":"2024-11-08T11:29:41.672Z","updated_at":"2026-05-03T13:43:12.195Z","avatar_url":"https://github.com/TusharD48.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# **Product Cluster Analysis**\n# 
Overview\nThis project focuses on the clustering of products based on various attributes to identify patterns and group similar products together. Clustering can support inventory management, inform marketing strategies, and improve the overall customer experience by revealing product affinities.\n\n# Table of Contents\n- Project Description\n- Dataset\n- Prerequisites\n- Installation\n- Usage\n- Methods\n- Results\n- Contributing\n- License\n- Contact\n# Project Description\nThe primary objective of this project is to apply clustering techniques to a product dataset to identify distinct groups. The analysis includes preprocessing of data, application of clustering algorithms, and evaluation of the results to derive meaningful insights.\n\n# Dataset\nThe dataset contains the following columns:\n\n- YEAR\n- MONTH\n- SUPPLIER\n- ITEM CODE\n- ITEM DESCRIPTION\n- ITEM TYPE\n- RETAIL SALES\n- RETAIL TRANSFERS\n- WAREHOUSE SALES\n\n# Prerequisites\n- Python 3.x\n- pandas\n- numpy\n- scikit-learn\n- matplotlib\n- seaborn\n\n# Installation\nClone the repository:\n- https://github.com/TusharD48/Product-Cluster-Analysis.git\n\n# Methods\n- Data Preprocessing: Handling missing values, encoding categorical variables, and normalizing numerical features.\n- Clustering Algorithms: K-Means, Mini Batch K-Means.\n- Evaluation Metrics: Silhouette Score, Davies-Bouldin Index.\n- Visualization: Cluster heatmaps, scatter plots, and distribution plots.\n\n# Results\n
The optimal number of clusters was determined to be 5 based on the Silhouette Score. The clustering results reveal distinct groups of products with similar characteristics. Detailed results and visualizations can be found in the https://github.com/TusharD48/Product-Cluster-Analysis/blob/main/Product_cluster_analysis_main.ipynb/ notebook.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftushard48%2Fproduct-cluster-analysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ftushard48%2Fproduct-cluster-analysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftushard48%2Fproduct-cluster-analysis/lists"}