https://github.com/manalisbhavsar/mall-customers-clustering
K-Means clustering to mall customer data, segmenting customers based on their annual income and spending score. To identify patterns and group customers for targeted marketing.
https://github.com/manalisbhavsar/mall-customers-clustering
data-analysis data-visualization matplotlib numpy pandas python scikit-learn
Last synced: 8 days ago
JSON representation
K-Means clustering to mall customer data, segmenting customers based on their annual income and spending score. To identify patterns and group customers for targeted marketing.
- Host: GitHub
- URL: https://github.com/manalisbhavsar/mall-customers-clustering
- Owner: ManaliSBhavsar
- Created: 2025-02-18T07:48:35.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2025-02-18T08:04:23.000Z (4 months ago)
- Last Synced: 2025-05-19T14:52:19.796Z (29 days ago)
- Topics: data-analysis, data-visualization, matplotlib, numpy, pandas, python, scikit-learn
- Language: Python
- Homepage:
- Size: 73.2 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Mall Customers Clustering
This project uses K-Means clustering to analyze mall customers based on two features: **Annual Income** and **Spending Score (1-100)**. The goal is to cluster the customers into different groups to gain insights into their purchasing behavior.
## Project Structure
- `data/`: Contains the dataset (`Mall_Customers.csv`) used for clustering.
- `src/`: Contains the Python script for performing K-Means clustering (`kmeans_clustering.py`).
- `requirements.txt`: Lists the Python dependencies required to run the project.## Elbow Method
The Elbow Method is used to determine the optimal number of clusters (K). The plot of error vs. the number of clusters helps identify the point where the curve begins to flatten, indicating the optimal K.## Visualizations
Before Clustering: The initial scatter plot of data points based on Annual Income and Spending Score.After Clustering: A scatter plot with data points color-coded by cluster and the centroids marked.