Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/sinha532/ev_market_segmentation
A data analysis of electric vehicle market segmentation across different parts of India, using Python frameworks for data visualization and analysis
- Host: GitHub
- URL: https://github.com/sinha532/ev_market_segmentation
- Owner: Sinha532
- Created: 2023-07-11T17:33:05.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-06-21T13:44:04.000Z (7 months ago)
- Last Synced: 2024-06-22T06:21:07.126Z (7 months ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 1.13 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Market Segmentation On Electric Car Data
## Project Overview
This project focuses on market segmentation for electric cars by applying various data processing and machine learning techniques. The goal is to predict different segments of electric car data to better understand customer preferences and market trends.

## Data Pre-Processing
Data preprocessing is crucial for improving the quality of data and ensuring accurate results. The steps involved in preprocessing include:

- **Handling Missing Values:** Checking for and addressing missing values to avoid incorrect analysis.
- **Removing Duplicates:** Ensuring there are no duplicate records in the dataset.
- **Identifying Outliers:** Detecting and treating outliers that may distort the model's learning process.
- **Data Encoding:** Converting categorical data into a suitable numerical format for machine learning algorithms.

## Techniques and Tools Used
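
The preprocessing steps listed under Data Pre-Processing can be sketched with pandas as follows. This is a minimal illustration, not the notebook's actual code: the sample DataFrame, its column names (`Brand`, `TopSpeed_KmH`, `Seats`, `PlugType`), the median fill, the 1.5 × IQR outlier rule, and the one-hot encoding are all assumptions chosen for the example.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data; column names and values are assumptions,
# not the repository's real dataset.
df = pd.DataFrame({
    "Brand": ["A", "B", "B", "C", "A", "C", "B"],
    "TopSpeed_KmH": [150.0, 160.0, 160.0, np.nan, 155.0, 165.0, 900.0],
    "Seats": [5, 5, 5, 4, 5, 7, 5],
    "PlugType": ["Type 2 CCS", "Type 2", "Type 2", "Type 2 CCS",
                 "Type 2", "Type 2 CCS", "Type 2"],
})

# 1. Handle missing values: fill numeric gaps with the column median.
df["TopSpeed_KmH"] = df["TopSpeed_KmH"].fillna(df["TopSpeed_KmH"].median())

# 2. Remove exact duplicate records.
df = df.drop_duplicates().reset_index(drop=True)

# 3. Identify outliers with the 1.5 * IQR rule and drop them.
q1, q3 = df["TopSpeed_KmH"].quantile([0.25, 0.75])
iqr = q3 - q1
in_range = df["TopSpeed_KmH"].between(q1 - 1.5 * iqr, q3 + 1.5 * iqr)
clean = df[in_range].reset_index(drop=True)

# 4. Encode categorical columns as one-hot indicator columns.
encoded = pd.get_dummies(clean, columns=["Brand", "PlugType"])

print(encoded.head())
```

On this toy data the duplicate row and the 900 km/h entry are dropped, leaving five rows. The fill strategy and outlier threshold would need to be chosen to suit a real dataset.
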
The project employs various data visualization and preprocessing tools, including:

- **NumPy and Pandas:** For efficient data manipulation and analysis.
- **Matplotlib and Seaborn:** For creating insightful visualizations to understand data distribution and relationships.
- **Categorical Encoding:** Transforming categorical variables into numerical ones to make them suitable for machine learning models.

## Data Visualization
Data visualization helps in identifying patterns, trends, and outliers in the dataset. Key visualizations include:

- **KDE Plots:** To visualize the probability density of continuous variables.
- **Bar Plots:** For comparing categorical data such as the number of seats or top speeds by brand.
- **Pie Charts:** To represent the distribution of different plug types used for charging.
- **Histograms:** For displaying the distribution of numerical data.

## Machine Learning Algorithms
Two clustering techniques were primarily used in this project to segment the electric car data:

### K-Means Clustering
K-Means clustering is a popular unsupervised learning algorithm used to partition data into k clusters. The steps involved are:
- **Elbow Method:** To determine the optimal number of clusters by plotting the within-cluster sum of squares (WCSS) against the number of clusters and choosing the point where the decrease levels off.
- **Cluster Visualization:** To visualize the clusters formed and the centroids of these clusters.

### Hierarchical Clustering
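
For reference, the K-Means steps from the K-Means Clustering subsection (elbow method, then cluster and centroid visualization) might look like the sketch below with scikit-learn. The data is synthetic: three made-up groups over two hypothetical features (range and price), standardized before clustering. The feature choice, the scaling step, and the range of k values tried are assumptions, not taken from the notebook.

```python
import matplotlib.pyplot as plt
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Synthetic data: three well-separated groups over two hypothetical
# features (range in km, price in thousands). Purely illustrative.
rng = np.random.default_rng(0)
centers = np.array([[200.0, 25.0], [350.0, 45.0], [500.0, 90.0]])
X = np.vstack([c + rng.normal(scale=[15.0, 3.0], size=(30, 2)) for c in centers])
X_scaled = StandardScaler().fit_transform(X)

# Elbow method: record WCSS (scikit-learn's inertia_) for k = 1..8.
ks = list(range(1, 9))
wcss = []
for k in ks:
    model = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X_scaled)
    wcss.append(model.inertia_)

fig, (ax_elbow, ax_clusters) = plt.subplots(1, 2, figsize=(10, 4))
ax_elbow.plot(ks, wcss, marker="o")
ax_elbow.set_xlabel("Number of clusters k")
ax_elbow.set_ylabel("WCSS")

# Fit the final model; k = 3 is hard-coded because the synthetic data
# was generated with three groups.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X_scaled)
labels = kmeans.labels_

# Cluster visualization: points colored by cluster, centroids marked with X.
ax_clusters.scatter(X_scaled[:, 0], X_scaled[:, 1], c=labels)
ax_clusters.scatter(kmeans.cluster_centers_[:, 0], kmeans.cluster_centers_[:, 1],
                    marker="X", s=200, c="red")
fig.tight_layout()
```

With real data the elbow is read off the WCSS plot rather than known in advance, and it is often less sharp than on synthetic groups like these.
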
Hierarchical clustering is another unsupervised learning technique that builds a hierarchy of clusters:
- **Agglomerative Clustering:** Starts with each data point as its own cluster and repeatedly merges the most similar clusters.
- **Dendrogram:** A tree-like diagram that shows the arrangement of the clusters formed.

## Project Summary
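
The agglomerative clustering and dendrogram steps from the Hierarchical Clustering subsection can be sketched with SciPy and scikit-learn. The synthetic two-feature data, the Ward linkage criterion, and the choice of three clusters are assumptions made for the example; the README does not state which linkage or cluster count the notebook uses.

```python
import matplotlib.pyplot as plt
import numpy as np
from scipy.cluster.hierarchy import dendrogram, linkage
from sklearn.cluster import AgglomerativeClustering
from sklearn.preprocessing import StandardScaler

# Synthetic data: three small groups over two hypothetical features.
rng = np.random.default_rng(0)
centers = np.array([[200.0, 25.0], [350.0, 45.0], [500.0, 90.0]])
X = np.vstack([c + rng.normal(scale=[15.0, 3.0], size=(10, 2)) for c in centers])
X_scaled = StandardScaler().fit_transform(X)

# Build the merge hierarchy (Ward linkage is an assumed choice).
Z = linkage(X_scaled, method="ward")

# Dendrogram: tree of successive merges; tall vertical gaps suggest
# natural places to cut the tree into clusters.
fig, ax = plt.subplots(figsize=(8, 4))
tree = dendrogram(Z, ax=ax, no_labels=True)
ax.set_ylabel("Merge distance")

# Agglomerative clustering with the cluster count read off the dendrogram.
agg = AgglomerativeClustering(n_clusters=3, linkage="ward")
labels = agg.fit_predict(X_scaled)
```

Unlike K-Means, this approach does not need the number of clusters up front: the full hierarchy is built first, and the cut level can be chosen afterwards from the dendrogram.
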
The project showcases the application of various data processing and machine learning techniques to segment electric car data. By understanding different market segments, businesses can make informed decisions to cater to specific customer needs.

---
Feel free to explore the visualizations and results in this repository to gain insights into the electric car market segmentation.