Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/rakumar99/sales-prediction-analysis-using-machine-learning
This project predicts customer purchase amounts during Black Friday sales using machine learning models. The analysis leverages customer and product data, applying algorithms like Linear Regression, Decision Trees, and Random Forests. Future improvements include hyperparameter tuning, new feature creation, and experimenting with additional models.
https://github.com/rakumar99/sales-prediction-analysis-using-machine-learning
jupyter-notebook machine-learning-algorithms predictive-modeling python
Last synced: about 1 month ago
JSON representation
This project predicts customer purchase amounts during Black Friday sales using machine learning models. The analysis leverages customer and product data, applying algorithms like Linear Regression, Decision Trees, and Random Forests. Future improvements include hyperparameter tuning, new feature creation, and experimenting with additional models.
- Host: GitHub
- URL: https://github.com/rakumar99/sales-prediction-analysis-using-machine-learning
- Owner: rakumar99
- Created: 2024-10-20T13:01:55.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2024-10-20T13:30:06.000Z (3 months ago)
- Last Synced: 2024-10-26T01:42:39.271Z (3 months ago)
- Topics: jupyter-notebook, machine-learning-algorithms, predictive-modeling, python
- Language: Jupyter Notebook
- Homepage:
- Size: 9.18 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Sales-Prediction-Analysis-using-Machine-Learning
### Project Overview:
This project focuses on predicting the purchase amount of customers during Black Friday sales using a dataset of sales transactions from a retail store. The analysis aims to help retailers understand customer purchasing behavior and optimize sales strategies. This regression problem involves building machine learning models to predict purchase amounts based on customer and product attributes.
The dataset consists of 550,069 rows and 12 columns, providing a rich opportunity to apply feature engineering techniques and explore various machine learning algorithms.
### Dataset Information:
The dataset captures sales transaction details from Black Friday shopping at a retail store. It contains a variety of customer and product-related attributes that can help predict the purchase amount. The problem at hand is to predict the total purchase amount for each transaction.
Dataset Size: 550,069 rows, 12 columns.
Problem Type: Regression (predicting a continuous value)
Problem: Predict Purchase Amount
Dataset Download Link: https://www.kaggle.com/kkartik93/black-friday-sales-prediction
### Libraries and Tools Used:
This project makes use of several Python libraries to process data, visualize trends, and build machine learning models.
pandas: For data manipulation and analysis.
matplotlib: For visualizing trends and patterns in the dataset.
seaborn: For advanced statistical data visualizations.
scikit-learn: For building and evaluating machine learning models.
### Machine Learning Algorithms:
Several machine learning algorithms were applied to predict the purchase amount:
Linear Regression: A simple algorithm that fits a linear model to predict continuous values.
Decision Tree: A tree-based algorithm that splits data based on decision rules to make predictions.
Random Forest: An ensemble method that builds multiple decision trees to improve accuracy.
Extra Trees: Another ensemble method that builds multiple trees with randomized splits to enhance model performance.
### Future Work:
To further improve the model’s performance and gain more insights, the following steps are planned:
Hyperparameter Tuning: Applying techniques such as grid search or random search to fine-tune the parameters of the models for optimal performance.
Experimenting with Different Models: Trying other machine learning algorithms like Gradient Boosting, XGBoost, or Neural Networks to see if they provide better results.
Creation of New Attributes: Engineering new features from the existing data to enhance predictive power.
Normalization: Applying normalization techniques to improve model accuracy by standardizing data values.### Conclusion:
This project demonstrates how machine learning models can be applied to real-world retail data to predict purchase amounts, helping businesses better understand customer behavior and optimize sales strategies.