{"id":18106827,"url":"https://github.com/rakumar99/sales-prediction-analysis-using-machine-learning","last_synced_at":"2026-05-02T19:31:25.206Z","repository":{"id":259500174,"uuid":"875614129","full_name":"rakumar99/Sales-Prediction-Analysis-using-Machine-Learning","owner":"rakumar99","description":"This project predicts customer purchase amounts during Black Friday sales using machine learning models. The analysis leverages customer and product data, applying algorithms like Linear Regression, Decision Trees, and Random Forests. Future improvements include hyperparameter tuning, new feature creation, and experimenting with additional models.","archived":false,"fork":false,"pushed_at":"2024-10-20T13:30:06.000Z","size":9622,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-04-06T06:45:49.559Z","etag":null,"topics":["jupyter-notebook","machine-learning-algorithms","predictive-modeling","python"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/rakumar99.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-10-20T13:01:55.000Z","updated_at":"2024-10-21T02:28:51.000Z","dependencies_parsed_at":"2024-10-26T10:16:32.942Z","dependency_job_id":null,"html_url":"https://github.com/rakumar99/Sales-Prediction-Analysis-using-Machine-Learning","commit_stats":null,"previous_names":["rakumar99/sales-prediction-analysis-using-machine-learning"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/rakumar99%2FSales-Prediction-Analysis-using-Machine-Learning","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/rakumar99%2FSales-Prediction-Analysis-using-Machine-Learning/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/rakumar99%2FSales-Prediction-Analysis-using-Machine-Learning/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/rakumar99%2FSales-Prediction-Analysis-using-Machine-Learning/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/rakumar99","download_url":"https://codeload.github.com/rakumar99/Sales-Prediction-Analysis-using-Machine-Learning/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247445649,"owners_count":20939953,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["jupyter-notebook","machine-learning-algorithms","predictive-modeling","python"],"created_at":"2024-10-31T23:08:10.244Z","updated_at":"2026-05-02T19:31:25.137Z","avatar_url":"https://github.com/rakumar99.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Sales-Prediction-Analysis-using-Machine-Learning\n\n### Project Overview:\n\n\u003ch5\u003eThis project focuses on predicting the purchase amount of customers during Black Friday sales using a dataset of sales transactions from a retail store. The analysis aims to help retailers understand customer purchasing behavior and optimize sales strategies. This regression problem involves building machine learning models to predict purchase amounts based on customer and product attributes.\u003c/h5\u003e\n\n\n\u003ch5\u003eThe dataset consists of 550,069 rows and 12 columns, providing a rich opportunity to apply feature engineering techniques and explore various machine learning algorithms.\u003c/h5\u003e\n\n\n\n### Dataset Information:\n\n\u003ch5\u003eThe dataset captures sales transaction details from Black Friday shopping at a retail store. It contains a variety of customer and product-related attributes that can help predict the purchase amount. The problem at hand is to predict the total purchase amount for each transaction.\u003c/h5\u003e\n\n\u003ch5\u003eDataset Size: 550,069 rows, 12 columns.\u003c/h5\u003e\n\n\u003ch4\u003eProblem Type: Regression (predicting a continuous value)\u003c/h4\u003e\n\n\nProblem: Predict Purchase Amount\n\n\u003ch4\u003eDataset Download Link: https://www.kaggle.com/kkartik93/black-friday-sales-prediction\u003c/h4\u003e\n\n\n### Libraries and Tools Used:\n\n\u003ch4\u003eThis project makes use of several Python libraries to process data, visualize trends, and build machine learning models.\n\n\npandas: For data manipulation and analysis.\n\nmatplotlib: For visualizing trends and patterns in the dataset.\n\nseaborn: For advanced statistical data visualizations.\n\nscikit-learn: For building and evaluating machine learning models.\u003c/h4\u003e\n\n\n### Machine Learning Algorithms:\n\n\u003ch4\u003eSeveral machine learning algorithms were applied to predict the purchase amount:\n\nLinear Regression: A simple algorithm that fits a linear model to predict continuous values.\n\nDecision Tree: A tree-based algorithm that splits data based on decision rules to make predictions.\n\nRandom Forest: An ensemble method that builds multiple decision trees to improve accuracy.\n\nExtra Trees: Another ensemble method that builds multiple trees with randomized splits to enhance model performance.\u003c/h4\u003e\n\n\n### Future Work:\n\n\u003ch4\u003eTo further improve the model’s performance and gain more insights, the following steps are planned:\u003c/h4\u003e\n\n\u003ch4\u003eHyperparameter Tuning: Applying techniques such as grid search or random search to fine-tune the parameters of the models for optimal performance.\nExperimenting with Different Models: Trying other machine learning algorithms like Gradient Boosting, XGBoost, or Neural Networks to see if they provide better results.\nCreation of New Attributes: Engineering new features from the existing data to enhance predictive power.\nNormalization: Applying normalization techniques to improve model accuracy by standardizing data values.\u003c/h4\u003e\n\n\n### Conclusion:\n\nThis project demonstrates how machine learning models can be applied to real-world retail data to predict purchase amounts, helping businesses better understand customer behavior and optimize sales strategies.\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frakumar99%2Fsales-prediction-analysis-using-machine-learning","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Frakumar99%2Fsales-prediction-analysis-using-machine-learning","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frakumar99%2Fsales-prediction-analysis-using-machine-learning/lists"}