https://github.com/wassimoubaziz/sad-project
This project aims to predict weekly sales for Walmart stores using machine learning algorithms. The dataset contains various features such as store type, size, department, temperature, fuel price, CPI, unemployment rate, and markdowns.
https://github.com/wassimoubaziz/sad-project
ai artificial-intelligence machine-learning matplotlib numpy oubaziz pandas python sklearn wassim
Last synced: 3 months ago
JSON representation
This project aims to predict weekly sales for Walmart stores using machine learning algorithms. The dataset contains various features such as store type, size, department, temperature, fuel price, CPI, unemployment rate, and markdowns.
- Host: GitHub
- URL: https://github.com/wassimoubaziz/sad-project
- Owner: wassimOubaziz
- License: mit
- Created: 2024-02-09T07:43:55.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-02-09T07:54:51.000Z (over 2 years ago)
- Last Synced: 2025-12-31T03:27:30.389Z (6 months ago)
- Topics: ai, artificial-intelligence, machine-learning, matplotlib, numpy, oubaziz, pandas, python, sklearn, wassim
- Language: Jupyter Notebook
- Homepage:
- Size: 6.33 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
Walmart Sales Prediction Documentation
By **OUBAZIZ Wassim** and **CHELGHAM Mounsif**
**1 Introduction**
This project aims to predict weekly sales for Walmart stores using machine learning algorithms. The dataset contains various features such as store type, size, department, temperature, fuel price, CPI, unemployment rate, and markdowns.
**1.1 Objectives**
- Predict weekly sales for Walmart stores.
- Identify factors influencing sales.
- Optimize model performance.
**2 Project Setup**
The project begins with importing necessary libraries and loading the datasets.
- Python code snippet for Importing libraries and Loading datasets

Figure 1: Loading datasets
3 **Data Exploration**
We perform exploratory data analysis (EDA) to gain insights into the dataset.
1. **Overview of Data**
- Python code snippet of displaying headers of datasets

Figure 2: Overview of Data
2. **Missing Values**
- Python code snippet for checking missing values

Figure 3: Checking Missing Values
3. **Correlation Matrix**
- Python code snippet for Correlation Matrix

Figure 4: Correlation Matrix Code
Execution...

Figure 5: Correlation Matrix
4. **Store Type Distribution**
- Python code snippet for displaying a pie chart of store types

Figure 6: Code of Store Type Distribution
Execution...

Figure 7: Store Type Distribution
4 **Data Preprocessing**
We preprocess the data to prepare it for model training.
**4.1 Feature Engineering**
- Python code snippet for feature engineering

Figure 8: Feature Engineering Code
5 **Exploratory Data Analysis (continued)**
We continue exploring the data to understand the relationship between features and sales.
**5.1 Weekly Sales vs. Store Size**
- Python code snippet for plotting weekly sales vs. store size

Figure 9: Weekly Sales vs. Store Size Code
Execution...

Figure 10: Weekly Sales vs. Store Size
6 **Model Training**
We train machine learning models to predict weekly sales.
**6.1 Random Forest Regression**
- Python code snippet for training Random Forest model

Figure 11: Training Random Forest Model Code
7 **Model Evaluation**
We evaluate the performance of the trained models.
**7.1 Random Forest Regression Results**
- Python code snippet for evaluating Random Forest model

Figure 12: Actual vs. Predicted Sales (Random Forest) Code
Execution...

Figure 13: Actual vs. Predicted Sales (Random Forest)
8 **Conclusion**
The Walmart Sales Prediction project demonstrates the application of machine learning techniques to forecast weekly sales accurately. By analyzing various features and employing advanced regression models like Random Forest, we can optimize sales predictions and assist in strategic decision-making for Walmart stores. Further model refinement and feature engineering could potentially enhance prediction accuracy and provide more valu- able insights for the retail industry.
7