Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/danaelshrbiny10/backpack-price-prediction
A deep learning model for predicting backpack prices using TensorFlow and Keras. Includes data preprocessing, model training, and visualizations for performance analysis.
https://github.com/danaelshrbiny10/backpack-price-prediction
keras matplotlib pandas python3 scikit-learn seaborn tensorflow
Last synced: 3 days ago
JSON representation
A deep learning model for predicting backpack prices using TensorFlow and Keras. Includes data preprocessing, model training, and visualizations for performance analysis.
- Host: GitHub
- URL: https://github.com/danaelshrbiny10/backpack-price-prediction
- Owner: danaelshrbiny10
- License: mit
- Created: 2025-02-08T22:05:32.000Z (12 days ago)
- Default Branch: main
- Last Pushed: 2025-02-16T22:53:05.000Z (4 days ago)
- Last Synced: 2025-02-16T23:27:47.219Z (4 days ago)
- Topics: keras, matplotlib, pandas, python3, scikit-learn, seaborn, tensorflow
- Language: Jupyter Notebook
- Homepage: https://www.kaggle.com/competitions/playground-series-s5e2
- Size: 239 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Backpack-Price-Prediction
This project is part of the 2025 Kaggle Playground Series competition, where the goal is to predict the price of backpacks based on various attributes. The dataset includes categorical and numerical features such as brand, material, style, size, and weight capacity.
## Dataset
The dataset consists of the following features:
- Categorical Features: Brand, Material, Style, Color, Size
- Numerical Features: Compartments, Weight Capacity (kg)
- Target Variable: Price
## Model
The model is built using TensorFlow and Keras and follows these steps:
**1. Data Preprocessing**
- Handling missing values with SimpleImputer.
- Encoding categorical variables using Label Encoding.
- Scaling numerical features using StandardScaler.
- Splitting data into training and validation sets.**2. Neural Network Architecture**
- Embedding layer for categorical data representation.
- Fully connected layers with ReLU activation.
- Output layer with a single neuron for price prediction.**3. Model Training**
- Optimizer: Adam
- Loss function: Mean Squared Error (MSE)
- Metrics: Mean Absolute Error (MAE)
- Number of epochs: 25
- Batch size: 32**4. Evaluation & Visualization**
- Loss Curves: To track training and validation performance.
- Histogram of Prices: Distribution of predicted vs. actual prices.
- Scatter Plot: Comparison of actual vs. predicted values.## Results
- The model was trained and evaluated, achieving a reasonable RMSE for backpack price prediction.
- Predictions were generated for the test dataset and saved in `submission.csv`.
## Submission Format
The final submission file follows this format:
```bash
id,Price
300000,81.411
300001,75.293
300002,90.567
```## Running the Model
- Requirements
```bash
pip install numpy pandas tensorflow scikit-learn matplotlib seaborn
```- Execution
Run the `Price_Prediction` script
This will preprocess the data, train the model, visualize the results, and generate a submission file.
## Future Improvements
- Hyperparameter tuning to improve accuracy.
- Experimenting with different neural network architectures.
- Using advanced feature engineering techniques.
## Clone Repository
To download and work with this dataset, run:
```bash
git clone https://github.com/danaelshrbiny10/Backpack-Price-Prediction.git
```## Dataset
You can use data from [kaggle](https://www.kaggle.com/competitions/playground-series-s5e2)## LICIENCE
This project is licensed under the terms described in the [licience](./LICENSE) File.