Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/rayyan9477/diamond-price-forecasting
This is a comprehensive machine learning project focused on predicting diamond prices. Using a dataset of diamond attributes, the project implements various machine learning models to forecast prices. Key features include data preprocessing, exploratory data analysis (EDA), and model training with algorithms such as Linear Regression, Decision Tree
https://github.com/rayyan9477/diamond-price-forecasting
data-analysis data-science decision-trees eda linear-regression machine-learning
Last synced: 4 days ago
JSON representation
This is a comprehensive machine learning project focused on predicting diamond prices. Using a dataset of diamond attributes, the project implements various machine learning models to forecast prices. Key features include data preprocessing, exploratory data analysis (EDA), and model training with algorithms such as Linear Regression, Decision Tree
- Host: GitHub
- URL: https://github.com/rayyan9477/diamond-price-forecasting
- Owner: Rayyan9477
- License: mit
- Created: 2024-08-02T14:16:31.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2024-08-13T19:05:16.000Z (3 months ago)
- Last Synced: 2024-08-13T22:22:19.269Z (3 months ago)
- Topics: data-analysis, data-science, decision-trees, eda, linear-regression, machine-learning
- Language: Jupyter Notebook
- Homepage:
- Size: 5.47 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Diamond Price Forecasting: Data Science and Machine Learning Approach
- [LinkedIn](https://www.linkedin.com/in/rayyan-ahmed9477/)
## About The Project
The Diamond Price Prediction project is an end-to-end data science initiative focused on forecasting diamond prices using advanced machine learning techniques. By conducting extensive exploratory data analysis (EDA), this project aims to uncover the key factors that influence diamond pricing. Leveraging data analytics, it provides accurate and valuable price predictions, offering critical insights to both buyers and sellers in the diamond market. Through the integration of machine learning models, this project enhances decision-making processes, ensuring informed transactions in the highly valuable diamond industry.
## About the Data
The dataset The goal is to predict the price of a given diamond (Regression Analysis).
There are 10 independent variables (including id):
- id: the unique identifier of each diamond
- carat: Carat (ct.) refers to the unique unit of weight measurement used exclusively to weigh gemstones and diamonds.
- cut: Quality of Diamond Cut
- color: Color of Diamond
- clarity: Diamond clarity is a measure of the purity and rarity of the stone, graded by the visibility of these characteristics under 10-power magnification.
- depth: The depth of the diamond is its height (in millimetres) measured from the culet (bottom tip) to the table (flat, top surface)
- table: A diamond's table is the facet which can be seen when the stone is viewed face up.
- x : Diamond X dimension
- y: Diamond Y dimension
- z: Diamond Z dimension### Target variable:
- price: Price of the given Diamond.
Dataset Source Link : ```https://www.kaggle.com/competitions/playground-series-s3e8/data?select=train.csv```
## Built With
- Pandas
- Numpy
- Scikit-learn
- Flask
- DVC
- MLFlow
- Seaborn
- Matplotlib## Installation Steps
### Option 1: Installation from GitHub
Follow these steps to install and set up the project directly from the GitHub repository:
1. **Clone the Project or Download the Code**
- From Github download the code or clone repo
2. **Create a Virtual Environment** (Optional but recommended)
- It's a good practice to create a virtual environment to manage project dependencies. Run the following command:
```
conda create -p python== -y
```3. **Activate the Virtual Environment** (Optional)
- Activate the virtual environment based on your operating system:
```
conda activate /
```4. **Install Dependencies**
- Navigate to the project directory:
```
cd [project_directory]
```
- Run the following command to install project dependencies:
```
pip install -r requirements.txt
```5. **Run the Project**
- Start the project by running the appropriate command.
```
python app.py
```6. **Access the Project**
- Open a web browser or the appropriate client to access the project.
- http://localhost:8080
![Home](https://github.com/Rayyan9477/Diamond-Price-Forecasting/blob/main/static/Home.png)![Output](https://github.com/Rayyan9477/Diamond-Price-Forecasting/blob/main/static/Output.png)
## Usage and Configuration
This project requires Amazon Web Services Access Key ID and Secret Access Key for interacting with AWS services. Follow these steps to configure your project to use AWS keys:
1. **Obtain Your AWS Access Key ID and Secret Access Key**:
- Log in to the AWS Management Console.
- Open the IAM (Identity and Access Management) dashboard.
- Create a new IAM user or use an existing one.
- Attach the necessary policies to the user.
- Generate an access key for the user. Save these keys securely.2. **Configuration**:
- Store your AWS Access Key ID and Secret Access Key securely. Do not hardcode them directly in your code or expose them in public repositories. Instead, use environment variables or a configuration file to manage them securely.## Contact
Rayyan Ahmed - [[email protected]]([email protected])