Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/tushar2704/wine_quality
This project aims to predict the quality of wines using various machine learning algorithms. It utilizes the MLflow platform to manage the end-to-end machine learning lifecycle, including data preprocessing, model training, hyperparameter tuning, and deployment on AWS EC2.
https://github.com/tushar2704/wine_quality
artificial-intelligence aws cicd css dagshub data-science deployment docker flask-application html python streamlit-tushar2704 tushar2704
Last synced: about 1 month ago
JSON representation
This project aims to predict the quality of wines using various machine learning algorithms. It utilizes the MLflow platform to manage the end-to-end machine learning lifecycle, including data preprocessing, model training, hyperparameter tuning, and deployment on AWS EC2.
- Host: GitHub
- URL: https://github.com/tushar2704/wine_quality
- Owner: tushar2704
- License: mit
- Created: 2023-08-16T12:24:45.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-06-06T22:47:55.000Z (8 months ago)
- Last Synced: 2024-06-06T23:49:33.088Z (8 months ago)
- Topics: artificial-intelligence, aws, cicd, css, dagshub, data-science, deployment, docker, flask-application, html, python, streamlit-tushar2704, tushar2704
- Language: CSS
- Homepage: https://tushar-aggarwal.com
- Size: 243 MB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Wine Quality Prediction with MLFLOW experiments
### Predicting wine quality using machine learning techniques and managing the end-to-end workflow with MLFLOW experiments on Dagshub, Deployment via Docker image on AWS EC2
![Python Version](https://img.shields.io/badge/python-3.11%20%7C%203.10-blue)
![MLflow Version](https://img.shields.io/badge/MLflow-2.6.0-green)## Introduction
This project aims to predict the quality of wines using various machine learning algorithms. It utilizes the MLflow platform to manage the end-to-end machine learning lifecycle, including data preprocessing, model training, hyperparameter tuning, and deployment.
## Features
- Data preprocessing pipeline for cleaning and transforming the dataset.
- Support for multiple machine learning algorithms for wine quality prediction.
- Hyperparameter tuning using grid search or random search.
- Tracking and logging experiments with MLflow for easy comparison.
- REST API endpoint for making predictions using the trained model.
- Dockerized environment for seamless deployment.## Project Structure
```
├── data/
│ ├── wine-quality.csv
│ └── ...
├── models/
│ ├── model.pkl
│ └── ...
├── notebooks/
│ ├── data_exploration.ipynb
│ ├── model_experimentation.ipynb
│ └── ...
├── .gitignore
├── Dockerfile
├── preprocess_data.py
├── train.py
├── predict_api.py
└── README.md
```## Getting Started
### STEPS:
Clone the repository
```bash
https://github.com/tushar2704/Wine_Quality
```
### STEP 01- Create a conda environment after opening the repository```bash
conda create -n ml python=3.11 -y
``````bash
conda activate ml
```### STEP 02- install the requirements
```bash
pip install -r requirements.txt
``````bash
# Finally run the following command
python app.py
```Now,
```bash
open up you local host and port
```## MLflow
[Documentation](https://mlflow.org/docs/latest/index.html)
##### cmd
- mlflow ui### dagshub
[dagshub](https://dagshub.com/)MLFLOW_TRACKING_URI=https://dagshub.com/tushar27/Wine_Quality.mlflow \
MLFLOW_TRACKING_USERNAME=tushar27 \
MLFLOW_TRACKING_PASSWORD="" \
python main.pyRun this to export as env variables:
```bash
export MLFLOW_TRACKING_URI=https://dagshub.com/tushar27/Wine_Quality.mlfloww
export MLFLOW_TRACKING_USERNAME=tushar27
export MLFLOW_TRACKING_PASSWORD=""
```
# AWS-CICD-Deployment-with-Github-Actions
## 1. Login to AWS console.
## 2. Create IAM user for deployment
#with specific access
1. EC2 access : It is virtual machine
2. ECR: Elastic Container registry to save your docker image in aws
#Description: About the deployment
1. Build docker image of the source code
2. Push your docker image to ECR
3. Launch Your EC2
4. Pull Your image from ECR in EC2
5. Lauch your docker image in EC2
#Policy:
1. AmazonEC2ContainerRegistryFullAccess
2. AmazonEC2FullAccess
## 3. Create ECR repo to store/save docker image
- Save the URI: 566373416292.dkr.ecr.ap-south-1.amazonaws.com/mlproj
## 4. Create EC2 machine (Ubuntu)## 5. Open EC2 and Install docker in EC2 Machine:
#optinalsudo apt-get update -y
sudo apt-get upgrade
#requiredcurl -fsSL https://get.docker.com -o get-docker.sh
sudo sh get-docker.sh
sudo usermod -aG docker ubuntu
newgrp docker
# 6. Configure EC2 as self-hosted runner:
setting>actions>runner>new self hosted runner> choose os> then run command one by one# 7. Setup github secrets:
AWS_ACCESS_KEY_ID=
AWS_SECRET_ACCESS_KEY=
AWS_REGION = us-east-1
AWS_ECR_LOGIN_URI = demo>> 566373416292.dkr.ecr.ap-south-1.amazonaws.com
ECR_REPOSITORY_NAME = simple-app
## Contact Information
If you have any questions, feedback, or collaboration opportunities, please feel free to reach out to me. You can contact me via email at [[email protected]](mailto:[email protected]) or connect with me on LinkedIn at [Tushar Aggarwal](https://www.linkedin.com/in/yourname).
Thank you for visiting my Data Analysis Portfolio! I hope you find my projects informative and insightful.
## Author
- [©2023 Tushar Aggarwal. All rights reserved](https://www.tushar-aggarwal.com/)
- [LinkedIn](https://www.linkedin.com/in/tusharaggarwalinseec/)
- [Medium](https://medium.com/@tushar_aggarwal)
- [Tushar-Aggarwal.com](https://www.tushar-aggarwal.com/)
- [New Kaggle](https://www.kaggle.com/tagg27)