https://github.com/jibbs1703/classic-ml-models
This repository contains scripts for developing, training and evaluating machine learning models using several python frameworks.
https://github.com/jibbs1703/classic-ml-models
aws data-preprocessing data-science deep-learning feature-engineering machine-learning multiclass-classification neural-networks predictive-modeling pyspark-mllib pytest scikit-learn xgboost-classifier
Last synced: 3 months ago
JSON representation
This repository contains scripts for developing, training and evaluating machine learning models using several python frameworks.
- Host: GitHub
- URL: https://github.com/jibbs1703/classic-ml-models
- Owner: jibbs1703
- License: mit
- Created: 2024-03-15T22:30:53.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2025-03-26T06:13:22.000Z (over 1 year ago)
- Last Synced: 2025-03-26T06:23:59.405Z (over 1 year ago)
- Topics: aws, data-preprocessing, data-science, deep-learning, feature-engineering, machine-learning, multiclass-classification, neural-networks, predictive-modeling, pyspark-mllib, pytest, scikit-learn, xgboost-classifier
- Language: Jupyter Notebook
- Homepage:
- Size: 3.3 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Machine Learning Model Development Using Python Frameworks
This repository contains scripts for developing, training and evaluating various machine learning models using python frameworks such as PySpark Mlib (the Machine Learning Framework provided in PySpark), Scikit-Learn, XGBoost and Neural Networks.
The model development process is tracked using MLflow, allowing for transformer and estimator parameters to be tracked, logged and eventually registered throughout the model development process.
## **Models Developed:**
**Multiclass Obesity:**
* Description: Predicts the obesity level of patients based on their eating/lifestyle habits and physical condition.
* Includes: Data preprocessing, model selection,hyperparameter tuning, model evaluation metrics (accuracy, precision, recall, F1-score).
**Adult Income:**
* Description: Classifies the earnings of individuals into two classes - above $50k or below $50k.
* Includes: Data preprocessing, model selection,hyperparameter tuning, model evaluation metrics (accuracy, precision, recall, F1-score).
```bash
docker build -t classic-ml-models-dev .
```
```bash
docker run -it --name classic-ml-models-container -v .:/workspace -p 8888:8888 classic-ml-models-dev
```