An open API service indexing awesome lists of open source software.

https://github.com/ani717/xgboost-mlops-pipeline

An end-to-end machine learning pipeline using XGBoost trained on the sklearn Breast Cancer dataset. This project demonstrates a full production workflow.
https://github.com/ani717/xgboost-mlops-pipeline

docker-image fastapi-framework kubernetes-deployment logging mlflow pytest requestid validation xgboost-classifier

Last synced: about 1 month ago
JSON representation

An end-to-end machine learning pipeline using XGBoost trained on the sklearn Breast Cancer dataset. This project demonstrates a full production workflow.

Awesome Lists containing this project

README

          

# πŸš€ XGBoost MLOps Pipeline

A complete end-to-end machine learning pipeline built using **XGBoost**, demonstrating model development, packaging, API serving, and deployment to Kubernetes.

---

## πŸ“Œ Key Components

- πŸ”¬ **Model Training** with MLflow experiment tracking
- πŸ“¦ **Model Packaging** as a Python wheel for reusability
- ⚑ **Model Serving** via a FastAPI application
- 🐳 **Containerization** with Docker
- ☸️ **Deployment** to a local or cloud Kubernetes cluster

---

## πŸ“ Project Structure

- [`dev/`](./dev) – Model development and training with MLflow
- [`model/`](./model) – Packaging trained model into a Python wheel
- [`api/`](./api) – Serving predictions via FastAPI

---

## 🧠 Tech Stack

- `XGBoost` for model training
- `MLflow` for experiment tracking
- `FastAPI` for serving the model
- `Docker` for containerization
- `Kubernetes` for orchestration
- `Pytest` for testing with coverage
- `Logging`, `RequestID`, and `Validation` middleware for observability