https://github.com/jxareas/ml-zoomcamp-2024
Solutions for the Machine Learning Zoomcamp 2024
https://github.com/jxareas/ml-zoomcamp-2024
Last synced: 8 months ago
JSON representation
Solutions for the Machine Learning Zoomcamp 2024
- Host: GitHub
- URL: https://github.com/jxareas/ml-zoomcamp-2024
- Owner: jxareas
- License: mit
- Created: 2024-10-03T21:42:28.000Z (about 1 year ago)
- Default Branch: master
- Last Pushed: 2024-11-04T09:34:28.000Z (12 months ago)
- Last Synced: 2025-01-20T23:46:58.757Z (9 months ago)
- Language: Python
- Size: 2.9 MB
- Stars: 0
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Codeowners: CODEOWNERS
- Security: SECURITY.md
Awesome Lists containing this project
README
This repository contains my scripts, answers & solutions to the Machine Learning Zoomcamp.
## ⏳ Deadlines ⏳
| ID | Module | Progress | Deadline |
|----|---------------------------------------|--------------------|-----------|
| 01 | Introduction to Machine Learning | :white_check_mark: | Sep-30-24 |
| 02 | Machine Learning for Regression | :white_check_mark: | Oct-09-24 |
| 03 | Machine Learning for Classification | :white_check_mark: | Oct-14-24 |
| 04 | Evaluation Metrics for Classification | :white_check_mark: | Oct-21-24 |
| 05 | Deploying Machine Learning Models | :white_check_mark: | Oct-28-24 |
| 06 | Decision Trees and Ensemble Learning | :white_check_mark: | Nov-04-24 |
| 7b | Midterm Project | :x: | Nov-25-24 |
| 7b | Midterm Project Evaluation | :x: | TBD |
| 08 | Neural Networks and Deep Learning | :x: | Dec-02-24 |
| 09 | Serverless Deep Learning | :x: | Dec-09-24 |
| 10 | Kubernetes and TensorFlow-Serving | :x: | Dec-16-24 |
| 11 | Kubeflow and KFServing | :x: | TBD |
| 12 | Capstone Project | :x: | Jan-06-25 |
| 12 | Capstone Project Evaluation | :x: | Jan-27-25 |
| 13 | The third Project | :x: | TBD |
| 13 | The third Project Evaluation | :x: | TBD |
| 14 | Article | :x: | TBD |
## 🚀 Modules 🚀
The course is divided into two parts (machine learning algorithms & model deployments) and 11 modules.
### [Module 1: Introduction to Machine Learning](01-introduction)
* Introduction to Machine Learning
* ML vs Rule-Based
* Model selection
* Setting up the environment
### [Module 2: Machine Learning for Regression](02-regression)
* Linear Regression
* Feature Engineering
* Regularization
* Tuning
### [Module 3: Machine Learning for Classification](03-classification)
* Logistic Regression
* One-Hot Encoding
* Model interpretation
* Feature importance: risk ratio, mutual information, correlation
### [Module 4: Evaluation Metrics for Classification](04-evaluation)
* Accuracy & Dummy model
* Confusion matrix
* Precision & Recall
* ROC Curves / ROC AUC
* Cross-Validation
### [Module 5: Deploying Machine Learning Models](05-deployment)
* Flask
* pipenv
* Docker
* AWS Elastic Beanstalk
### [Module 6: Decision Trees and Ensemble Learning](06-trees)
* Decision Trees: Learning & parameter tuning
* Ensemble Learning: Random Forest
* Gradient Boosting: XGBoost
* Model selection
### [Module 8: Neural Networks and Deep Learning](08-deep-learning)
* Tensorflow
* Keras
* Convolutional Neural Networks
* Transfer learning
* Learning rate
* Regularization
* Data augmentation
### [Module 9: Serverless Deep Learning](09-serverless)
* AWS Lambda
* Tensorflow Lite
* API Gateway
### [Module 10: Kubernetes and Tensorflow Serving](10-kubernetes)
* Tensorflow Serving
* Docker-Compose
* Kubernetes
* Amazon EKS
### [Module 11: KServe](11-kserve)
* KServe
* Custom Scikit-Learn images with KServe
* KServe Transformers
* KServe + Amazon EKS
## 🛠️ Technologies 🛠️
- **Python:** Versatile and widely-used programming language for web development, data science, and automation.
- **Anaconda:** Open-source distribution of Python and R for scientific computing and data analysis.
- **Docker:** Platform for developing, shipping, and running applications in containers.
- **Scikit-Learn:** Machine learning library for Python that provides simple and efficient tools for data mining and
data analysis.
- **TensorFlow:** Open-source platform for machine learning and deep learning developed by Google.
- **Keras:** High-level neural networks API, written in Python and capable of running on top of TensorFlow.
- **BentoML:** Framework for managing and deploying machine learning models in production environments.
- **Kubernetes:** Open-source container orchestration platform for automating deployment, scaling, and management of
containerized applications.
- **KServe:** Kubernetes-native platform for serving machine learning models on Kubernetes, providing features like
autoscaling and monitoring.
## 📜 License 📜
This is an open-source project licensed under the MIT License, as shown below:
```
MIT License
Copyright (c) 2024 Jonathan Areas
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.
```