Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/asut00/machine-learning-piscine_42ai

Comprehensive Machine Learning path by 42AI: hands-on modules on regression, gradient descent, and real-world ML applications.
https://github.com/asut00/machine-learning-piscine_42ai

linear-regression machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 3 days ago
JSON representation

Comprehensive Machine Learning path by 42AI: hands-on modules on regression, gradient descent, and real-world ML applications.

Host: GitHub
URL: https://github.com/asut00/machine-learning-piscine_42ai
Owner: asut00
Created: 2024-12-17T21:32:26.000Z (about 2 months ago)
Default Branch: main
Last Pushed: 2025-01-28T14:19:38.000Z (18 days ago)
Last Synced: 2025-02-12T19:08:29.983Z (3 days ago)
Topics: linear-regression, machine-learning, matplotlib, numpy, pandas, python, scikit-learn
Language: Python
Homepage:
Size: 3.76 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

        
# Machine Learning Course - 42AI 🦾

## Overview

This repository contains all the materials, exercises, and solutions from the week-long Machine Learning Bootcamp organized by 42AI, the Artificial Intelligence association of École 42. The bootcamp was designed to provide a comprehensive introduction to Machine Learning (ML), including foundational mathematics, implementation of ML algorithms, and practical applications. Participants worked through hands-on projects to build a strong understanding of machine learning principles.

## Contents

The bootcamp is divided into several modules, each focusing on key Machine Learning concepts and techniques. Below is an outline of the modules:

### Day 1: Stepping into Machine Learning

- **Topics Covered:**

  - Introduction to Machine Learning

  - Supervised vs. Unsupervised Learning

  - Linear Algebra Review

    - Matrix operations

    - Transpose, addition, multiplication

  - Building a Regression Model

    - Hypothesis, cost function, and evaluation metrics

- **Key Exercises:**

  - Implementing a `Matrix` and `Vector` class with basic operations

  - Creating a basic statistical class, `TinyStatistician`, for computing mean, median, variance, and standard deviation

### Day 2: Univariate Linear Regression

- **Topics Covered:**

  - Gradient Descent for Linear Regression

  - Cost function minimization

  - Learning rate tuning

  - Normalization techniques

- **Key Exercises:**

  - Implementing gradient descent iteratively and using vectorization

  - Training a regression model to predict outcomes from one feature

### Day 3: Multivariate Linear Regression

- **Topics Covered:**

  - Extending regression models to handle multiple features

  - Polynomial regression for non-linear relationships

  - Feature scaling and engineering

  - Understanding overfitting and underfitting

- **Key Exercises:**

  - Implementing multivariate linear regression using vectorized gradient descent

  - Evaluating model performance with loss functions

  - Implementing polynomial feature expansion

### Day 4: Logistic Regression  

**Topics Covered:**  

- Introduction to Logistic Regression  

  - Logistic hypothesis  

  - Decision boundary and cost function  

  - Gradient descent for logistic regression  

- Classification Metrics  

  - Accuracy, precision, recall, F1-score, and confusion matrix  

**Key Exercises:**  

- Implementation of the sigmoid function and logistic hypothesis.  

- Building a logistic regression model using gradient descent.  

- Evaluating performance through classification metrics (confusion matrix, F1-score).  

### Day 5: Regularization  

**Topics Covered:**  

- Understanding Overfitting  

  - Causes and how to detect overfitting  

  - Concepts of regularization (L2 regularization)  

- Regularized Regression  

  - Regularized cost functions for linear and logistic regression  

  - Regularized gradient descent  

**Key Exercises:**  

- Implementing L2 regularization and applying it to regression models.  

- Extending linear and logistic regression to include regularization for better generalization.  

- Comparing regularized and non-regularized models to understand their impact on performance.  

## Acknowledgments

This bootcamp was organized by 42AI, with materials inspired by the Machine Learning MOOC by Andrew Ng. Special thanks to the mentors and peers who contributed to the collaborative learning environment.

## Resources

- [Andrew Ng’s Machine Learning Course on Coursera](https://www.coursera.org/learn/machine-learning)

- [42AI GitHub Repository](https://github.com/42-AI)