An open API service indexing awesome lists of open source software.

https://github.com/jesussantana/supervised-classification

Let’s practice and become familiar with classification algorithms.
https://github.com/jesussantana/supervised-classification

automl automl-algorithms classification-algorithms classification-models classification-trees knn-classifier lazzyclassifier logistic-regression python support-vector-machine xgboost

Last synced: 2 months ago
JSON representation

Let’s practice and become familiar with classification algorithms.

Awesome Lists containing this project

README

        

# [IT Academy - Data Science with Python](https://www.barcelonactiva.cat/es/itacademy)
## [S13 T01: Supervised Classification](https://github.com/jesussantana/Supervised-Classification/blob/main/notebooks/S13_T01_Supervised_Classification.ipynb)

[![forthebadge made-with-python](http://ForTheBadge.com/images/badges/made-with-python.svg)](https://www.python.org/)
[![Made withJupyter](https://img.shields.io/badge/Made%20with-Jupyter-orange?style=for-the-badge&logo=Jupyter)](https://jupyter.org/try)
[![wakatime](https://wakatime.com/badge/github/jesussantana/Supervised-Classification.svg)](https://wakatime.com/badge/github/jesussantana/Supervised-Classification)

### Description

Let’s practice and become familiar with classification algorithms.

### Level 1

- Exercise 1:
- Create at least three different classification models to try to best predict DelayedFlights.csv flight delay (ArrDelay).

- Exercise 2:
- Creates a new variable depending on whether the flight arrived late or not (ArrDelay> 0).

- Exercise 3:
- Compare classification models using accuracy, a confidence matrix, and other more advanced metrics.

- Exercise 4:
- Train them using the different parameters they support.

- Exercise 5:
- Compare your performance using the traint / test approach or using all data (internal validation).

### Level 2

- Exercise 5:
- Perform some variable engineering process to improve prediction.

### Level 3

- Exercise 6:
- Do not use the DepDelay variable when making predictions.

### Targets

- Classification trees
- KNN - k-Nearest Neighbors
- Logistic Regression
- Support Vector Machine
- XGboost