An open API service indexing awesome lists of open source software.

https://github.com/wa-lead/ml485_blind_preprocessing_prediction_comp

This project aims to achieve the best prediction results by applying various preprocessing techniques and blind data engineering.
https://github.com/wa-lead/ml485_blind_preprocessing_prediction_comp

data-engineering data-visualization machine-learning python

Last synced: 10 days ago
JSON representation

This project aims to achieve the best prediction results by applying various preprocessing techniques and blind data engineering.

Awesome Lists containing this project

README

          

# ML485_blind_preprocessing_prediction_comp
This project aims to achieve the best prediction results by applying various preprocessing techniques and blind data engineering.

# Data Preprocessing
To gain an understanding of the data structure and the separation of classes, the initial step involved using UMAP (Uniform Manifold Approximation and Projection). Additionally, the data was imputed, and generic data engineering techniques were applied. Finally, feature selection was performed using mutual information.

# Prediction
Several models were evaluated, and XGBoost emerged as the top-performing model for prediction.

# Gain Further Insights into the Data
To gain insights into the weaknesses of the model, various techniques were employed to understand the reasons behind incorrect predictions (details can be found in the "main.ipynb" file).

Please refer to the "main.ipynb" file for a more comprehensive analysis and implementation details.

" Won first place btw 🥱 "