An open API service indexing awesome lists of open source software.

https://github.com/pierrekieffer/datapreprocessing

Custom data preprocessing library made for machine learning
https://github.com/pierrekieffer/datapreprocessing

data-preparation data-preprocessing machine-learning preprocessing scikit-learn

Last synced: about 1 month ago
JSON representation

Custom data preprocessing library made for machine learning

Awesome Lists containing this project

README

          

# DataPreprocessing
This preprocessing library provides custom preprocessing functions to improve data preprocessing work for machine learning

## Data loaders
## Features selection
- identify_missing : Find features with missing values
- identify_collinear : Find highly correlated features
- identify_features_importance : Find most important features for classification
- identify_single_value_features : Find single value features
## Data transformation
- encode_data : transform object values to numericals
- reverse_encode_data : back to object values
- scale : scaling data
- principal_components_analysis : transform features to (n) uncorrelated features