https://github.com/pierrekieffer/datapreprocessing
Custom data preprocessing library made for machine learning
data-preparation data-preprocessing machine-learning preprocessing scikit-learn
- Host: GitHub
- URL: https://github.com/pierrekieffer/datapreprocessing
- Owner: PierreKieffer
- Created: 2018-08-28T20:40:11.000Z (almost 8 years ago)
- Default Branch: master
- Last Pushed: 2019-04-02T14:21:44.000Z (about 7 years ago)
- Last Synced: 2023-03-07T01:31:41.952Z (about 3 years ago)
- Topics: data-preparation, data-preprocessing, machine-learning, preprocessing, scikit-learn
- Language: Python
- Size: 4.88 KB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# DataPreprocessing
This library provides custom functions that simplify data preprocessing for machine learning.
## Data loaders
## Feature selection
- identify_missing : find features with missing values
- identify_collinear : find highly correlated features
- identify_features_importance : find the most important features for classification
- identify_single_value_features : find features that contain a single value
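
The README does not document signatures for these functions, so the following is only a minimal sketch of how three of the checks (missing values, single-value features, collinear features) could be done with pandas. The helper names `find_missing`, `find_single_value` and `find_collinear`, their `threshold` parameters and the sample data are assumptions made for illustration; they are not this library's API.

```python
import numpy as np
import pandas as pd


def find_missing(df: pd.DataFrame, threshold: float = 0.5) -> list:
    """Return columns whose fraction of missing values exceeds `threshold`."""
    missing_fraction = df.isnull().mean()
    return missing_fraction[missing_fraction > threshold].index.tolist()


def find_single_value(df: pd.DataFrame) -> list:
    """Return columns that hold at most one distinct non-null value."""
    unique_counts = df.nunique()
    return unique_counts[unique_counts <= 1].index.tolist()


def find_collinear(df: pd.DataFrame, threshold: float = 0.9) -> list:
    """Return numeric columns whose absolute correlation with an
    earlier numeric column exceeds `threshold`."""
    corr = df.select_dtypes(include="number").corr().abs()
    # Keep only the strict upper triangle so each pair is checked once.
    mask = np.triu(np.ones(corr.shape, dtype=bool), k=1)
    upper = corr.where(mask)
    return [col for col in upper.columns if (upper[col] > threshold).any()]


# Hypothetical sample data.
df = pd.DataFrame({
    "a": [1.0, 2.0, 3.0, 4.0, 5.0],
    "b": [2.0, 4.0, 6.0, 8.0, 10.0],     # perfectly correlated with "a"
    "c": ["x", None, None, None, "y"],   # 60% missing
    "d": [7, 7, 7, 7, 7],                # single value
    "e": [4.0, 1.0, 5.0, 2.0, 3.0],
})

print(find_missing(df))
print(find_single_value(df))
print(find_collinear(df))
```

Each helper returns a list of column names, so the results can be passed straight to `df.drop(columns=...)` once you have decided which features to discard.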
## Data transformation
- encode_data : transform object values to numerical values
- reverse_encode_data : map encoded values back to the original object values
- scale : scale the data
- principal_components_analysis : transform the features into (n) uncorrelated features
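
As a rough illustration of the four transformations listed above, the sketch below calls scikit-learn directly instead of this library's functions, whose signatures are not shown in the README. The sample DataFrame, the choice of `LabelEncoder` and `StandardScaler`, and the variable names are assumptions; the library may encode or scale differently.

```python
import numpy as np
import pandas as pd
from sklearn.decomposition import PCA
from sklearn.preprocessing import LabelEncoder, StandardScaler

# Hypothetical sample data with one object column and two numeric columns.
df = pd.DataFrame({
    "color": ["red", "blue", "red", "green"],
    "height": [1.0, 2.0, 3.0, 4.0],
    "width": [2.0, 4.1, 5.9, 8.0],
})

# Encode: map each distinct string to an integer code.
# The fitted encoder is kept so the mapping can be reversed later.
encoder = LabelEncoder()
encoded = df.copy()
encoded["color"] = encoder.fit_transform(df["color"])

# Reverse encode: recover the original object values from the codes.
decoded_colors = encoder.inverse_transform(encoded["color"])

# Scale: standardize every column to zero mean and unit variance.
scaled = StandardScaler().fit_transform(encoded)

# PCA: project the scaled features onto n = 2 uncorrelated components.
components = PCA(n_components=2).fit_transform(scaled)

print(list(decoded_colors))
print(components.shape)
```

Keeping the fitted encoder around is what makes the reverse step possible: without it, the integer codes cannot be mapped back to their labels.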