Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/WillKoehrsen/feature-selector

Feature selector is a tool for dimensionality reduction of machine learning datasets
https://github.com/WillKoehrsen/feature-selector

Last synced: 2 months ago
JSON representation

Feature selector is a tool for dimensionality reduction of machine learning datasets

Awesome Lists containing this project

README

        

# Feature Selector: Simple Feature Selection in Python

Feature selector is a tool for dimensionality reduction of machine learning datasets.

# Methods

There are five methods used to identify features to remove:

1. Missing Values
2. Single Unique Values
3. Collinear Features
4. Zero Importance Features
5. Low Importance Features

## Usage

Refer to the [Feature Selector Usage notebook](https://github.com/WillKoehrsen/feature-selector/blob/master/Feature%20Selector%20Usage.ipynb) for how to use

## Visualizations

The `FeatureSelector` also includes a number of visualization methods to inspect
characteristics of a dataset.

__Correlation Heatmap__

![](images/example_collinear_heatmap.png)

__Most Important Features__

![](images/example_top_feature_importances.png)

Requires:

```
python==3.6+
lightgbm==2.1.1
matplotlib==2.1.2
seaborn==0.8.1
numpy==1.22.0
pandas==0.23.1
scikit-learn==0.19.1
```

## Contact

Any questions can be directed to [email protected]!