An open API service indexing awesome lists of open source software.

https://github.com/alastairrushworth/inspectpd

📈 📊 Methods for Exploring Pandas Data Frames
https://github.com/alastairrushworth/inspectpd

categorical-data-visualization correlation dataframe eda exploratory-data-analysis pandas python

Last synced: 11 months ago
JSON representation

📈 📊 Methods for Exploring Pandas Data Frames

Awesome Lists containing this project

README

          

# inspectpd

## Overview

`inspectpd` is collection of utilities for columnwise summary,
comparison and visualisation of `pandas` dataframes. The package patches
in the following methods for pandas objects:

- [`.inspect_types()`](#column-types) summary of column types
- [`.inspect_mem()`](#memory-usage) summary of memory usage of columns
- [`.inspect_na()`](#missing-values) columnwise prevalence of missing
values
- [`.inspect_cor()`](#correlation) correlation coefficients of numeric
columns
- [`.inspect_imb()`](#feature-imbalance) feature imbalance of
non-numeric
- [`.inspect_num()`](#numeric-summaries) summaries of numeric columns
- [`.inspect_cat()`](#categorical-levels) summaries of non-numeric

## Installation and use

You can install `inspectpd` using `pip` with

```python
pip install inspectpd
```

Simply import the package and use the methods on a pandas dataframe:

```
import pandas as pd
import inspectpd
# example data set just for illustration
from inspectpd import starwars

# categorical features summary
starwars.inspect_cat()
```

You can also get a quick visualisation of the summary too:

```
# get a plot of the categorical features
starwars.inspect_cat().view()
```