Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/igorbenav/dexter
Data Exploration Terser
https://github.com/igorbenav/dexter
data-analysis data-science eda exploratory-data-analysis pandas
Last synced: about 2 months ago
JSON representation
Data Exploration Terser
- Host: GitHub
- URL: https://github.com/igorbenav/dexter
- Owner: igorbenav
- License: bsd-3-clause
- Created: 2021-03-26T23:57:10.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2024-01-20T12:33:18.000Z (11 months ago)
- Last Synced: 2024-10-01T09:31:22.528Z (3 months ago)
- Topics: data-analysis, data-science, eda, exploratory-data-analysis, pandas
- Language: Python
- Homepage:
- Size: 481 KB
- Stars: 10
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
Awesome Lists containing this project
README
# dexter
[![PyPI version](https://badge.fury.io/py/dexter.svg)](https://badge.fury.io/py/dexter)
[![GitHub license](https://img.shields.io/github/license/igormagalhaesr/dexter)](https://github.com/igormagalhaesr/dexter/blob/main/LICENSE.txt)
[![Documentation Status](https://readthedocs.org/projects/dexter/badge/?version=latest)](https://dexter.readthedocs.io/en/latest/?badge=latest)
___
>data exploration terser## What is dexter?
**dexter** is a lightweight Python package built on top of **numpy** and **pandas** that allows fast data exploration
for multiple structured table files in a folder. It's a high-level tool suitable for a first contact with a dataset
composed of multiple dataframes.![](img/dexter.png)
## Features
- Importing multiple table files with **readm_csv()**
- Saving DataFrames and Names with the **FrameMap** class
- Applying pandas methods to multiple DataFrames at once## Installing
PyPI:
```sh
pip install dexter
```https://pypi.org/project/dexter/
## Usage
Importing dexter:
```python
import dexter as dxt
```Reading multiple dataframes in a folder:
```python
dataframes = dxt.readm_csv("./folder/")
```Displaying results:
```python
dataframes.display()
```
```Out:```
df1
col1
col2
col3
0
value1
$3
1
value2
$5
2
value3
$7
df2
first
second
third
0
one
17%
1
two
19%
2
three
23%
Names and Frames
```python
names = dataframes.names
frames = dataframes.frames
```Multiple Dataframes Types
```python
dataframes.dtypes()
```Multiple Missing Values
```python
dataframes.multiple_missing()
```Multiple Descriptions
```python
dataframes.describe()
```![](img/first_use.png)
![](img/describe.png)
![](img/memory_usage.png)For more concrete examples, check the [notebook](Example.ipynb)
## Contributing
1. Fork it (https://github.com/igormagalhaesr/dexter)
2. Create your feature branch (`git checkout -b feature/fooBar`)
3. Commit your changes (`git commit -am 'Add some fooBar'`)
4. Push to the branch (`git push origin feature/fooBar`)
5. Create a new Pull Request## Dependencies
- [NumPy](https://www.numpy.org)
- [Pandas](https://pandas.pydata.org/)
- [IPython](https://ipython.org/)## License
Distributed under the [BSD 3](LICENSE.txt) license. See ``LICENSE.txt`` for more information.
## Contact
Igor Magalhaes – [@igormagalhaesr](https://twitter.com/igormagalhaesr) – [email protected]
[github.com/igormagalhaesr](https://github.com/igormagalhaesr/)