https://github.com/fburic/panda-grove
A lightweight package to manage multiple Pandas DataFrames
data-analysis data-science data-wrangling pandas
- Host: GitHub
- URL: https://github.com/fburic/panda-grove
- Owner: fburic
- License: bsd-3-clause
- Created: 2021-10-21T21:10:42.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2024-01-02T16:47:30.000Z (over 2 years ago)
- Last Synced: 2024-08-10T08:16:49.606Z (almost 2 years ago)
- Topics: data-analysis, data-science, data-wrangling, pandas
- Language: Python
- Homepage: https://panda-grove.readthedocs.io
- Size: 56.6 KB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README

Panda Grove
-----------------
A lightweight package for easier management of multiple Pandas DataFrames
during data analysis and exploration.
The philosophy of Panda Grove is to assist without getting in the way or
creating yet another API on top of Pandas.
To keep overhead minimal, its only required dependency is Pandas.
## Feature Highlights
* :ballot_box_with_check: Multi-merges (merge across multiple DataFrames at once)
* :ballot_box_with_check: `Collection` class to encapsulate and manage multiple DataFrames
* :ballot_box_with_check: DataFrame sanity checks
* :black_square_button: Support all types of joins (currently only inner join)
* :black_square_button: Datatype management (work in progress)
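For readers unfamiliar with the term, a multi-merge can be thought of as a chain of pairwise merges over a shared key. The sketch below illustrates the operation with plain Pandas and `functools.reduce`, using an inner join. It is not Panda Grove's API: the helper `multi_merge`, the example frames, and the key column `id` are all hypothetical.

```python
from functools import reduce

import pandas as pd


def multi_merge(frames, on):
    """Inner-join a sequence of DataFrames on a shared key column.

    Hypothetical helper for illustration only; not part of Panda Grove.
    """
    return reduce(
        lambda left, right: left.merge(right, on=on, how="inner"),
        frames,
    )


# Hypothetical example data; every frame carries the key column "id".
samples = pd.DataFrame({"id": [1, 2, 3], "name": ["a", "b", "c"]})
scores = pd.DataFrame({"id": [2, 3, 4], "score": [0.5, 0.7, 0.9]})
labels = pd.DataFrame({"id": [3, 2], "label": ["x", "y"]})

# Only keys present in all three frames survive the inner joins.
merged = multi_merge([samples, scores, labels], on="id")
print(merged)
```

Doing this by hand for every combination of frames gets repetitive, which is presumably the kind of bookkeeping the package aims to reduce.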
## Installation
Install the PyPI package:
```shell
pip install panda-grove
```
## Requirements
- Python >= 3.7
- pandas >= 1.1.0
- IPython >= 7 (optional): adds pretty printing in Jupyter notebooks
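
One possible way to check an environment against the requirements above is sketched here. It relies only on the standard library and Pandas; the helper `version_tuple` is hypothetical, and the IPython check only tests whether the package is installed, not its version.

```python
import importlib.util
import sys

import pandas as pd


def version_tuple(version, parts=2):
    """Return the leading numeric components of a version string as ints.

    Hypothetical helper: stops at the first non-numeric component, so
    suffixes such as "rc1" or "+local" are ignored.
    """
    numbers = []
    for piece in version.split(".")[:parts]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        numbers.append(int(digits))
    return tuple(numbers)


python_ok = sys.version_info >= (3, 7)
pandas_ok = version_tuple(pd.__version__) >= (1, 1)
# Optional dependency: only checks that IPython can be imported.
ipython_available = importlib.util.find_spec("IPython") is not None

print(f"Python >= 3.7: {python_ok}")
print(f"pandas >= 1.1: {pandas_ok}")
print(f"IPython available (optional): {ipython_available}")
```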