Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/tidypyverse/tidypandas
A grammar of data manipulation for pandas inspired by tidyverse
https://github.com/tidypyverse/tidypandas
data-analysis data-science dataframe dataframe-library dplyr pandas python tidyverse
Last synced: 2 months ago
JSON representation
A grammar of data manipulation for pandas inspired by tidyverse
- Host: GitHub
- URL: https://github.com/tidypyverse/tidypandas
- Owner: tidypyverse
- License: mit
- Created: 2020-01-22T06:23:22.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2024-02-28T18:50:04.000Z (11 months ago)
- Last Synced: 2024-10-31T12:21:12.641Z (2 months ago)
- Topics: data-analysis, data-science, dataframe, dataframe-library, dplyr, pandas, python, tidyverse
- Language: Python
- Homepage: https://tidypyverse.github.io/tidypandas/
- Size: 5.75 MB
- Stars: 92
- Watchers: 5
- Forks: 7
- Open Issues: 9
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project
- awesome-dataframes - tidypandas - A grammar of data manipulation for pandas inspired by tidyverse. (Libraries)
README
![](docs/logo.png)
[![PyPI
version](https://badge.fury.io/py/tidypandas.svg)](https://badge.fury.io/py/tidypandas)# `tidypandas`
> A **grammar of data manipulation** for
> [pandas](https://pandas.pydata.org/docs/index.html) inspired by
> [tidyverse](https://tidyverse.tidyverse.org/)`tidypandas` python package provides *minimal, pythonic* API for common
data manipulation tasks:- `tidyframe` class (wrapper over pandas dataframe) provides a
dataframe with simplified index structure (no more resetting indexes
and multi indexes)
- Consistent ‘verbs’ (`select`, `arrange`, `distinct`, …) as methods
to `tidyframe` class which mostly return a `tidyframe`
- Unified interface for summarizing (aggregation) and mutate (assign)
operations across groups
- Utilites for pandas dataframes and series
- Uses simple python data structures, No esoteric classes, No pipes,
No Non-standard evaluation
- No copy data conversion between `tidyframe` and pandas dataframes
- An accessor to apply `tidyframe` verbs to simple pandas datarames
- …## Example
- `tidypandas` code:
df.filter(lambda x: x['col_1'] > x['col_1'].mean(), by = 'col_2')
- equivalent pandas code:
(df.groupby('col2')
.apply(lambda x: x.loc[x['col_1'] > x['col_1'].mean(), :])
.reset_index(drop = True)
)## Why use `tidypandas`
`tidypandas` is for you if:
- you *frequently* write data manipulation code using pandas
- you prefer to have stay in pandas ecosystem (see accessor)
- you *prefer* to remember a [limited set of
methods](https://medium.com/dunder-data/minimally-sufficient-pandas-a8e67f2a2428)
- you do not want to write (or be surprised by)
[`reset_index`](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.reset_index.html),
[`rename_axis`](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.rename_axis.html)
often
- you prefer writing free flowing, expressive code in
[dplyr](https://dplyr.tidyverse.org/) style> `tidypandas` relies on the amazing `pandas` library and offers a
> consistent API with a different
> [philosophy](https://tidyverse.tidyverse.org/articles/manifesto.html).## Presentation
Learn more about tidypandas
([presentation](https://github.com/talegari/tidypandas/blob/master/docs/tp_pres.html))## Installation
1. Install release version from Pypi using pip:
pip install tidypandas
2. For offline installation, use whl/tar file from the [releases
page](https://github.com/talegari/tidypandas/releases) on github.## Contribution/bug fixes/Issues:
1. Open an issue/suggestion/bugfix on the github
[issues](https://github.com/talegari/tidypandas/issues) page.2. Use the master branch from
[github](https://github.com/talegari/tidypandas) repo to submit your
PR.------------------------------------------------------------------------