https://github.com/synw/dataspace
Data wrangling and visualization
https://github.com/synw/dataspace
Last synced: about 2 months ago
JSON representation
Data wrangling and visualization
- Host: GitHub
- URL: https://github.com/synw/dataspace
- Owner: synw
- License: mit
- Created: 2021-06-25T10:57:45.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2023-04-15T07:02:46.000Z (about 2 years ago)
- Last Synced: 2025-04-14T19:08:09.128Z (about 2 months ago)
- Language: Python
- Size: 13.1 MB
- Stars: 5
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Dataspace
[](https://pypi.org/project/dataspace/) [](https://coveralls.io/github/synw/dataspace?branch=main)
A simple api to explore, clean, transform and visualize data
## Features
- **Explore data**: describe, search and visualize raw data
- **Clean and transform data**: select, filter, normalize and reshape data
- **Visualize data**: many kind of charts:books: Read the documentation
- [Doc](https://synw.github.io/dataspace/doc)
- [Data io](https://synw.github.io/dataspace/doc/data_io)
- [Load](https://synw.github.io/dataspace/doc/data_io/load)
- [From df](https://synw.github.io/dataspace/doc/data_io/load/from_df)
- [From csv](https://synw.github.io/dataspace/doc/data_io/load/from_csv)
- [From django](https://synw.github.io/dataspace/doc/data_io/load/from_django)
- [Export](https://synw.github.io/dataspace/doc/data_io/export)
- [Export csv](https://synw.github.io/dataspace/doc/data_io/export/export_csv)
- [Infos](https://synw.github.io/dataspace/doc/infos)
- [View data](https://synw.github.io/dataspace/doc/infos/view_data)
- [Show](https://synw.github.io/dataspace/doc/infos/view_data/show)
- [Count data](https://synw.github.io/dataspace/doc/infos/count_data)
- [Count null ](https://synw.github.io/dataspace/doc/infos/count_data/count_null_)
- [Count zero ](https://synw.github.io/dataspace/doc/infos/count_data/count_zero_)
- [Count unique ](https://synw.github.io/dataspace/doc/infos/count_data/count_unique_)
- [Wunique ](https://synw.github.io/dataspace/doc/infos/count_data/wunique_)
- [Select](https://synw.github.io/dataspace/doc/select)
- [Limit](https://synw.github.io/dataspace/doc/select/limit)
- [Unique ](https://synw.github.io/dataspace/doc/select/unique_)
- [Clean](https://synw.github.io/dataspace/doc/clean)
- [Nulls](https://synw.github.io/dataspace/doc/clean/nulls)
- [Drop na](https://synw.github.io/dataspace/doc/clean/nulls/drop_na)
- [Drop any nulls](https://synw.github.io/dataspace/doc/clean/nulls/drop_any_nulls)
- [Drop all nulls](https://synw.github.io/dataspace/doc/clean/nulls/drop_all_nulls)
- [Fill nulls](https://synw.github.io/dataspace/doc/clean/nulls/fill_nulls)
- [Dates](https://synw.github.io/dataspace/doc/clean/dates)
- [To date](https://synw.github.io/dataspace/doc/clean/dates/to_date)
- [To tzdate](https://synw.github.io/dataspace/doc/clean/dates/to_tzdate)
- [Fdate](https://synw.github.io/dataspace/doc/clean/dates/fdate)
- [Timestamps](https://synw.github.io/dataspace/doc/clean/dates/timestamps)
- [Convert types](https://synw.github.io/dataspace/doc/clean/convert_types)
- [To int](https://synw.github.io/dataspace/doc/clean/convert_types/to_int)
- [To float](https://synw.github.io/dataspace/doc/clean/convert_types/to_float)
- [To str](https://synw.github.io/dataspace/doc/clean/convert_types/to_str)
- [To type](https://synw.github.io/dataspace/doc/clean/convert_types/to_type)
- [Clean values](https://synw.github.io/dataspace/doc/clean/clean_values)
- [Strip](https://synw.github.io/dataspace/doc/clean/clean_values/strip)
- [Strip cols](https://synw.github.io/dataspace/doc/clean/clean_values/strip_cols)
- [Roundvals](https://synw.github.io/dataspace/doc/clean/clean_values/roundvals)
- [Replace](https://synw.github.io/dataspace/doc/clean/clean_values/replace)
- [Transform](https://synw.github.io/dataspace/doc/transform)
- [Dataframe](https://synw.github.io/dataspace/doc/transform/dataframe)
- [Split ](https://synw.github.io/dataspace/doc/transform/dataframe/split_)
- [Drop](https://synw.github.io/dataspace/doc/transform/dataframe/drop)
- [Add](https://synw.github.io/dataspace/doc/transform/dataframe/add)
- [Rename](https://synw.github.io/dataspace/doc/transform/dataframe/rename)
- [Keep](https://synw.github.io/dataspace/doc/transform/dataframe/keep)
- [Copycol](https://synw.github.io/dataspace/doc/transform/dataframe/copycol)
- [Reverse](https://synw.github.io/dataspace/doc/transform/dataframe/reverse)
- [Values](https://synw.github.io/dataspace/doc/transform/values)
- [Sort](https://synw.github.io/dataspace/doc/transform/values/sort)
- [Exclude](https://synw.github.io/dataspace/doc/transform/values/exclude)
- [Append](https://synw.github.io/dataspace/doc/transform/values/append)
- [Mappend](https://synw.github.io/dataspace/doc/transform/values/mappend)
- [Diffm](https://synw.github.io/dataspace/doc/transform/values/diffm)
- [Resample timeseries](https://synw.github.io/dataspace/doc/transform/resample_timeseries)
- [Resample](https://synw.github.io/dataspace/doc/transform/resample_timeseries/resample)
- [Rsum](https://synw.github.io/dataspace/doc/transform/resample_timeseries/rsum)
- [Rmean](https://synw.github.io/dataspace/doc/transform/resample_timeseries/rmean)
- [Calculations](https://synw.github.io/dataspace/doc/transform/calculations)
- [Percent](https://synw.github.io/dataspace/doc/transform/calculations/percent)
- [Diffp](https://synw.github.io/dataspace/doc/transform/calculations/diffp)
- [Diffpp](https://synw.github.io/dataspace/doc/transform/calculations/diffpp)
- [Diffn](https://synw.github.io/dataspace/doc/transform/calculations/diffn)
- [Diffnp](https://synw.github.io/dataspace/doc/transform/calculations/diffnp)
- [Diffm](https://synw.github.io/dataspace/doc/transform/calculations/diffm)
- [Diffmp](https://synw.github.io/dataspace/doc/transform/calculations/diffmp)
- [Cvar ](https://synw.github.io/dataspace/doc/transform/calculations/cvar_)
- [Lreg ](https://synw.github.io/dataspace/doc/transform/calculations/lreg_)
- [Charts](https://synw.github.io/dataspace/doc/charts)
- [Options](https://synw.github.io/dataspace/doc/charts/options)
- [Axis](https://synw.github.io/dataspace/doc/charts/options/axis)
- [Altair](https://synw.github.io/dataspace/doc/charts/options/altair)
- [Bokeh](https://synw.github.io/dataspace/doc/charts/options/bokeh)
- [W](https://synw.github.io/dataspace/doc/charts/options/w)
- [H](https://synw.github.io/dataspace/doc/charts/options/h)
- [Wh](https://synw.github.io/dataspace/doc/charts/options/wh)
- [Draw charts](https://synw.github.io/dataspace/doc/charts/draw_charts)
- [Line ](https://synw.github.io/dataspace/doc/charts/draw_charts/line_)
- [Point ](https://synw.github.io/dataspace/doc/charts/draw_charts/point_)
- [Bar ](https://synw.github.io/dataspace/doc/charts/draw_charts/bar_)
- [Area ](https://synw.github.io/dataspace/doc/charts/draw_charts/area_)
- [Square ](https://synw.github.io/dataspace/doc/charts/draw_charts/square_)
- [Rule ](https://synw.github.io/dataspace/doc/charts/draw_charts/rule_)
- [Tick ](https://synw.github.io/dataspace/doc/charts/draw_charts/tick_)
- [Bar num ](https://synw.github.io/dataspace/doc/charts/draw_charts/bar_num_)
- [Line num ](https://synw.github.io/dataspace/doc/charts/draw_charts/line_num_)
- [Point num ](https://synw.github.io/dataspace/doc/charts/draw_charts/point_num_)
- [Heatmap ](https://synw.github.io/dataspace/doc/charts/draw_charts/heatmap_)
- [Hist ](https://synw.github.io/dataspace/doc/charts/draw_charts/hist_)
- [Hline ](https://synw.github.io/dataspace/doc/charts/draw_charts/hline_)
- [Inline api](https://synw.github.io/dataspace/doc/charts/inline_api)
- [W](https://synw.github.io/dataspace/doc/charts/inline_api/w)
- [H](https://synw.github.io/dataspace/doc/charts/inline_api/h)
- [Wh](https://synw.github.io/dataspace/doc/charts/inline_api/wh)
- [Mw](https://synw.github.io/dataspace/doc/charts/inline_api/mw)
- [Pw](https://synw.github.io/dataspace/doc/charts/inline_api/pw)
- [Color](https://synw.github.io/dataspace/doc/charts/inline_api/color)
- [Opacity](https://synw.github.io/dataspace/doc/charts/inline_api/opacity)
- [Tooltip](https://synw.github.io/dataspace/doc/charts/inline_api/tooltip)
- [To](https://synw.github.io/dataspace/doc/charts/inline_api/to)
- [Rx](https://synw.github.io/dataspace/doc/charts/inline_api/rx)
- [Nox](https://synw.github.io/dataspace/doc/charts/inline_api/nox)
- [Noy](https://synw.github.io/dataspace/doc/charts/inline_api/noy)
- [Title](https://synw.github.io/dataspace/doc/charts/inline_api/title)
- [Colormap](https://synw.github.io/dataspace/doc/charts/inline_api/colormap)
- [Qcolormap](https://synw.github.io/dataspace/doc/charts/inline_api/qcolormap)
- [Save img](https://synw.github.io/dataspace/doc/charts/inline_api/save_img)
- [Get html ](https://synw.github.io/dataspace/doc/charts/inline_api/get_html_)
- [Html header ](https://synw.github.io/dataspace/doc/charts/inline_api/html_header_)
- [Reporting](https://synw.github.io/dataspace/doc/reporting)
- [Prepare a report](https://synw.github.io/dataspace/doc/reporting/prepare_a_report)
- [Report path](https://synw.github.io/dataspace/doc/reporting/prepare_a_report/report_path)
- [Stack](https://synw.github.io/dataspace/doc/reporting/prepare_a_report/stack)
- [Export](https://synw.github.io/dataspace/doc/reporting/export)
- [Save pdf](https://synw.github.io/dataspace/doc/reporting/export/save_pdf)
- [Save html](https://synw.github.io/dataspace/doc/reporting/export/save_html)This api is:
- *Minimalistic*: short names, simple functionalites, minimal parameters
- *Pragmatic*: focuses on raw efficiency rather than strictly idiomatic code while favouring static typing
- *Simple stupid*: keep it easy to understand for both code and api## Install
```
pip install dataspace
```Optional: to use the Bokeh chart engine:
```
pip install bokeh holoviews
```Note: the Pandas dependency is required for this chart engine as it does
not yet support Polars dataframes## Dependencies
- Dataframe: [Polars](https://github.com/pola-rs/polars)
- Charts: [Altair](https://github.com/altair-viz/altair), and [Holoviews](https://github.com/holoviz/holoviews) with [Bokeh](https://github.com/bokeh/bokeh) (optional)## Example notebooks
[Example notebooks](https://github.com/synw/dataspace_notebooks) are available
[](https://mybinder.org/v2/gh/synw/dataspace_notebooks/HEAD)
```
├── chart
│ ├── area
│ ├── bar
│ ├── hline
│ ├── line
│ └── point
├── clean
│ ├── convert
│ ├── date
│ ├── drop_nulls
│ ├── fill_nulls
│ ├── strip
│ └── values
├── count
│ ├── count_empty
│ ├── count_null
│ ├── count_unique
│ └── count_zero
├── io
│ ├── export
│ └── load
├── select
│ ├── limit
│ ├── unique
│ └── wunique
└── transform
├── dataframe
│ ├── add
│ ├── copycol
│ ├── drop
│ ├── keep
│ ├── rename
│ └── split
├── diff
│ ├── diffm
│ ├── diffmp
│ ├── diffn
│ ├── diffnp
│ ├── diffp
│ └── diffpp
├── resample
│ ├── rmean
│ └── rsum
└── values
├── append
├── exclude
├── reverse
└── sort
```## Tests
To run the tests:
```bash
make test
```