https://github.com/ryu1kn/sandbox--data-wrangling
sandbox--data-wrangling
https://github.com/ryu1kn/sandbox--data-wrangling
Last synced: about 1 year ago
JSON representation
sandbox--data-wrangling
- Host: GitHub
- URL: https://github.com/ryu1kn/sandbox--data-wrangling
- Owner: ryu1kn
- Created: 2023-04-05T14:40:58.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2024-01-30T01:59:35.000Z (over 2 years ago)
- Last Synced: 2025-03-28T14:06:06.425Z (about 1 year ago)
- Language: Jupyter Notebook
- Size: 893 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Sandbox: Data Wrangling
Playing with
* PyArrow
* DuckDB
* Polars
## Usage
```sh
poetry install
poetry run scripts/prepare-data.py
```
Then go play with notebooks under [notebooks](./notebooks) directory :)
## Troubleshooting
While installing dependencies with poetry, if `debugpy` installation fails, you may
temporarily disable `modern-installation`. See https://github.com/microsoft/debugpy/issues/1246
```sh
poetry config installer.modern-installation false
```
## References
* [Polars](https://pola-rs.github.io/polars-book/user-guide/introduction.html)
* [DuckDB](https://duckdb.org/)