Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/chris-santiago/polars-play


https://github.com/chris-santiago/polars-play

Last synced: 10 days ago
JSON representation

Awesome Lists containing this project

README

        

# Comparing Pandas, Polars and Vaex

Using 2019-2022 data from [NYC Taxi Data](https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page).

Number of rows: 179,807,942
Dataset size: 24.4GB

## Performance

|Framework|Read|Groupby|Total
|---|---|---|---|
|Pandas|dies|n/a|n/a|
|Polars|1ms|2.3s|2.3s|
|Vaex|416ms|5.3s|5.3s|

### Notes

- Data is stored in parquet files on local disk
- Using M1 MBA with 8GB RAM-- unsure how memory swap impacts result