Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/vnegi10/dataframe_benchmarks
Comparison of DataFrame performance between Julia, Python and Rust
https://github.com/vnegi10/dataframe_benchmarks
Last synced: 12 days ago
JSON representation
Comparison of DataFrame performance between Julia, Python and Rust
- Host: GitHub
- URL: https://github.com/vnegi10/dataframe_benchmarks
- Owner: vnegi10
- Created: 2023-08-19T12:52:37.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-08-20T15:49:55.000Z (over 1 year ago)
- Last Synced: 2024-11-29T03:13:02.555Z (24 days ago)
- Language: Jupyter Notebook
- Size: 24.4 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
## DataFrame benchmarks
This repository contains Jupyter notebooks which are
used to execute and compare DataFrame benchmarks between
[Pandas](https://pandas.pydata.org/), [Polars](https://github.com/pola-rs/polars)
and [DataFrames.jl](https://github.com/JuliaData/DataFrames.jl) packages.## Versions
- Python 3.10.6
- Polars 0.18.15
- Pandas 2.0.3- Julia 1.9.2
- DataFrames.jl 1.6.1## Data source
- Hard drive [S.M.A.R.T.](http://ntfs.com/disk-monitor-smart-attributes.htm) data has been
graciously made available for public use by Backblaze. The quarterly CSV data (per day)
are bundled together into a zipped file, which can be downloaded from
[here.](https://www.backblaze.com/b2/hard-drive-test-data.html) We are making use of
data for only a selected number of days.- Steam games recommendation data has been obtained from kaggle. More information
is available [here.](https://www.kaggle.com/datasets/antonkozyriev/game-recommendations-on-steam?select=recommendations.csv)- Rotten Tomatoes movies review data has been obtained from kaggle. More information
can be found [here.](https://www.kaggle.com/datasets/stefanoleone992/rotten-tomatoes-movies-and-critic-reviews-dataset?select=rotten_tomatoes_critic_reviews.csv)