Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-dataframes
An awesome list of dataframe libraries
https://github.com/jcmkk3/awesome-dataframes
Last synced: about 10 hours ago
JSON representation
-
Libraries
- pandas - Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more.
- Polars - Fast multi-threaded DataFrame library in Rust and Python.
- Modin - Speed up your Pandas workflows by changing a single line of code.
- Ibis - A pandas-like deferred expression system, with first-class SQL support.
- agate - agate is a Python data analysis library that is optimized for humans instead of machines. It is an alternative to numpy and pandas that solves real-world problems with readable code.
- Lemuras - A small *pure* Python library to deal with big tables.
- datatable - A Python package for manipulating 2-dimensional tabular data structures.
- Prosto - A Python data processing toolkit to programmatically author and execute complex data processing workflows. Conceptually, it is an alternative to purely set-oriented approaches to data processing like map-reduce, relational algebra, SQL or data-frame-based tools like pandas.
- siuba - Python library for using dplyr like syntax with pandas and SQL.
- Vaex - A high performance Python library for lazy Out-of-Core DataFrames (similar to Pandas), to visualize and explore big tabular datasets.
- dfply - dplyr-style piping operations for pandas dataframes.
- kadro - A friendly pandas wrapper with a more composable grammar support.
- dexplo - Data exploration library with a pandas-like API.
- pands_cub - A detailed project that teaches you how to build your own Python data analysis library, pandas_cub, from scratch.
- fletcher - Pandas ExtensionDType/Array backed by Apache Arrow.
- tidypandas - A grammar of data manipulation for pandas inspired by tidyverse.
- redframes - [re]ctangular[d]ata[frames]
- dplyr - A grammar of data manipulation, providing a consistent set of verbs that help you solve the most common data manipulation challenges.
- data.table - Provides a high-performance version of base R's `data.frame` with syntax and feature enhancements for ease of use, convenience and programming speed.
- Arquero - A JavaScript library for query processing and transformation of array-backed data tables. Following the relational algebra and inspired by the design of dplyr, Arquero provides a fluent API for manipulating column-oriented data frames.
- dataflow-api - JavaScript API for dataflow processing using the vega-dataflow reactive engine. Perform common database operations (sorting, filtering, aggregation, window calculations) over JavaScript objects.
- datalib - A JavaScript data utility library. It provides facilities for data loading, type inference, common statistics, and string templates.
- Tidy.js - Tidy up your data with JavaScript, inspired by dplyr and the tidyverse.
- Data-Forge - The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
- zebras - A data manipulation and analysis library written in JavaScript offering the convenience of pandas or R.
- dataframe-js - A javascript library providing a new data structure for datascientists and developers.
- DataFrames.jl - Tools for working with tabular data in Julia.
- DataKnots.jl - A Julia library for querying data with an extensible, practical and coherent algebra of query combinators.
- Volcanito.jl - Backend agnostic for tabular data operations in Julia.
- Query.jl - A package for querying julia data sources. It can filter, project, join and group data from any iterable data source, including all the sources supported in IterableTables.jl.
- TidierData.jl - 100% Julia implementation of the dplyr and tidyr R packages.
- tech.ml.dataset - A Clojure high performance data processing system.
- tablecloth - Dataset manipulation library build on the top of tech.ml.dataset.
- Data Frame - Data frames for Common Lisp
- DataFrame - A C++ statistical library that provides an interface similar to Pandas package in Python.
- tidy - Leaning heavily on the principles of the tidyverse, and especially tidy data, this package makes it easy to reshape and tidy tabular data for easier data analysis and visualization.
- column - High-performance, columnar, in-memory store with bitmap indexing in Go.
- gambas - Data analysis tool for Go. Similar to the famous Python library pandas.
- Tablesaw - Java dataframe and visualization library.
- Kotlin Dataframe - Structured data processing in Kotlin.
- krangl - A {K}otlin library for data w{rangl}ing.
- Assistant - A data science library providing flexible dataframes for Lua 5.1+
- Data::Reshapers - Raku package with data reshaping functions for different data structures.
- rover - Simple, powerful data frames for Ruby.
- daru - daru (Data Analysis in RUby) is a library for storage, analysis, manipulation and visualization of data in Ruby.
- polars-ruby - Blazingly fast DataFrames for Ruby.
- VisiData - A terminal spreadsheet multitool for discovering and arranging data.
- qSQL - Query language embedded into Q.
- datafusion - DataFrame API in Apache Arrow DataFusion
- Dataset - Represents a structured dataset based on a hierarchy of lists and associations.
- SQLite - A C-language library that implements a small, fast, self-contained, high-reliability, full-featured, SQL database engine
- polars - A blazingly fast DataFrames library implemented in Rust.
- dance - Dancing 💃 with the stats, aka `tibble()` dancing 🕺. dance is a sort of reinvention of dplyr classic verbs, with a more modern stack underneath, i.e. it leverages a lot from vctrs and rlang.
- Narwhals - Lightweight and extensible compatibility layer between dataframe libraries!
-
Other
- Arrow - A cross-language development platform for in-memory data.
- Substrait - A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
- Consortium for Python Data APIs
- prql - A modern language for transforming data — a simple, powerful, pipelined SQL replacement.
-
Papers
-
Other Lists
Programming Languages
Categories
Sub Categories
Keywords
python
14
dataframe
13
pandas
12
data-analysis
11
data-science
11
sql
7
data
7
csv
4
machine-learning
4
datascience
4
polars
4
json
4
dplyr
4
tabular-data
3
statistical-analysis
3
julia
3
data-wrangling
3
visualization
3
java
3
data-frame
3
arrow
3
dataframe-library
3
javascript
3
dataframes
3
dataset
2
clojure
2
table
2
data-visualization
2
tidyverse
2
html
2
kotlin
2
bigdata
2
data-manipulation
2
sqlite
2
out-of-core
2
r
2
cli
2
pyarrow
2
rust
2
query
2
hdf5
2
machinelearning
2
tsv
2
database
2
memory-mapped-file
1
etl
1
pandas-dataframe
1
pydata
1
grammar
1
arrays
1