An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with pyspark-dataframes

A curated list of projects in awesome lists tagged with pyspark-dataframes .

https://github.com/sbl-sdsc/df-parallel

Comparison of Dataframe libraries for parallel processing of large tabular files on CPU and GPU.

cuda-toolkit dask dask-cudf dask-dataframes dataframes gpu-computing parallel-processing pyspark-dataframes rapidsai

Last synced: 12 Apr 2025

https://github.com/maltzsama/sumeh

Sumeh — Unified Data Quality Framework Sumeh is a unified data quality validation framework supporting multiple backends (PySpark, Dask, Polars, DuckDB, Pandas) with centralized rule configuration.

dask-dataframes data data-quality data-quality-analysis data-quality-assessment data-quality-checks data-quality-framework data-quality-measurement data-quality-report duckdb duckdb-extension pandas pandas-library polars polars-dataframe polars-extensions pyspark pyspark-dataframes

Last synced: 09 Mar 2026