Projects in Awesome Lists tagged with pyspark-dataframes
A curated list of projects in awesome lists tagged with pyspark-dataframes.
https://github.com/sbl-sdsc/df-parallel
Comparison of DataFrame libraries for parallel processing of large tabular files on CPU and GPU.
cuda-toolkit dask dask-cudf dask-dataframes dataframes gpu-computing parallel-processing pyspark-dataframes rapidsai
Last synced: 12 Apr 2025
https://github.com/maltzsama/sumeh
Sumeh: Unified Data Quality Framework. Sumeh is a unified data quality validation framework that supports multiple backends (PySpark, Dask, Polars, DuckDB, Pandas) with centralized rule configuration.
dask-dataframes data data-quality data-quality-analysis data-quality-assessment data-quality-checks data-quality-framework data-quality-measurement data-quality-report duckdb duckdb-extension pandas pandas-library polars polars-dataframe polars-extensions pyspark pyspark-dataframes
Last synced: 09 Mar 2026