Projects in Awesome Lists tagged with spark-datasource
A curated list of projects in awesome lists tagged with spark-datasource .
https://github.com/stabrise/spark-pdf
PDF DataSource for Apache Spark
big-data data-engineering data-extraction data-science ocr ocr-recognition pdf pdf-document pdf-document-processor spark spark-datasource tesseract tesseract-ocr
Last synced: 09 Apr 2025
https://github.com/miraisolutions/spark-bigquery
Google BigQuery data source for Apache Spark
bigquery google-dataproc spark spark-datasource
Last synced: 04 Sep 2025
https://github.com/huggingface/pyspark_huggingface
PySpark custom data source for Hugging Face Datasets
datasets datasource huggingface huggingface-datasets spark spark-datasource
Last synced: 14 Oct 2025
https://github.com/rejeb/netcdf-spark-parser
Scala/Spark Netcdf for reading Netcdf files
netcdf netcdf-java parser scala spark spark-connector spark-datasource
Last synced: 16 Apr 2026