Projects in Awesome Lists by GoogleCloudDataproc
A curated list of projects in awesome lists by GoogleCloudDataproc .
https://github.com/googleclouddataproc/initialization-actions
Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
Last synced: 08 Apr 2025
https://github.com/GoogleCloudDataproc/spark-bigquery-connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
bigquery bigquery-storage-api google-bigquery google-cloud google-cloud-dataproc spark
Last synced: 07 Oct 2025
https://github.com/googleclouddataproc/spark-bigquery-connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
bigquery bigquery-storage-api google-bigquery google-cloud google-cloud-dataproc spark
Last synced: 21 May 2026
https://github.com/googleclouddataproc/hadoop-connectors
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
bigquery google-cloud-dataproc hadoop hadoop-filesystem hadoop-hcfs
Last synced: 09 Mar 2026
https://github.com/GoogleCloudDataproc/hadoop-connectors
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
bigquery google-cloud-dataproc hadoop hadoop-filesystem hadoop-hcfs
Last synced: 14 Mar 2025
https://github.com/googleclouddataproc/cloud-dataproc
Cloud Dataproc: Samples and Utils
Last synced: 16 May 2025
https://github.com/googleclouddataproc/custom-images
Tools for creating Dataproc custom images
Last synced: 09 Apr 2025
https://github.com/googleclouddataproc/flink-bigquery-connector
BigQuery connector for Apache Flink
Last synced: 16 Oct 2025
https://github.com/googleclouddataproc/spark-spanner-connector
Cloud Spanner Connector for Apache Spark
Last synced: 14 Mar 2026
https://github.com/googleclouddataproc/hive-bigquery-connector
A library enabling BigQuery as Hive storage handler
Last synced: 05 Apr 2025