An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by GoogleCloudDataproc

A curated list of projects in awesome lists by GoogleCloudDataproc .

https://github.com/googleclouddataproc/initialization-actions

Run in all nodes of your cluster before the cluster starts - lets you customize your cluster

google-cloud-dataproc

Last synced: 08 Apr 2025

https://github.com/GoogleCloudDataproc/spark-bigquery-connector

BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.

bigquery bigquery-storage-api google-bigquery google-cloud google-cloud-dataproc spark

Last synced: 07 Oct 2025

https://github.com/googleclouddataproc/spark-bigquery-connector

BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.

bigquery bigquery-storage-api google-bigquery google-cloud google-cloud-dataproc spark

Last synced: 21 May 2026

https://github.com/googleclouddataproc/hadoop-connectors

Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.

bigquery google-cloud-dataproc hadoop hadoop-filesystem hadoop-hcfs

Last synced: 09 Mar 2026

https://github.com/GoogleCloudDataproc/hadoop-connectors

Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.

bigquery google-cloud-dataproc hadoop hadoop-filesystem hadoop-hcfs

Last synced: 14 Mar 2025

https://github.com/googleclouddataproc/cloud-dataproc

Cloud Dataproc: Samples and Utils

google-cloud-dataproc

Last synced: 16 May 2025

https://github.com/googleclouddataproc/custom-images

Tools for creating Dataproc custom images

google-cloud-dataproc

Last synced: 09 Apr 2025

https://github.com/googleclouddataproc/flink-bigquery-connector

BigQuery connector for Apache Flink

Last synced: 16 Oct 2025

https://github.com/googleclouddataproc/hive-bigquery-storage-handler

Hive Storage Handler for interoperability between BigQuery and Apache Hive

apache bigquery gcp google hadoop hive

Last synced: 16 Mar 2026

https://github.com/googleclouddataproc/spark-spanner-connector

Cloud Spanner Connector for Apache Spark

Last synced: 14 Mar 2026

https://github.com/googleclouddataproc/hive-bigquery-connector

A library enabling BigQuery as Hive storage handler

Last synced: 05 Apr 2025