An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with gcp-dataflow

A curated list of projects in awesome lists tagged with gcp-dataflow.

https://github.com/mkuthan/example-beam

Playground for Apache Beam and Scio experiments, driven by real-world use cases.

apache-beam gcp-dataflow scala scio

Last synced: 07 Apr 2025

https://github.com/realkinetic/gcp-dataflow-gcf-trigger

Use a Cloud Function to trigger a Dataflow job when a file is uploaded to Cloud Storage.

dataprep gcp gcp-cloud-functions gcp-dataflow gcp-storage python

Last synced: 07 Aug 2025

https://github.com/tansudasli/beam-sandbox

Apache Beam sandbox with Dataflow, covering 10+ use cases.

apache-beam gcp-dataflow python

Last synced: 01 Mar 2025

https://github.com/emediongfrancis/scalable-data-processing-and-query-optimization-gcp-kafka-snowflake-airflow

This project focuses on scalable data processing and query performance optimisation. It uses Snowflake for data warehousing, GCP Cloud Functions for serverless compute, and Apache Kafka for real-time data streaming. It leverages the serverless capabilities of the systems for scalability and performance.

apache-airflow apache-kafka auto-scaling gcp-dataflow performance query-optimization scalability snowflake velocity volume

Last synced: 23 Jul 2025

https://github.com/kameshpoc/rag-data-pipeline

This repo demonstrates a RAG data processing pipeline built with Dataflow Flex Templates.

cloud-storage-bucket dataflow-job embeddings flex-template gcp gcp-dataflow genai pickle-file rag rag-pipeline

Last synced: 18 Aug 2025