An open API service indexing awesome lists of open source software.

BigQuery

Google BigQuery enables companies to handle large amounts of data without having to manage infrastructure. Google’s documentation describes it as a « serverless architecture (that) lets you use SQL queries to answer your organization’s biggest questions with zero infrastructure management. BigQuery’s scalable, distributed analysis engine lets you query terabytes in seconds and petabytes in minutes. » Its client libraries support widely used languages such as Python, Java, JavaScript, and Go. Federated queries are also supported, which lets BigQuery read data directly from external sources.
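To make the federated-query idea concrete, here is a minimal sketch that builds a GoogleSQL statement around the `EXTERNAL_QUERY` function. The helper name `federated_query`, the connection ID `my-project.us.my-connection`, and the `customers` table are all hypothetical placeholders chosen for illustration; the sketch only assembles the SQL text and does not contact BigQuery.

```python
def federated_query(connection_id: str, external_sql: str) -> str:
    """Wrap a query for an external database in GoogleSQL's EXTERNAL_QUERY.

    Both arguments are illustrative assumptions; external_sql is expected
    to be a single-line statement.
    """

    def quote(value: str) -> str:
        # Escape backslashes first, then single quotes, so the value is a
        # valid single-quoted GoogleSQL string literal.
        escaped = value.replace("\\", "\\\\").replace("'", "\\'")
        return f"'{escaped}'"

    return (
        "SELECT * FROM EXTERNAL_QUERY("
        f"{quote(connection_id)}, {quote(external_sql)})"
    )


# Hypothetical connection ID and external table, for illustration only.
sql = federated_query(
    "my-project.us.my-connection",
    "SELECT id, name FROM customers WHERE status = 'active'",
)
print(sql)
```

The resulting string could then be submitted through any of the client libraries, for example via `Client.query` in the `google-cloud-bigquery` Python package, which requires a configured project and credentials.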

📖 A highly rated canonical book on it is « Google BigQuery: The Definitive Guide », a comprehensive reference. Another enriching read on the subject is the inside story told in the article by the founding product manager of BigQuery celebrating its 10th anniversary.

https://github.com/snithish/tpc-ds_big-query

Scripts to execute TPC-DS on BigQuery

benchmark bigquery tpc-ds-benchmark tpc-ds-queries

Last synced: 07 May 2025

https://github.com/sigpwned/litecene

A simple cross-data store full-text search language for Java 8+

bigquery full-text-search java query-language search

Last synced: 12 Apr 2025

https://github.com/mchmarny/pubsub-to-bigquery-pump

Simple utility combining Cloud Run and Stackdriver metrics to drain JSON messages from a Pub/Sub topic into a BigQuery table

bigquery cloudrun events golang metrics pubsub stackdriver

Last synced: 21 Apr 2025

https://github.com/memsjava/bigquery-helper

A helper package for Google BigQuery operations

bigquery google pandas-dataframe

Last synced: 01 Aug 2025

https://github.com/dmytrovoytko/data-engineering-amazon-reviews

Data Engineering project for ZoomCamp'24: JSONL -> PostgreSQL/BigQuery + Metabase + Mage.AI

bash-script bigquery codespaces data-analysis data-visualization etl metabase pipeline python-script

Last synced: 31 Jul 2025

https://github.com/dp6/raft-suite-hub

The Hub is the solution responsible for centralizing data consolidation in BigQuery, the tool chosen to serve as the data warehouse for raft-suite.

bigquery data data-quality google-cloud google-cloud-functions hacktoberfest

Last synced: 28 Jun 2025

https://github.com/fairscript/interact

A database interaction library for node.js/JavaScript/TypeScript that uses code reflection to maximize type safety and minimize friction. Supports PostgreSQL, Google BigQuery and SQLite.

bigquery data database linq orm postgresql reflection sql sqlite typesafe

Last synced: 19 Apr 2025

https://github.com/tamanobi/bq-query-unittest

A tool for testing the logic of BigQuery queries while keeping the amount of data scanned to a minimum

bigquery sql unittest

Last synced: 22 Jun 2025

https://github.com/doitintl/terraform-bq-scheduled-queries

This is a demo project to use Terraform to manage BigQuery scheduled queries with Cloud Build CI/CD

bigquery cicd cloudbuild terraform

Last synced: 30 Apr 2025

https://github.com/fivetran/zetasql-npm-examples

This repo contains examples of usage of zetasql-npm library

bigquery grpc sql zetasql

Last synced: 16 Apr 2025

https://github.com/corneliusweig/krew-index-tracker

Saves download statistics of `krew.dev` plugins to BigQuery

bigquery history krew krew-index statistics

Last synced: 21 Apr 2025

https://github.com/kellyjadams/run-sql-in-python

Scripts to connect python to BigQuery or a PostgreSQL database.

bigquery postgresql python

Last synced: 29 Jul 2025

https://github.com/wayfair-incubator/gbq

Python wrapper for interacting with Google BigQuery.

bigquery gbq google google-bigquery google-cloud-platform hacktoberfest python

Last synced: 03 Feb 2026

https://github.com/lixx21/airflow-dbt-gcp

A comprehensive data pipeline leveraging Airflow, DBT, Google Cloud Platform (GCP), and Docker to extract, transform, and load data seamlessly from a staging layer to a data warehouse and data mart.

airflow bigquery data-engineer dbt gcp

Last synced: 10 Apr 2025

https://github.com/tobked/fetch-apache-ga-stats

Repository to make "snapshots" of GitHub Action queue for later analysis

bigquery gcp github github-actions

Last synced: 26 Feb 2025

https://github.com/gr8distance/blanton

BigQuery API wrapped by Elixir

bigquery bigquery-schema elixir

Last synced: 25 Mar 2025

https://github.com/k1low/tbls-meta

tbls-meta is an external subcommand of tbls for applying metadata managed by tbls to the datasource.

bigquery data-catalog-management

Last synced: 01 Apr 2026

https://github.com/dataform-co/bigquery-ml-pipeline

An example of a machine learning pipeline on BigQuery ML using Dataform

bigquery bigquery-ml dataform machine-learning-pip sql

Last synced: 19 Sep 2025

https://github.com/trocco-io/embulk-output-bigquery_java

A faster, Java-based Embulk output plugin to load/insert data into Google BigQuery

bigquery embulk etl java

Last synced: 27 Feb 2026

https://github.com/kitta65/bq2cst

Parse GoogleSQL, the SQL dialect used by BigQuery, into a concrete syntax tree

bigquery parser rust sql

Last synced: 22 Feb 2026

https://github.com/blockchain-etl/iotex-etl

ETL (extract, transform and load) tools for ingesting IoTeX blockchain data to Google BigQuery and Pub/Sub

bigquery blockchain-data iotex sql

Last synced: 14 Apr 2025

https://github.com/trk54ylmz/spark-bigquery

Google BigQuery support for Spark SQL

bigquery spark

Last synced: 12 May 2025

https://github.com/hsbc/bqtools

The code repo for bqtools-json, a package for managing BigQuery using JSON exemplar data structures, and home of the bqsync utility.

backup bigquery google-cloud

Last synced: 14 Jul 2025

https://github.com/jordicenzano/brighcove-live-ssai-ccu

POC (proof of concept) to show a possible way to calculate the real time CCU (concurrent viewers) for any Brightcove live stream with SSAI

analytics bigquery brightcove gcp-ap gcp-appengine-flex gcp-cloud-functions hls live streaming video

Last synced: 21 Aug 2025

https://github.com/e-nikitin/laravel-bigquery

Laravel BigQuery Wrapper

bigquery laravel-bigquery

Last synced: 11 Apr 2025

https://github.com/dav009/dbtmock

End-to-end unit tests for dbt (data build tool) pipelines

bigquery data-build-tool dbt mock pipelines test testing unittest unittesting

Last synced: 27 Apr 2026

https://github.com/webocs/mining-github-microservices

GitHub mining replication package for the article "Microservices in the Wild: the Github Landscape". It is a short Node program that takes a prefiltered set of GitHub repositories (filtered with Google BigQuery) and uses the GitHub API to find the ones that have X stars.

bigdata bigquery microservices node

Last synced: 25 Sep 2025

https://github.com/pierrec1024/airflow-provider-bigquery-reservation

Airflow provider for bigquery reservation operators.

airflow bigquery reservation

Last synced: 30 Oct 2025

https://github.com/glific/recipes

Sample projects to integrate Glific with third party services.

bigquery google-sheets nestjs nodejs postgresql

Last synced: 30 Jul 2025

https://github.com/shnewto/bqjson

bqjson - Serialize/Deserialize BigQuery TableResults to/from JSON

bigquery java json maven serde serde-json serialization serializer tableresult testing tests

Last synced: 17 Mar 2025

https://github.com/cata-network/cadence-docs

cadence document, Chinese version

bigquery

Last synced: 11 Apr 2025

https://github.com/staffbase/bigquery-github-action

GitHub Action for writing data into BigQuery from within a GitHub workflow

actions bigquery github-actions

Last synced: 19 Jul 2025

https://github.com/wintermi/movielens-dataform

An example Dataform project which will use the publicly available Movielens dataset to demonstrate how to upload your product catalog and user events into either the Google Cloud Retail API or Google Cloud Discovery Engine and train a personalised product recommendation model.

bigquery dataform google-cloud google-cloud-platform vertex-ai

Last synced: 05 May 2025

https://github.com/sukanyabag/gcp-ai-notebooks

This repository contains all practice notebooks with which I performed hands-on labs in Google Cloud Training Program's "Cloud ML-AI Track"

bigquery cloudml-samples data-science dataprep tensorflow-tutorials

Last synced: 08 Feb 2026

https://github.com/lin-jun-xiang/pyga4

📊Python Google Analytics 4 (GA4) Data Extraction and Analysis Toolkit

bigquery free ga ga4 google-analytics google-analytics-python-api python

Last synced: 19 Mar 2025

https://github.com/realkinetic/codelab

Codelab for Google Cloud (BigQuery, NLP, GAE)

bigquery cloud google google-cloud natural-language-processing python

Last synced: 13 Aug 2025

https://github.com/rittmananalytics/ra_dbt_to_dataform

An open-source tool that partially automates the migration of dbt packages to Dataform

bigquery dataform dbt dbt-core migration-tool

Last synced: 20 Jun 2025

https://github.com/nodeart/koatuu-to-bigquery

Load KOATUU from https://data.gov.ua/dataset/dc081fb0-f504-4696-916c-a5b24312ab6e into Google BigQuery in denormalized form

bigquery google-bigquey koatuu

Last synced: 13 May 2025

https://github.com/stkchan/googleanalytics4-publicdataset-ecommerce-dashboard-powerbi

This dashboard uses Power BI Desktop as a visualization tool by extracting data from Google BigQuery.

analytics bigquery dashboard portfolio portfolio-project powerbi sql

Last synced: 12 Aug 2025

https://github.com/moh-ayman/mongodb-to-bigquery---cloud-func-etl

A Google Cloud Function that performs an ETL job: it collects MongoDB data and transforms it so that it can be imported into BigQuery.

bigquery etl-pipeline gcp-cloud-functions mongodb pandas-python

Last synced: 12 Apr 2025

https://github.com/ymyzk/prom2bq

Copy data from Prometheus to BigQuery

bigquery go prometheus

Last synced: 15 Mar 2025

https://github.com/adam-cowley/neo4j-bigquery

Yo dawg, I heard you like queries so we put some BigQuery in your query so you can query BigQuery from your query

bigquery cypher neo4j neo4j-procedures

Last synced: 08 May 2026

https://github.com/viant/bqwt

BigQuery Windowed Tables

bigquery etl

Last synced: 03 Aug 2025

https://github.com/wintermi/bq2csv

A command line application designed to provide a simple method to execute a BigQuery SQL script from "stdin", outputting all results to "stdout" in CSV format. A detailed log is output to the console "stderr" providing you with the available execution statistics.

bigquery google-cloud google-cloud-platform

Last synced: 30 Oct 2025

https://github.com/takegue/bqmake

BigQuery Powered Data Build Suite.

bigquery sql

Last synced: 30 Oct 2025

https://github.com/jroakes/npath

Exploring path sequences in GA4 BigQuery data

analytics bigquery pathfinding-algorithm

Last synced: 28 Apr 2026

https://github.com/bzzt/alchemy_table

Opinionated framework for working with Bigtable and BigQuery

bigquery bigtable database elixir gcp googlecloud googlecloudplatform

Last synced: 28 Jul 2025

https://github.com/cloudymoma/rustmq

Cloud-native distributed message queue with storage-compute separation architecture. Licensed under BDL 1.0 (academic use permitted, commercial use requires license).

bigquery cloud-native distributed-systems google-cloud message-queue object-storage quic rust streaming webassembly

Last synced: 19 Sep 2025

https://github.com/badal-io/dataflow-timeseries-iot-gas-demo

Dataflow code for integration with GCP Core IoT and FogLamp

bigquery dataflow foglamp

Last synced: 15 Jun 2025

https://github.com/jolares/example-gcp-dataform

Example end-to-end ELT data pipeline using GCP Dataform.

bigquery dataform etl-pipeline

Last synced: 15 Apr 2025

https://github.com/airscholar/dbt-bigquery-crash-course

A deep dive into the powerful combination of DBT and BigQuery, the game-changers in modern data engineering.

bigquery data-engineering dbt google-cloud

Last synced: 10 Apr 2025

https://github.com/phstudy/postgresql-zetasketch

ZetaSketch HLL++ functions for PostgreSQL

bigquery hll java postgresql postgresql-extension zetasketch

Last synced: 09 May 2026

https://github.com/googlecloudplatform/dcm2bq

A service for creating a JSON metadata representation for DICOM from multiple input sources and storing it in Google Cloud BigQuery (BQ).

bigquery dicom gcs googlecloud googlecloudplatform googlecloudstorage json

Last synced: 28 Feb 2026

https://github.com/poogles/pytest-bq

pytest fixtures for a local BigQuery instance, suitable for local development.

bigquery bigquery-emulator pytest

Last synced: 11 Feb 2026

https://github.com/42digital/bqtools

Python Tools for BigQuery

bigquery bigquery-schema migrations python

Last synced: 09 Apr 2025

https://github.com/wintermi/bqrunner

A command line application designed to provide a simple method to execute one or more SQL queries against a given dataset in BigQuery. A detailed log is output to the console providing you with the available execution statistics.

bigquery google-cloud google-cloud-platform

Last synced: 12 Jan 2026

https://github.com/adbc-drivers/bigquery

ADBC drivers for BigQuery

adbc arrow bigquery database

Last synced: 11 Mar 2026

https://github.com/yashika-malhotra/strategic-analysis-of-retail-brand-in-south-america-using-sql

Uses BigQuery and MySQL to analyze 100K records from a retail brand in South America for sales optimization, trend identification, and customer satisfaction, and provides insights and recommendations for growing its user base and improving its services

bigquery data-analysis data-science database database-schema google-bigquery mysql-server sql

Last synced: 18 Apr 2026

https://github.com/itkq/incidentio-to-bq

Load incident data from incident.io to BigQuery to analyze the MTTR etc.

bigquery incidentio

Last synced: 13 May 2026

https://github.com/googlecloudplatform/aicoe

This repository contains an end-to-end walkthrough that uses Google Cloud services to demonstrate Solution Accelerators for a few business domains

aimlops bigquery dataflow googlecloudplatform mlops security-analytics solution-accelerators vertex-ai

Last synced: 20 Oct 2025

https://github.com/greenpeace/gpes-bigquery-recipes

Google BigQuery recipes to analyse our data.

bigquery database-management sql

Last synced: 11 May 2025

https://github.com/neo4j-field/bigquery-connector

Bi-directional connectivity between Google BigQuery and Neo4j AuraDS

arrow-flight bigquery neo4j protobuf python spark

Last synced: 13 Apr 2025

https://github.com/indatawetrust/bigquery-api

A quick and easy to use package for Google Cloud BigQuery

bigdata bigquery google-cloud

Last synced: 16 Jul 2025

https://github.com/olajideolagunju/gcp_mage_data_pipeline

An end-to-end data engineering pipeline that processes and analyzes Maintenance Work Orders using Mage, Docker, Google BigQuery, MariaDB, and Looker Studio. It features a seamless integration of cloud and open-source tools for scalable data storage, transformation, and visualization.

automation bigquery cloud compute-engine data data-engineering database database-schema docker-compose excel gcp mage-ai maintenance mariadb orchestration python sql virtual-machine visualization-dashboard work-orders

Last synced: 07 Mar 2025

https://github.com/42DIGITAL/bqtools

Python Tools for BigQuery

bigquery bigquery-schema migrations python

Last synced: 02 Apr 2025

https://github.com/gjbae1212/go-bqworker

go-bqworker is an async worker that can bulk insert and update data in BigQuery.

async bigquery bigquery-bulk gcp go golang parallel worker

Last synced: 26 Apr 2026

https://github.com/wintermi/fashion-dataform

An example Dataform project to load and transform the publicly available dataset from H&M Group into a format which could be imported into Discovery AI for Retail or Vertex AI Search and Conversation, allowing you to train a retail recommendations model.

bigquery dataform google-cloud google-cloud-platform vertex-ai

Last synced: 05 May 2025

https://github.com/m-mizutani/bqs

BigQuery Schema utility in Go

bigquery bigquery-schema go

Last synced: 06 Sep 2025

https://github.com/galois1915/google-ml-engineer

This program provides the skills you need to advance your career and provides training to support your preparation for the industry-recognized Google Cloud Professional Machine Learning Engineer certification.

api automl bigquery keras mlops-workflow tensorflow2 vertex-ai

Last synced: 16 Feb 2026

https://github.com/huuvuong0912/rag-llm-based-recommender

Explore a smarter way to shop online with this full-stack project built on the infrastructure of Google Cloud Platform (GCP) for RAG based e-commerce with LLM.

bigquery fastapi langchain llm-inference llmops pyspark rag react recommender-system vertex-ai

Last synced: 13 Oct 2025

https://github.com/wintermi/bqwrite-test

A command line application designed to provide a method to test the BigQuery Streaming API or BigQuery Storage Write API, allowing you to get a view of the potential throughput available via a given host.

bigquery google-cloud google-cloud-platform

Last synced: 12 Mar 2026

https://github.com/vigneshss-07/google-cloud-professional-data-engineer-acompleteguide

This repo contains all study, lab, and supporting materials for the Udemy course "Google Cloud Professional Data Engineer - A Complete Guide".

big-data bigquery cloud-computing dataengineering elt-pipeline etl-framework gcp-services gcp-storage google-cloud machine-learning

Last synced: 06 Mar 2026

https://github.com/d8a-tech/d8a

A data collection service fully compatible with GA4 tracking protocols. Ingest into ClickHouse or BigQuery database while maintaining complete control over your data.

bigquery clickhouse data ga4 tracker

Last synced: 10 Apr 2026

https://github.com/takegue/bigquery-porter

BigQuery Deployment and Metadata Management tool

bigquery

Last synced: 12 Feb 2026

https://github.com/phenrickson/bgg-data-warehouse

ETL process for BGG cloud data warehouse

bigquery data-engineering elt-pipeline python

Last synced: 15 Apr 2026

https://github.com/pcorbel/go-bigquery-acl

Simply apply ACL on BigQuery resources

acl bigquery config golang security

Last synced: 19 Apr 2026

https://github.com/wintermi/tmdb-dataform

An example Dataform project to load and transform the publicly available dataset from The Movie Database into a format which could be imported into Vertex AI Search for Media, allowing you to build a search engine for movies.

bigquery dataform google-cloud google-cloud-platform

Last synced: 07 Feb 2026

https://github.com/rezuankassim/bqanalytic

Laravel package for using analytics data imported into BigQuery from Firebase Analytics

bigquery firebase-analytics laravel

Last synced: 01 Feb 2026

https://github.com/ovotech/bigquery-metrics-exporter

A Golang application to export table level metrics from BigQuery into Datadog.

bigquery company-ovo datadog

Last synced: 06 Feb 2026

https://github.com/oguzgn/firebase-ab-test-analysis-for-a-mobile-race-game

This repository showcases an infrastructure designed for analyzing A/B tests in mobile games. It leverages BigQuery to process Firebase and GA4-based event data and uses Looker Studio for dynamic visualization. The project simplifies A/B test comparisons, enabling stakeholders to view results directly through interactive dashboards.

ab-testing ab-testing-analysis bigquery event-based-tracking firebase looker-studio mobile-game-analytics race-game sql

Last synced: 19 May 2026

https://github.com/tuancamtbtx/gcp-udfs-example

Google BigQuery JavaScript UDF Examples

bigquery gcp javascript nodejs npm udf

Last synced: 04 Jul 2025

https://github.com/t3n/gtmetrix-bq

A script that runs browser tests of specified URLs through GTmetrix and saves the metrics in BigQuery.

bigquery gtmetrix

Last synced: 15 May 2026

https://github.com/seftimie/feedyext

Feedy EXT - Brand Sentiment Analysis

bigquery chrome crun gcp genai hackathon llms looker ml ondevice

Last synced: 28 Jan 2026

https://github.com/taquynhnga2001/proptech-dagster

Build an ELT pipeline with Dagster and dbt that schedules the loading of HDB resale transactions in Singapore into a Google BigQuery data warehouse, then create a Power BI dashboard for exploring insights.

bigquery dagster data-integration data-orchestration data-warehouse dbt elt etl powerbi python

Last synced: 14 Feb 2026

https://github.com/tuanai-vireox/gcp-udfs-example

Google BigQuery JavaScript UDF Examples

bigquery gcp javascript nodejs npm udf

Last synced: 19 Apr 2026

https://github.com/toddbirchard/bigquery-to-sql

📊 ➡️ 💾 Lightweight ETL script to migrate data from BigQuery to SQL.

bigquery etl google-cloud python sql sqlalchemy

Last synced: 09 Feb 2026

https://github.com/ryanmcdowell/dataflow-bigquery-dynamic-destinations

An example pipeline for dynamically routing events from Pub/Sub to different BigQuery tables based on a message attribute.

apache-beam bigquery google-cloud-dataflow google-cloud-platform

Last synced: 09 Sep 2025
