Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
BigQuery
Google BigQuery lets companies analyze large amounts of data without managing infrastructure. Google’s documentation describes it as a « serverless architecture (that) lets you use SQL queries to answer your organization’s biggest questions with zero infrastructure management. BigQuery’s scalable, distributed analysis engine lets you query terabytes in seconds and petabytes in minutes. » Client libraries are available for widely used languages such as Python, Java, JavaScript, and Go. Federated queries are also supported, so BigQuery can read data from external sources.
📖 A highly rated, comprehensive reference on the subject is the book « Google BigQuery: The Definitive Guide ». Another worthwhile read is the inside story told by BigQuery’s founding product manager in an article celebrating its 10th anniversary.
- GitHub: https://github.com/topics/bigquery
- Wikipedia: https://en.wikipedia.org/wiki/BigQuery/
- Repo: https://github.com/GoogleCloudPlatform/bigquery-utils/
- Released: May 19, 2010
- Related Topics: cloud-computing
- Aliases: bq
- Last updated: 2025-01-31 00:03:17 UTC
https://github.com/humairarizwan/uber-ride-dataengineering-analysis
This project creates a pipeline to process data and performs data analytics on Uber data.
bigquery dataanalysis dataengineering gcp-project googlestorage looker-studio
Last synced: 21 Jan 2025
https://github.com/toskpl/googlecloud
Challenge 30 days - GoogleCloud
bigquery google-cloud google-cloud-platform ml
Last synced: 14 Nov 2024
https://github.com/oguzgn/data-science-for-business-imp
A case study for business improvement
ab-testing bigquery data-science data-visualization debugging looker marketing-analytics sheets
Last synced: 21 Jan 2025
https://github.com/chukwuemekaaham/ny_taxi_rides
Analytics engineering using dbt and Google Cloud BigQuery
analytics-engineering bigquery dbt github
Last synced: 10 Jan 2025
https://github.com/abdullahasghar/sql
The repo includes all projects and assessments I have completed with SQL. IDEs used: MS SQL Server, Google BigQuery.
Last synced: 18 Jan 2025
https://github.com/eddieatgoogle/sql-based-genai-data-pipeline
GenAI data pipeline that performs data preparation, management and performance evaluation tasks for RAG systems using SQL as the primary development language. Please feel free to use this as a starting point for your own projects.
bigquery bqml dataform embeddings gemini google-cloud-platform sql vector-search vertex-ai
Last synced: 08 Jan 2025
https://github.com/markjamesbutler/dbt-fundamentals-bigquery
Implementation of dbt fundamentals training course material using BigQuery.
bigquery dbt dbt-fundamentals fundamentals jinja2 practice-tasks sql
Last synced: 16 Jan 2025
https://github.com/garbetjie/monolog-bigquery-handler
A simple Monolog handler for writing to BigQuery.
bigquery logging monolog monolog-handler
Last synced: 16 Jan 2025
https://github.com/ngangawairimu/sales-analysis-and-customer-insights
This project features SQL queries for detailed customer and sales analysis: Customer Analysis and Sales Reporting
bigquery bigquery-dataset excel sql
Last synced: 28 Jan 2025
https://github.com/riju18/airflow-data-engineering-with-bigquery-and-dbt
Fetch data from a simple CSV file, load it into a GCP BigQuery table, run dbt to automate the DWH, and run Soda to check data quality.
apache-airflow bigquery csv dbt python3 soda
Last synced: 28 Jan 2025
https://github.com/pittica/google-bigquery-helpers
Helpers for Google Cloud BigQuery.
bigquery gcp google-cloud-platform pittica
Last synced: 13 Nov 2024
https://github.com/ackeecz/terraform-gcp-cloud-function_pubsub_to_bq
Cloud Function that subscribes to a given topic and inserts each message into a BigQuery table.
bigquery cloud-functions pubsub terraform-module
Last synced: 07 Jan 2025
https://github.com/ackeecz/terraform-gcp-cloud-run_pubsub_to_bq
Cloud Run service that subscribes to a given topic and inserts each message into a BigQuery table.
Last synced: 07 Jan 2025
https://github.com/lambdamusic/dimschema
CLI to retrieve SQL schema information about the Dimensions on Google BigQuery dataset.
bigquery dimensions python scholarly-metadata
Last synced: 12 Jan 2025
https://github.com/oleksiilatypov/google_cloud
AI & Data, Google Cloud Skills Boost
bigquery document-ai ml vertexai
Last synced: 18 Jan 2025
https://github.com/karencofre/riesgorelativo-lookerstudio
Data analysis and predictive analysis project in Looker Studio and Google Colab
bigquery data-analysis data-science machine-learning matplotlib python sklearn sql
Last synced: 22 Jan 2025
https://github.com/zenklinov/correlation-nybikers-with-weather-using-bigquery
Last synced: 22 Jan 2025
https://github.com/zborovskaanna/e-commerce-web-events-analysis
SQL project based on the BigQuery public database 'The Look e-Commerce', with a dashboard in Looker Studio
analysis bigquery dashboard data-visualization looker-studio sql
Last synced: 22 Jan 2025
https://github.com/azapeti/bigquery-python-bash-automation
With the free version, the Google Analytics API only returns data from your website for the last 60 days. This repository demonstrates how to run BigQuery queries in Python and automate them with bash and crontab to collect historical data.
analytics automation bash bigquery cronjob crontab ga4 python python3
Last synced: 22 Jan 2025
https://github.com/robinnoiret/importcsv_zendeskbigquery
This project involves developing a Python script to import a CSV export from Zendesk into BigQuery. It is not intended for recurring use, but to enable an initial dump of historical data.
bigquery connector export-csvfile json zendesk
Last synced: 22 Jan 2025
https://github.com/acardosolima/crypto-ethereum-tokens
This project aims to create a data pipeline using Airflow to ingest a dataset from Google BigQuery into a PostgreSQL database. The stack runs in a local environment using Kubernetes.
airflow bigquery postgresql python
Last synced: 22 Jan 2025
https://github.com/andre-gitdev/stocks-functions
This project is for EDA related to stock trading.
alpaca alpaca-trading-api bigquery google-cloud portfolio-optimization robinhood-api robinhood-portfolio stock-analysis stock-data stock-price-prediction stocks-api stocks-trading
Last synced: 22 Jan 2025
https://github.com/lisabensoussan/bigdata_midterm
This project focuses on analyzing Stack Overflow data related to JavaScript and Python questions using a combination of SQL queries (Google BigQuery) and Unix shell commands. The aim is to explore trends, activity patterns, and user behavior around these popular programming languages through data wrangling and querying techniques.
bigquery data-cleaning sql unix-command unix-shell
Last synced: 22 Jan 2025
https://github.com/panagiotischaviaropoulos/google-data-analytics-case-study
bigquery data-visualization sql
Last synced: 22 Jan 2025
https://github.com/thecodersstudio/node-native-test-runner
Code samples and test cases showcasing the power of Node.js's native test runner for streamlined and efficient testing.
bigquery mock nodejs nodejs-test nodenativetestrunner test
Last synced: 22 Jan 2025
https://github.com/noospheracr/twilio-segment-configs
Integration of Twilio Segment with Google BigQuery, Looker/PowerBI, and Google VertexAI to create a data-driven marketing platform
bigquery google-cloud-platform looker-studio marketing noosphera power-bi twilio-segment vertex-ai
Last synced: 22 Jan 2025
https://github.com/thanhloc81/customer-segmentation
✨ Analyze customer segments of Adventure World dataset
bigquery google-cloud powerbi sql
Last synced: 22 Jan 2025
https://github.com/jasontanx/mas-international-arrivals
Code repository about international arrivals into Malaysia
bigquery data-analytics data-engineering etl-pipeline international-arrivals
Last synced: 22 Jan 2025
https://github.com/hanif-syazul/analyzing-kimia-farma-sales-performance-with-gcp
This repository contains the final project for the Rakamin Big Data Analytics Internship. It includes a complete dashboard of Kimia Farma's sales performance analysis from 2020 to 2023.
big-data-analytics bigquery internship-project kimia-farma looker-studio rakamin sql
Last synced: 22 Jan 2025
https://github.com/zeinhasan/etl-using-airflow
Extract Transform Load Using Airflow
Last synced: 22 Jan 2025
https://github.com/yasarsultan/olist_datawarehouse
An end-to-end data pipeline that extracts data, processes it, and then loads it into the BigQuery data warehouse.
airflow bigquery data-warehouse docker
Last synced: 22 Jan 2025
https://github.com/ahbiels/chatbot_analize_avaliation
A bot built in Dialogflow CX that lets users rate one of the company's products. After the rating, the bot runs sentiment analysis on the user's review and stores the result (together with the review text, user name, and product) in a BigQuery dataset.
bigquery chatbot dataset dialogflow dialogflow-cx documentation flask gcp google-cloud iterator language-model nlu nlu-chatbot python sql
Last synced: 22 Jan 2025
https://github.com/mutaharshaik/airflow_retail_project
Airflow retail project using pipeline with BigQuery, dbt, Soda
airflow astro-cli astro-sdk bigquery datamodeling dbt docker etl-pipeline gcp snowflake soda
Last synced: 22 Jan 2025
https://github.com/victorcezeh/end-to-end-elt-pipeline
An end-to-end ELT project using the Brazilian E-Commerce dataset from Kaggle. This project demonstrates the use of Python, PostgreSQL, Docker, Docker Compose, Airflow, dbt, and BigQuery to ingest, transform, and analyze data, providing insights into sales, delivery times, and order distributions.
airflow bigquery dbt-core docker docker-compose postgresql python
Last synced: 22 Jan 2025
https://github.com/mattwelke/charter-challenge-for-fair-voting-bot
Bot that scrapes the donations made so far to the Charter Challenge for Fair Voting and logs them in BigQuery.
bigquery bot go openwhisk public-data
Last synced: 22 Jan 2025
https://github.com/allanreda/share-of-search-retrieval-and-visualization
Share of search analysis including data retrieval from Google Ads API, storing data in BigQuery and visualizing it in Looker Studio
bigquery google-ads-api looker-studio python share-of-search
Last synced: 28 Dec 2024
https://github.com/lorinczakos/sql-projects
A collection of SQL scripts that I wrote and had approved during the GoIT Romania Data Analyst course
bigquery cte data data-analysis dbeaver marketing-analytics postgresql project-repository sql vscode
Last synced: 28 Jan 2025
https://github.com/wooyakob/music-recommendation-engine
Using Gemini API to generate personalized music recommendations.
ai bigquery gemini-api google-cloud-platform
Last synced: 28 Jan 2025
https://github.com/lucashomuniz/project-20
[Dashboard] Optimizing Supermarket Operations Through Data Analytics
bigquery data-analysis data-structures data-visualization database google-cloud-platform powerbi powerbi-visuals sql sql-query
Last synced: 28 Jan 2025
https://github.com/shegzimus/de_nasa_neow_pipeline
Airflow powered ETL pipeline for moving Near-Earth-Object data from NASA to Google Cloud
airflow-dag airflow-operator airflow-providers bigquery celery-redis docker docker-compose docker-container google-cloud-platform googlecloudstorage nasa-api
Last synced: 27 Jan 2025
https://github.com/tosh2230/bigquery-table-history
Diff daily changes by BigQuery INFORMATION_SCHEMA.PARTITIONS records.
Last synced: 21 Jan 2025
https://github.com/squidmin/java11-spring-gradle-bigquery-reference
Java v11 ⋅ Spring v2 ⋅ Gradle ⋅ BigQuery
bigquery gradle gradle-java java java-gradle java11 java11-spring-boot spring spring-boot-2 spring-mvc spring-rest
Last synced: 22 Jan 2025
https://github.com/phstudy/zetasketch-bigquery-example
An example demonstrating how to use ZetaSketch with BigQuery
Last synced: 21 Jan 2025
https://github.com/oguzgn/a-case-study-for-a-livestreaming-platform
This project aims to analyze livestream watch times of users across different regions. The goal is to identify the top 5 users with the highest watch time for each region. The analysis involves multiple SQL transformations to extract meaningful insights from the data.
bigquery data data-analysis data-modeling live-streaming sql
Last synced: 27 Jan 2025
https://github.com/oliveroneill/wilt-cloud-functions
Wilt Google Cloud Functions
bigquery google-cloud-functions
Last synced: 07 Jan 2025
https://github.com/ymyzk/bq-globalip
Record the current global IPv4 address to a BigQuery table.
Last synced: 28 Jan 2025
https://github.com/garbetjie/phpunit-bigquery-schema
A BigQuery schema validator constraint for BigQuery
Last synced: 28 Jan 2025
https://github.com/abel3581/consultas-bigquery-gcp
BigQuery queries on GCP
angular bigquery gcp-bigquery java-11 spring-boot tailwindcss
Last synced: 22 Jan 2025
https://github.com/sangnandar/insert-unique-record
A Cloud Functions script that inserts only unique records into BigQuery.
bigquery digital-marketing-analytics google-cloud-functions
Last synced: 29 Dec 2024
https://github.com/vigneshss-07/complete-atoz-sql
This deals with SQL commands, interview preparation and query questions and solutions
azuresql bigquery gcp sql sql-query sql-server sqlalchemy
Last synced: 15 Nov 2024
https://github.com/night-fury-me/real-time-vehicle-data-processing
A repository that contains implementation of a Real-Time Vehicle Data Processing Pipeline that efficiently manages and analyzes vehicle data through a cohesive system.
bigquery cpp data-engineering data-streaming flink grpc kafka python real-time-data-processing
Last synced: 22 Jan 2025
https://github.com/ivanildobarauna/pypi-package-stats
Project that ingests PyPI package data from BigQuery and sends it to Datadog for analysis and insights with dashboards, monitors, and more
bigquery cloud data-engineering data-warehouse gcp software-engineering
Last synced: 29 Dec 2024
https://github.com/jancervenka/bqcli
REPL for BigQuery
bigquery data-science gcp google python
Last synced: 31 Dec 2024
https://github.com/hackolade/bigquery
Hackolade (https://hackolade.com) plugin for BigQuery
bigquery bigquery-schema data-modeling data-models entity-relationship-diagram er-diagram nosql nosql-databases schema-design
Last synced: 17 Nov 2024
https://github.com/iht/bigquery-dataflow-cdc-example
A Dataflow streaming pipeline written in Java, reading data from Pubsub and recovering the sessions from potentially unordered data, and upserting the session data into BigQuery with no duplicates
apache-beam bigquery cdc dataflow google-cloud pubsub
Last synced: 29 Dec 2024
https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020
Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).
bigquery data data-analysis data-visualization python sql tableau
Last synced: 29 Dec 2024
https://github.com/digitaloptimizationgroup/digitaloptgroup-r-notebooks
A collection of R notebooks to analyze data from the Digital Optimization Group Platform
ab-testing bigquery jupyter-notebook performance-analysis r web-analytics
Last synced: 21 Jan 2025
https://github.com/ruru-lyy/nyc-taxi-service-pipeline
In this project, I built a data pipeline using Mage.ai for ETL, GCP for storage, BigQuery for querying, and Looker Studio for analytics. This project helped me learn how to process, store, and visualize data effectively using modern tools.
bigquery data-engineering data-modeling etl-pipeline looker mage-ai python
Last synced: 23 Jan 2025
https://github.com/edumoraes1/spam_count_sfmc
SQL query counting email sends and spam over the last 365 days
bigquery marketing-cloud salesforce sql
Last synced: 31 Dec 2024
https://github.com/yoshiyukikato/nightharbor-bigquery-reporter
A nightharbor reporter for GCP BigQuery
Last synced: 23 Jan 2025
https://github.com/victorcezeh/data-engineering-final-semester-portfolio
This GitHub repository serves as a comprehensive platform for managing and showcasing my data engineering projects and assessments throughout my final semester at Alt School Africa. Designed to foster collaboration, organization, and continuous improvement, this repository is the backbone of my academic journey in data engineering.
bigquery docker gcs-bucket postgresql python
Last synced: 17 Nov 2024
https://github.com/yandex-cloud-examples/yc-bigquery-to-object-storage
Exporting data from Google BigQuery via Google Storage to Yandex Cloud Object Storage.
bigquery object-storage python3 yandex-cloud yandexcloud
Last synced: 29 Dec 2024
https://github.com/djdhairya/uber-data-analytics
Mage Vm
aiml api bigdata bigquery deep-learning docker google-maps-api ml python3 sql ssh vmware
Last synced: 07 Jan 2025
https://github.com/isaacmg/mimic_iv_bq_queries
Queries needed to recreate time series features for model training
Last synced: 21 Jan 2025
https://github.com/drvipulasharma/e-commerce-data-analysis-sql-big---query
E-Commerce-Data-Analysis-SQL-Big-Query
Last synced: 23 Jan 2025
https://github.com/davelester/gharchive-bigquery-examples
Examples Using BigQuery to Analyze GH Archive Data
Last synced: 06 Dec 2024
https://github.com/rifa8/extract-load-demo
Learning Google Cloud Platform (GCP)
Last synced: 27 Jan 2025
https://github.com/xennis/particulate-matter-sensor-storage
Store the particulate matter data from a luftdaten.info sensor in BigQuery
bigquery cloud-function luftdaten particulate-matter sensor-data
Last synced: 18 Nov 2024
https://github.com/lixx21/airflow-dbt-gcp
A comprehensive data pipeline leveraging Airflow, DBT, Google Cloud Platform (GCP), and Docker to extract, transform, and load data seamlessly from a staging layer to a data warehouse and data mart.
airflow bigquery data-engineer dbt gcp
Last synced: 29 Jan 2025
https://github.com/oguzgn/fully-automated-performance-marketing-dashboard
This project integrates data from multiple ad platforms with Google Analytics to track marketing campaigns. It uses a structured naming system and UTM tags. Data is visualized in Looker Studio dashboards to analyze campaign performance and ad spend.
bigquery data-analysis data-engineering data-modeling marketing-analytics marketing-automation marketing-data-science marketingdata sql
Last synced: 29 Jan 2025
https://github.com/gabrieladados/people-analytics
People Analytics: Insights for Talent Retention
bigquery figma people-analytics sql tableau
Last synced: 29 Jan 2025
https://github.com/rifa8/data-warehouse-submission
Learning about Data Warehouse
bigquery citus columnar data-warehouse datalake gcs-bucket
Last synced: 27 Jan 2025
https://github.com/brpy/nyc-trips
Data engineering | Zoomcamp journey on nyc trip data with gcp stack
Last synced: 22 Dec 2024
https://github.com/ansh-info/stockpulse
Real-time stock market analytics pipeline with live visualization dashboard. Built with Python and GCP, featuring automated data processing and interactive Streamlit analytics.
api big-data bigquery cloud cloud-computing cloud-native data-engineering data-pipeline docker docker-compose gcp gcp-automation-gitops gcp-cloud-run gcp-pubsub google-cloud-platform real-time realtime stock-market stocks streamlit
Last synced: 27 Dec 2024
https://github.com/yiu31802/gcp-project
GCP AppEngine project of Twitter data and some sample code
appengine bigquery gcp google-bigquery google-cloud google-datastore resas twitter twitter-data twitter4j
Last synced: 07 Dec 2024
https://github.com/dobsontom/basket-abandonment
Data pipeline for detecting and responding to basket abandonment using BigQuery and Adobe Campaign.
adobe-campaign bigquery ga4 gcp sql
Last synced: 21 Nov 2024
https://github.com/niteshchawla/nc-sql-business-case
A leading retail chain and prominent retailer in the United States. It makes itself a preferred shopping destination by offering outstanding value, inspiration, innovation, and an exceptional guest experience that no other retailer can deliver.
bigquery retail sql supermarket
Last synced: 21 Jan 2025
https://github.com/erik-ingwersen-ey/iowa_sales_forecast
Iowa Liquor Sales Forecast Model
arima bigquery bigquery-ml google-cloud sales-forecast
Last synced: 21 Nov 2024
https://github.com/hrialan/dataform-prune
An open-source tool for automating the cleanup of outdated objects in Dataform configurations, optimizing data workflows with seamless CI/CD integration.
automation bigquery data-analytics dataform
Last synced: 21 Nov 2024
https://github.com/sahilmb/employee-churn-da
A data analysis project on employee churn rate using Google BigQuery, Looker, PyCaret, and Colab
bigquery looker-studio pycaret
Last synced: 21 Nov 2024
https://github.com/ket0825/v1-gcp-preview
GCP repo for the Preview service / Manage GCP src for preview services
bigquery cloud-functions cloud-run cloudbuild gcp logging pubsub
Last synced: 21 Nov 2024
https://github.com/ka-zo/booking-data-analysis
Booking data analysis
airline-booking apache-beam bigquery google-cloud looker-studio python3
Last synced: 21 Nov 2024
https://github.com/marielachirinosr/analysis-urgencias-hospital-pitalito
This project involves analyzing emergency room admission data from the E.S.E Hospital Departamental de Pitalito using a star schema model.
bigquery data data-analysis etl-pipeline tableau
Last synced: 21 Nov 2024
https://github.com/themihirmathur/uber-data-analytics
The goal of this project is to perform comprehensive data analytics on Uber trip data using a modern data engineering stack on Google Cloud Platform (GCP).
bigquery data-analysis data-engineering etl-pipeline google-cloud-platform looker python
Last synced: 21 Nov 2024
https://github.com/thanhloc81/sql-project-bicycles-practise
✨ Utilizing SQL to extract data following a simulated task involving the Sales and Product modules
adventureworks bicycle bigquery google-cloud sql
Last synced: 21 Jan 2025
https://github.com/davidkhala/gcp-collections
Notebooks for GCP services
bigquery bq databricks datastore firestore google-cloud-platform
Last synced: 21 Nov 2024
https://github.com/owox/sgtm-owox-ga4-bigquery
OWOX BI Streaming is advanced tracking that gets the most from the existing Google Analytics 4 installation on your website
Last synced: 20 Dec 2024
https://github.com/abdelnaem2002/ecommerce-analysis-dbt
Ecommerce Analysis Using dbt
bigquery dbt dbt-cloud github looker-studio sql
Last synced: 29 Jan 2025
https://github.com/valenthr/purchase_funnel
Google merch store sales analysis
Last synced: 27 Jan 2025