Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
BigQuery
Google BigQuery enables companies to handle large amounts of data without having to manage infrastructure. Google’s documentation describes it as a « serverless architecture (that) lets you use SQL queries to answer your organization’s biggest questions with zero infrastructure management. BigQuery’s scalable, distributed analysis engine lets you query terabytes in seconds and petabytes in minutes. » Its client libraries allow the use of widely known languages such as Python, Java, JavaScript, and Go. Federated queries are also supported, making it flexible to read data from external sources.
📖 A highly rated canonical book on it is « Google BigQuery: The Definitive Guide », a comprehensive reference. Another enriching read on the subject is the inside story told in the article by the founding product manager of BigQuery celebrating its 10th anniversary.
- GitHub: https://github.com/topics/bigquery
- Wikipedia: https://en.wikipedia.org/wiki/BigQuery/
- Repo: https://github.com/GoogleCloudPlatform/bigquery-utils/
- Released: May 19, 2010
- Related Topics: cloud-computing,
- Aliases: bq,
- Last updated: 2025-02-05 00:03:39 UTC
- JSON Representation
https://github.com/chiamakaukwuoma/portfolio
This repository contains various projects I've been privileged to work on outside of work.
aws-rds azure-fabric bigquery data-analysis docker-container elasticsearch excel grafana hadoop looker-studio mssql mysql postgresql powerbi python sql tableau
Last synced: 03 Feb 2025
https://github.com/valenthr/purchase_funnel
Google merch store sales analysis
Last synced: 27 Jan 2025
https://github.com/owox/sgtm-owox-ga4-bigquery
OWOX BI Streaming is an advanced tracking to get the most from existing Google Analytics 4 installed on your website
Last synced: 20 Dec 2024
https://github.com/rifa8/data-warehouse-submission
Learning about Data Warehouse
bigquery citus columnar data-warehouse datalake gcs-bucket
Last synced: 27 Jan 2025
https://github.com/rifa8/extract-load-demo
Learning Google Cloud Platform (GCP)
Last synced: 27 Jan 2025
https://github.com/isaacmg/mimic_iv_bq_queries
Queries needed to recreate time series features for model training
Last synced: 21 Jan 2025
https://github.com/djdhairya/uber-data-analytics
Mage Vm
aiml api bigdata bigquery deep-learning docker google-maps-api ml python3 sql ssh vmware
Last synced: 07 Jan 2025
https://github.com/yandex-cloud-examples/yc-bigquery-to-object-storage
Экспорт данных из Google Big Query через Google Storage в Object Storage Yandex Cloud.
bigquery object-storage python3 yandex-cloud yandexcloud
Last synced: 29 Dec 2024
https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020
Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).
bigquery data data-analysis data-visualization python sql tableau
Last synced: 29 Dec 2024
https://github.com/iht/bigquery-dataflow-cdc-example
A Dataflow streaming pipeline written in Java, reading data from Pubsub and recovering the sessions from potentially unordered data, and upserting the session data into BigQuery with no duplicates
apache-beam bigquery cdc dataflow google-cloud pubsub
Last synced: 29 Dec 2024
https://github.com/ivanildobarauna/pypi-package-stats
Project for ingest pypi packages data from BigQuery and send to DataDog for analysis and insights with dashboards, monitors and more
bigquery cloud data-engineering data-warehouse gcp software-engineering
Last synced: 29 Dec 2024
https://github.com/sangnandar/insert-unique-record
This is Cloud Functions script to insert only unique records into BigQuery.
bigquery digital-marketing-analytics google-cloud-functions
Last synced: 29 Dec 2024
https://github.com/oliveroneill/wilt-cloud-functions
Wilt Google Cloud Functions
bigquery google-cloud-functions
Last synced: 07 Jan 2025
https://github.com/oguzgn/a-case-study-for-a-livestreaming-platform
This project aims to analyze livestream watch times of users across different regions. The goal is to identify the top 5 users with the highest watch time for each region. The analysis involves multiple SQL transformations to extract meaningful insights from the data.
bigquery data data-analysis data-modeling live-streaming sql
Last synced: 27 Jan 2025
https://github.com/shegzimus/de_nasa_neow_pipeline
Airflow powered ETL pipeline for moving Near-Earth-Object data from NASA to Google Cloud
airflow-dag airflow-operator airflow-providers bigquery celery-redis docker docker-compose docker-container google-cloud-platform googlecloudstorage nasa-api
Last synced: 27 Jan 2025
https://github.com/allanreda/share-of-search-retrieval-and-visualization
Share of search analysis including data retrieval from Google Ads API, storing data in BigQuery and visualizing it in Looker Studio
bigquery google-ads-api looker-studio python share-of-search
Last synced: 28 Dec 2024
https://github.com/ackeecz/terraform-gcp-cloud-run_pubsub_to_bq
Cloud Run subscribes itself to given topic and inserts each message to BigQuery table.
Last synced: 07 Jan 2025
https://github.com/ackeecz/terraform-gcp-cloud-function_pubsub_to_bq
Cloud function subscribes itself to given topic and inserts each message to BigQuery table.
bigquery cloud-functions pubsub terraform-module
Last synced: 07 Jan 2025
https://github.com/riju18/airflow-data-engineering-with-bigquery-and-dbt
Fetch Data from a simple csv file, send the data in GCP BigQuery table, run dbt to automate the DWH and run SODA to check Data Quality.
apache-airflow bigquery csv dbt python3 soda
Last synced: 28 Jan 2025
https://github.com/eddieatgoogle/sql-based-genai-data-pipeline
GenAI data pipeline that performs data preparation, management and performance evaluation tasks for RAG systems using SQL as the primary development language. Please feel free to use this as a starting point for your own projects.
bigquery bqml dataform embeddings gemini google-cloud-platform sql vector-search vertex-ai
Last synced: 08 Jan 2025
https://github.com/oguzgn/data-science-for-business-imp
a case study for business improvment
ab-testing bigquery data-science data-visualization debugging looker marketing-analytics sheets
Last synced: 21 Jan 2025
https://github.com/humairarizwan/uber-ride-dataengineering-analysis
This project creates a pipeline to process data and performs data analytics on Uber data.
bigquery dataanalysis dataengineering gcp-project googlestorage looker-studio
Last synced: 21 Jan 2025
https://github.com/goatcheesesaladwithpeanutoildressing/scio-demo
Playing w/ Scio
Last synced: 08 Jan 2025
https://github.com/sergeimakarovv/wine-recommendation-analytics
Wine recommendation system
airflow bigquery pandas postgresql tableau
Last synced: 08 Jan 2025
https://github.com/kellyjadams/ap-exam-scores
Analyzing AP exam scores for a school.
Last synced: 08 Jan 2025
https://github.com/flowerinthenight/bqstream
A simple library to help facilitate streaming to BigQuery.
Last synced: 08 Jan 2025
https://github.com/hitthecodelabs/bigquery_ml
Jupyter notebooks that utilize Google BigQuery's machine learning capabilities.
Last synced: 04 Feb 2025
https://github.com/phukon/package-insights
PyPI package reports and insights. The data was ingested from publicly available source using BigQuery and then transformed.
Last synced: 27 Jan 2025
https://github.com/aisurjyasamantaray/-optimizing-target-s-brazilian-operations-insights-from-order-processing-pricing-and-payment-trends-
This project offers an in-depth analysis of consumer behavior, logistical performance, and payment preferences within the e-commerce sector. By examining order costs, delivery times, and payment methods, businesses can uncover valuable insights into operational efficiency and customer preferences.
bigquery consumer-insights data-analysis database sql target
Last synced: 21 Jan 2025
https://github.com/vedantwalia/google-data-analytics-capstone-case-study
This is a repository of my work on data analysis as a part of the Google Data Analytics Capstone
bigquery data data-viz datavisualization-project divvy-bikes google googledataanalytics sql tableau tableau-public
Last synced: 21 Jan 2025
https://github.com/aazuspan/landsat-bigquery
Summarizing 51 years of Landsat data using Earth Engine and BigQuery
bigquery google-earth-engine landsat
Last synced: 21 Jan 2025
https://github.com/moeabbas6/dbt_analytics_engine
An end-to-end project using dbt to demonstrate data transformations, testing, and visualization with Google BigQuery, and Looker Studio. It showcases a complete data pipeline from extraction/generation to deployment.
analytics-engineering bigquery data data-pipeline data-transformation data-visualization dbt testing
Last synced: 21 Jan 2025
https://github.com/mlund2k/project-1-baseball-performance-vs.-attendance
Project assets for my first exploratory data analysis: Baseball Performance vs. Attendance.
bigquery data-analysis data-cleaning data-visualization excel rstudio sql tableau tidyverse
Last synced: 21 Jan 2025
https://github.com/edwinrlambert/cyclistic-bike-share-analysis
This repository is part of the Google Data Analytics Capstone Project, focusing on analyzing Cyclistic's bike-sharing data to identify trends and strategies for converting casual riders to annual members. It aims to provide actionable insights for enhancing marketing efforts.
act analyze ask bigquery prepare process share sql
Last synced: 21 Jan 2025
https://github.com/lisabensoussan/bigdataminig_finalassignment
This repository contains solutions for the final assignment of the Big Data Mining course (52002/52019), focusing on querying large datasets with BigQuery, network analysis with Python, and distributed data processing with Apache Spark.
bigquery community-detection data-cleaning dataframe exploratory-data-analysis pagerank rdd sql text-analysis visualization
Last synced: 21 Jan 2025
https://github.com/karencofre/marketing-segmentacion-en-powerbi
Proyecto prueba de hipótesis en powerbi y python
bigquery google-colab powerbi python sql statsmodels
Last synced: 21 Jan 2025
https://github.com/vidyadnina/other-sql-projects-and-queries
Other SQL projects and queries.
Last synced: 21 Jan 2025
https://github.com/kartikeya443/automated-data-pipeline-gcp
This project showcases the integration of various Google Cloud Platform services to build an efficient and automated data pipeline for sales data.
bigquery cloud data-engineering flask gcp google-cloud-platform looker-studio pipeline python sql
Last synced: 21 Jan 2025
https://github.com/tomgorb/some-data-monitoring
fully functional DAG using Airflow 2 and minikube (locally) to help monitor GCP billing
airflow2 bigquery gcp minikube
Last synced: 21 Jan 2025
https://github.com/vaishnavipaithane/bellabeat-data-analysis-case-study
This capstone project was done as a part of Google Data Analytics Professional Certificate course.
bigquery data-analysis sql tableau
Last synced: 21 Jan 2025
https://github.com/manuelandersen/football-pipeline
DE Zoomcamp 2024 Final Project 🧙
bigquery data-engineering data-lake data-warehouse dbt dbt-cloud etl-pipeline google-cloud looker-studio mageai python
Last synced: 21 Jan 2025
https://github.com/ayresgneto/use-case-gcp-etl
ELT pipeline GCP. Tecnologias utilizadas: Postgresql, GCP Storage, Airflow (local), Pyspark (local), BigQuery
airflow big-data bigquery data data-engineering etl gcp pipeline postgresql programming-oriented-object pyspark python spark
Last synced: 21 Jan 2025
https://github.com/pratshrestha/cochin-traders---sql--sales-analysis
Cochin Traders imports and exports specialty foods globally. This project analyzes sales and operational data to enhance business efficiency, supply chain management, and sales performance. Key areas of focus include
bigquery customer-engagement employee-performance inventory-management sales-trends sql
Last synced: 21 Jan 2025
https://github.com/vidyadnina/cyclistic-sql-tableau-project
Trip data analysis for a bike-sharing service company using SQL and Tableau.
bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql
Last synced: 21 Jan 2025
https://github.com/shaheerazam-dev/cyclistic-case-study-google-data-analytics-certificate
This case study simulates the real-world experience of a junior data analyst at Cyclistic, a fictional company. We will leverage the data analysis process framework (Ask, Prepare, Process, Analyze, Share, Act) to address critical business questions and provide data-driven insights to guide strategic decision-making.
bigquery data-science data-visualization spreadsheet sql tableau
Last synced: 21 Jan 2025
https://github.com/anpandu/ps2bq
Stream insert GCP PubSub messages into BigQuery table.
Last synced: 21 Jan 2025
https://github.com/nghiant3110/firebase_6
This is a DA project based on the Firebase Sample dataset on Big Query
bigquery firebase looker-studio sql
Last synced: 21 Jan 2025
https://github.com/lawal-hash/olistelt
An end-to-end ELT data pipeline of the Brazilian olist e-commerce dataset using the modern data stack
airflow bigquery dbt dbt-core docker postgresql sql
Last synced: 21 Jan 2025
https://github.com/plishka/blockchain_analysis
Cryptocurrency On-Chain Analysis (Bitcoin Blockchain)
bigquery blockchain data-cleaning scraping-websites sql tableau
Last synced: 21 Jan 2025
https://github.com/cyber-programmer/web-traffic-analytics-ml-model
This Jupyter Notebook focuses on classifying website visitors using logistic regression. The project leverages Google Analytics sample data and BigQuery for data analysis and feature engineering. It provides a comprehensive workflow that includes data import, preprocessing, exploratory data analysis.
bigquery logistic-regression machine-learning
Last synced: 21 Jan 2025
https://github.com/kmohamedalie/bigquery-intro
Coursera BigQuery Introduction using Covid19 dataset
bigquery coursera covid-19 datavisualization looker-studio sql
Last synced: 21 Jan 2025
https://github.com/rubnsbarbosa/nasa-asteroids-extractor
ETL asteroids data extractor using some Google Cloud services
bigquery bucket cloud-storage google-cloud nasa-api-neows
Last synced: 21 Jan 2025
https://github.com/greatwoman23/car_insurance_analysis
The Car Insurance Analysis project aims to provide a comprehensive examination of a car insurance portfolio using advanced data analytics tools. The analysis offers valuable insights into policy demographics, claims patterns, and financial metrics, helping stakeholders make informed decisions.
bigquery data data-science dataanalytics insurance-claims looker-studio tableau
Last synced: 21 Jan 2025
https://github.com/marcopellegrinoit/web-traffic-time-series-predictions
Forecast Web Traffic Demand Time Series with ARIMA+ BigQuery and Looker Studio. Addionatel modeling available with ARIMA, LSTM, and Facebook Prophet.
arima bigquery gcp lstm prophet-model time-series vertex-ai
Last synced: 21 Jan 2025
https://github.com/denny-b-justin/purdue
The internship was broadly to understand if the topics/events are being covered differently in the different countries and how they affect stock market returns. The provided dataset is a post-processed set of news articles, so already reflects topic modelling and sentiment analysis.
big-data bigquery finance gdelt-events python
Last synced: 21 Jan 2025
https://github.com/nlgtuankiet/bq-noti
BigQuery notification
bigquery bq notification notifier
Last synced: 21 Jan 2025
https://github.com/arhea/go-mock-bigquery
Creates a mock BigQuery client based on the bigquery-emulator for testing in Golang projects.
bigquery golang golang-module google-bigquery google-cloud-platform testcontainers-go testing
Last synced: 21 Jan 2025
https://github.com/mdornseif/datastore-to-bigquery
The missing Data Transfer Tool: Dump Google Cloud Datastore contents and load them into BigQuery.
backup bigquery bigquery-backup cloud datastore google
Last synced: 21 Jan 2025
https://github.com/walterowisk/sql-learn
SQL Learning
beekeeper bigquery data-analytics dbeaver mysql sql
Last synced: 15 Jan 2025
https://github.com/hayashi-yudai/cloudfunc_login
Example of authentication function for login with Cloud Functions and BigQuery
bigquery gcp-cloud-functions golang server
Last synced: 15 Jan 2025
https://github.com/patriciavalentine/loan-data-queries
In this project, I analyzed a vehicle loan dataset using BigQuery to identify demographic, financial, and loan patterns. Through SQL queries, I extracted insights such as the credit scores, and loan distribution by region, and explored high-risk profiles. The findings are visualized in Looker Studio, thus helping to inform strategic decisions.
asset-finance bigquery loan-data looker-studio
Last synced: 04 Feb 2025
https://github.com/akansharajput280799/strategic-analysis-of-retail-brand-in-south-america-using-sql
Leveraged Big Query and MySQL to analyze 100K records for sales optimization, trend identification, and enhancing customer satisfaction for a retail brand in South America and to provide insights and recommendations to improve their userbase and improve their services
bigquery data-analysis data-science database database-schema google-bigquery mysql-server sql
Last synced: 04 Feb 2025
https://github.com/janmin123/cyclistic
Capstone project for Google/Coursera Data Analytics Course
analysis bigquery sql tableau visualization
Last synced: 04 Feb 2025
https://github.com/martinkalema/bigquery-pubsub
Loading data into BigQuery Table
bigquery data-engineering flat-file kafka
Last synced: 11 Jan 2025
https://github.com/alessio-siciliano/bigquery-utils
A utility library that enhances the official BigQuery Python client with additional tools for query management, data processing, and automation, making it easier to work efficiently with Google BigQuery.
bigquery datatransfer google-cloud python
Last synced: 28 Jan 2025
https://github.com/crudek-data/bigquery-kaggle-apis
kaggle api to download free datasets along with google bigquery api to read/write from cloud data warehouse
bigquery data-engineering kaggle
Last synced: 22 Jan 2025
https://github.com/nikhilsree5/targetcasestudy
An exploratory and in-depth study of the e-commerce market in Brazil.
bigquery eda sql visualization
Last synced: 22 Jan 2025
https://github.com/coatless/bigquery-reddit-ask-your-advisor
Analysis code that counts instances of a phrase on Reddit (e.g. "ask your advisor")
ask-your-advisor bigquery r reddit
Last synced: 16 Jan 2025
https://github.com/toskpl/googlecloud
Challnege 30 days - GoogleCloud
bigquery google-cloud google-cloud-platform ml
Last synced: 14 Nov 2024
https://github.com/chukwuemekaaham/ny_taxi_rides
Analytics engineering using Dbt and Google Cloud BigQuery
analytics-engineering bigquery dbt github
Last synced: 10 Jan 2025
https://github.com/abdullahasghar/sql
The repo includes all projects and assessments I have completed with SQL. IDE/s used: MS SQL Server, Google Big Query.
Last synced: 18 Jan 2025
https://github.com/markjamesbutler/dbt-fundamentals-bigquery
Implementation of dbt fundamentals training course material using BigQuery.
bigquery dbt dbt-fundamentals fundamentals jinja2 practice-tasks sql
Last synced: 16 Jan 2025
https://github.com/garbetjie/monolog-bigquery-handler
A simple Monolog handler for writing to BigQuery.
bigquery logging monolog monolog-handler
Last synced: 16 Jan 2025
https://github.com/ngangawairimu/sales-analysis-and-customer-insights
This project features SQL queries for detailed customer and sales analysis:Customer Analysis and Sales Reporting
bigquery bigquery-dataset excel sql
Last synced: 28 Jan 2025
https://github.com/pittica/google-bigquery-helpers
Helpers for Google Cloud BigQuery.
bigquery gcp google-cloud-platform pittica
Last synced: 13 Nov 2024
https://github.com/lambdamusic/dimschema
CLI to retrieve SQL schema information about the Dimensions on Google BigQuery dataset.
bigquery dimensions python scholarly-metadata
Last synced: 12 Jan 2025
https://github.com/oleksiilatypov/google_cloud
AI & Data, Google Cloud Skills Boost
bigquery document-ai ml vertexai
Last synced: 18 Jan 2025
https://github.com/karencofre/riesgorelativo-lookerstudio
proyecto de análisis de datos y análisis perdicitvo en looker studio y google colab
bigquery data-analysis data-science machine-learning matplotlib python sklearn sql
Last synced: 22 Jan 2025
https://github.com/zenklinov/correlation-nybikers-with-weather-using-bigquery
Last synced: 22 Jan 2025
https://github.com/zborovskaanna/e-commerce-web-events-analysis
SQL project based on the Big Query public database 'The Look e-Commerce' and a dashboard in Looker Studio
analysis bigquery dashboard data-visualization looker-studio sql
Last synced: 22 Jan 2025
https://github.com/azapeti/bigquery-python-bash-automation
Since you're using the free version, you can only get data from your website through the Google Analytics API for the last 60 days. I would like to demonstrate in this repository how to run BigQuery queries in Python and automate it using bash and crontab for collecting historical data.
analytics automation bash bigquery cronjob crontab ga4 python python3
Last synced: 22 Jan 2025
https://github.com/robinnoiret/importcsv_zendeskbigquery
This project involves developing a Python script to import csv export from Zendesk to BigQuery. It is not intended for recurring use, but to enable an initial dump of historical data.
bigquery connector export-csvfile json zendesk
Last synced: 22 Jan 2025
https://github.com/acardosolima/crypto-ethereum-tokens
This project aims to create a data pipeline using Airflow to ingest dataset from Google Bigquery to a PostgreSQL database. This stack will run in a local environment using Kubernetes.
airflow bigquery postgresql python
Last synced: 22 Jan 2025
https://github.com/andre-gitdev/stocks-functions
This project is for EDA related to stock trading.
alpaca alpaca-trading-api bigquery google-cloud portfolio-optimization robinhood-api robinhood-portfolio stock-analysis stock-data stock-price-prediction stocks-api stocks-trading
Last synced: 22 Jan 2025
https://github.com/lisabensoussan/bigdata_midterm
This project focuses on analyzing Stack Overflow data related to JavaScript and Python questions using a combination of SQL queries (Google BigQuery) and Unix shell commands. The aim is to explore trends, activity patterns, and user behavior around these popular programming languages through data wrangling and querying techniques.
bigquery data-cleaning sql unix-command unix-shell
Last synced: 22 Jan 2025
https://github.com/panagiotischaviaropoulos/google-data-analytics-case-study
bigquery data-visualization sql
Last synced: 22 Jan 2025
https://github.com/thecodersstudio/node-native-test-runner
Code samples and test cases showcasing the power of Node.js's native test runner for streamlined and efficient testing.
bigquery mock nodejs nodejs-test nodenativetestrunner test
Last synced: 22 Jan 2025
https://github.com/noospheracr/twilio-segment-configs
Integration of Twilio Segment with Google BigQuery, Looker/PowerBI, and Google VertexAI to create a data-driven marketing platform
bigquery google-cloud-platform looker-studio marketing noosphera power-bi twilio-segment vertex-ai
Last synced: 22 Jan 2025
https://github.com/thanhloc81/customer-segmentation
✨ Analyze customer segments of Adventure World dataset
bigquery google-cloud powerbi sql
Last synced: 22 Jan 2025
https://github.com/jasontanx/mas-international-arrivals
Code repository about international arrivals into Malaysia
bigquery data-analytics data-engineering etl-pipeline international-arrivals
Last synced: 22 Jan 2025
https://github.com/hanif-syazul/analyzing-kimia-farma-sales-performance-with-gcp
This repository contains the final project for the Rakamin Big Data Analytics Internship. It include a complete dashboard of Kimia Farma's sales performance analysis from 2020 to 2023.
big-data-analytics bigquery internship-project kimia-farma looker-studio rakamin sql
Last synced: 22 Jan 2025
https://github.com/zeinhasan/etl-using-airflow
Extract Transform Load Using Airflow
Last synced: 22 Jan 2025