Projects in Awesome Lists tagged with dbt
A curated list of projects in awesome lists tagged with dbt .
https://github.com/datatalksclub/data-engineering-zoomcamp
Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.
data-engineering dbt docker kafka kestra spark
Last synced: 09 Sep 2025
https://github.com/DataTalksClub/data-engineering-zoomcamp
Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.
data-engineering dbt docker kafka kestra spark
Last synced: 14 Mar 2025
https://github.com/mage-ai/mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
artificial-intelligence data data-engineering data-integration data-pipelines data-science dbt elt etl machine-learning orchestration pipeline pipelines python reverse-etl spark sql transformation
Last synced: 21 Jan 2026
https://github.com/open-metadata/openmetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake
Last synced: 22 Feb 2026
https://github.com/lightdash/lightdash
Agentic BI. Analytics at the speed of code ⚡️
business-intelligence data-analytics data-visualization dbt
Last synced: 09 Jun 2026
https://github.com/open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake
Last synced: 15 Mar 2025
https://github.com/evidence-dev/evidence
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
analytics business-intelligence dashboard data-engineering data-science data-visualization dbt duckdb exploratory-data-analysis self-hosted sql svelte tailwindcss webassembly
Last synced: 06 Feb 2026
https://github.com/datafold/data-diff
Compare tables within or across databases
data data-diffing data-engineering data-quality data-quality-monitoring data-science database databricks-sql dataengineering dataquality dbt mysql oracle-database postgres postgresql python rdbms snowflake sql trino
Last synced: 24 Mar 2025
https://github.com/tobikodata/sqlmesh
Scalable and efficient data transformation framework - backwards compatible with dbt.
dataengineering dataops dbt elt etl python sql transformation
Last synced: 21 Jan 2026
https://github.com/TobikoData/sqlmesh
Efficient data transformation and modeling framework that is backwards compatible with dbt.
dataengineering dataops dbt elt etl python sql transformation
Last synced: 26 Mar 2025
https://github.com/sodadata/soda-core
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
data-contracts data-engineering data-governance data-monitoring data-observability data-profiling data-quality data-quality-checks data-quality-monitoring data-quality-testing data-reliability data-testing data-unit-tests data-validation dataquality datatesting dbt pipeline-testing python snowflake
Last synced: 14 May 2025
https://github.com/elementary-data/elementary
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
analytics-engineer bigquery data-analysis data-governance data-lineage data-observability data-pipeline data-pipelines data-reliability data-warehouse dataops dbt dbt-artifacts dbt-packages lineage redshift snowflake
Last synced: 19 May 2026
https://github.com/multiwoven/multiwoven
🔥🔥🔥 Open source composable CDP - alternative to hightouch and census.
bigquery cdp customer-data-platform data-activation data-engineering data-pipeline data-warehouse databricks dbt etl hacktoberfest open-source postresql react redshift reverse-etl ruby self-hosted snowflake typescript
Last synced: 13 May 2025
https://github.com/Multiwoven/multiwoven
🔥🔥🔥 Open source composable CDP - alternative to hightouch and census.
bigquery cdp customer-data-platform data-activation data-engineering data-pipeline data-warehouse databricks dbt etl hacktoberfest open-source postresql react redshift reverse-etl ruby self-hosted snowflake typescript
Last synced: 01 Apr 2025
https://github.com/re-data/re-data
re_data - fix data issues before your users & CEO would discover them 😊
data-analysis data-monitoring data-observability data-quality data-quality-checks data-quality-monitoring data-reliability data-testing dataquality dbt dbt-packages open-source-tooling
Last synced: 14 May 2025
https://github.com/calogica/dbt-expectations
Port(ish) of Great Expectations to dbt test macros
Last synced: 14 May 2025
https://github.com/astronomer/astronomer-cosmos
Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code
airflow airflow-operators apache-airflow dbt python workflow
Last synced: 29 Jan 2026
https://github.com/fal-ai/dbt-fal
do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
analytics data-modeling dbt machine-learning machinelearning pandas python
Last synced: 18 Jul 2025
https://github.com/kuwala-io/kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demographics data b) Point of Interests from Open Street Map c) Google Popular Times
admin-boundaries data data-integration data-science dbt elt google-trends jupyter kuwala no-code open-data open-source population postgres pyspark python react react-flow scraping spatial-analysis
Last synced: 30 Mar 2025
https://github.com/dbt-checkpoint/dbt-checkpoint
:fishing_pole_and_fish: List of `pre-commit` hooks to ensure the quality of your `dbt` projects.
bigquery business-intelligence dbt pre-commit pre-commit-hook quality-assurance snowflake sql
Last synced: 06 Feb 2026
https://github.com/gouline/dbt-metabase
dbt + Metabase integration
analytics business-intelligence data data-modelling dbt elt metabase pypa python vizualisation
Last synced: 19 Apr 2026
https://github.com/altimateai/vscode-dbt-power-user
This extension makes vscode seamlessly work with dbt™: Auto-complete, preview, column lineage, AI docs generation, health checks, cost estimation etc
ai dbt dbt-cloud dbt-core genai vscode vscode-extension
Last synced: 15 Apr 2026
https://github.com/z3z1ma/dbt-osmosis
Provides automated YAML management and a streamlit workbench. Designed to optimize dev workflows.
cli data dbt documentation editor modelling sql testing
Last synced: 14 May 2025
https://github.com/ankurchavda/streamify
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
airflow data-engineering dbt gcp kafka python spark
Last synced: 19 Jul 2025
https://github.com/AltimateAI/vscode-dbt-power-user
This extension makes vscode seamlessly work with dbt™: Auto-complete, preview, column lineage, AI docs generation, health checks, cost estimation etc
ai dbt dbt-cloud dbt-core genai vscode vscode-extension
Last synced: 02 May 2025
https://github.com/infuseai/piperider
Code review for data in dbt
code-review continuous-integration data-exploration data-observability data-pipeline data-profiler data-profiling data-quality data-reliability data-science data-testing data-visualization dbt dbt-metrics eda exploratory-data-analysis pull-requests python reporting
Last synced: 10 Apr 2025
https://github.com/tconbeer/sqlfmt
sqlfmt formats your dbt SQL files so you don't have to
Last synced: 09 Oct 2025
https://github.com/InfuseAI/piperider
Code review for data in dbt
code-review continuous-integration data-exploration data-observability data-pipeline data-profiler data-profiling data-quality data-reliability data-science data-testing data-visualization dbt dbt-metrics eda exploratory-data-analysis pull-requests python reporting
Last synced: 18 Apr 2025
https://github.com/dbt-labs/dbt-mcp
A MCP (Model Context Protocol) server for interacting with dbt.
data-analytics data-engineering dbt llm mcp mcp-server model-context-protocol
Last synced: 14 Apr 2026
https://github.com/Datavault-UK/automate-dv
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
data-vault dataengineering datalake datavault datavault20 datawarehouse datawarehousing dbt elt etl metadata snowflake sql
Last synced: 13 May 2025
https://github.com/datarecce/recce
The data-validation toolkit for enhanced dbt (data build tool) PR review
analytics-engineering data data-engineering data-validation dataops dbt elt
Last synced: 14 Apr 2026
https://github.com/elementary-data/dbt-data-reliability
dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
analytics analytics-engineering data data-lineage data-observability data-pipeline-monitoring data-pipelines data-reliability dbt dbt-artifacts dbt-packages dbt-tests
Last synced: 16 May 2025
https://github.com/gocardless/airflow-dbt
Apache Airflow integration for dbt
Last synced: 29 Jul 2025
https://github.com/faros-ai/faros-community-edition
BI, API and Automation layer for your Engineering Operations data
airbyte analytics business-analytics business-intelligence cicd dbt developer-productivity devops dora engineering-operations engops engprod faros hasura incident-management metrics sdlc space tasks-management version-control
Last synced: 12 Jun 2025
https://github.com/brooklyn-data/dbt_artifacts
A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts
Last synced: 05 May 2025
https://github.com/tellery/tellery
Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
analytics bigquery business-intelligence collaboration dashboard data-analytics data-modeling data-science data-visualization database dbt notebook self-hosted sql
Last synced: 16 May 2025
https://github.com/beehive-lab/mambo
A low-overhead dynamic binary instrumentation and modification tool for ARM (both AArch32 and AArch64 support) and RISC-V (RV64GC).
binary-analysis dbi dbm dbt instrumentation
Last synced: 29 Dec 2025
https://github.com/data-drift/data-drift
Metrics Observability & Troubleshooting
analytics bigquery context data-diffing data-governance data-lineage data-monitoring data-observability data-quality data-reliability data-version-control dbt dbt-metrics dbt-packages drill-down metrics reconciliation redshift semantic-layer snowflake
Last synced: 08 Oct 2025
https://github.com/metriql/metriql
The metrics layer for your data. Join us at https://metriql.com/slack
Last synced: 17 Jan 2026
https://github.com/ClickHouse/dbt-clickhouse
The Clickhouse plugin for dbt (data build tool)
Last synced: 03 May 2025
https://github.com/clickhouse/dbt-clickhouse
The Clickhouse plugin for dbt (data build tool)
Last synced: 14 May 2025
https://github.com/tuva-health/tuva
Main repo including core data model, data marts, data quality tests, and terminology sets.
analytics-engineering bigquery data-analytics data-governance data-lineage data-pipelines data-warehouse dbt dbt-packages healthcare healthcare-analysis healthcare-data open-source redshift snowflake sql terminology
Last synced: 06 Feb 2026
https://github.com/dbt-labs/dbt-athena
The athena adapter plugin for dbt (https://getdbt.com)
athena dbt dbt-athena dbt-athena-community glue-catalog iceberg s3
Last synced: 16 May 2025
https://github.com/get-select/dbt-snowflake-monitoring
A dbt package from SELECT to help you monitor Snowflake performance and costs
Last synced: 28 Jan 2026
https://github.com/dbt-labs/jaffle-shop
🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.
analytics analytics-engineering data data-engineering dbt dbt-cloud
Last synced: 28 Jan 2026
https://github.com/dbt-msft/dbt-sqlserver
dbt adapter for SQL Server and Azure SQL
analytics-engineering azure-sql azure-sql-db data-engineering dbt dbt-sqlserver microsoft microsoft-sql-server mssql sql sql-server t-sql transact-sql tsql
Last synced: 14 May 2025
https://github.com/dcaribou/transfermarkt-datasets
⚽️ Extract, prepare and publish Transfermarkt datasets.
analytics dataset dbt football football-data soccer-analytics
Last synced: 02 Apr 2025
https://github.com/GokuMohandas/data-engineering
Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
airflow data-engineering data-warehouse dbt etl machine-learning mlops orchestration
Last synced: 11 Sep 2025
https://github.com/cnstlungu/portable-data-stack-dagster
A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset
business-intelligence dagster data-engineering data-visualization dbt duckdb python superset
Last synced: 01 May 2025
https://github.com/gokumohandas/data-engineering
Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
airflow data-engineering data-warehouse dbt etl machine-learning mlops orchestration
Last synced: 12 May 2025
https://github.com/jacopotagliabue/post-modern-stack
Joining the modern data stack with the modern ML stack
dbt metaflow mlops recommender-system sagemaker
Last synced: 17 May 2026
https://github.com/tomasfarias/airflow-dbt-python
A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
airflow airflow-hook airflow-operators analytics data-engineering dbt python python3
Last synced: 15 May 2025
https://github.com/lightdash/dbt2looker
Generate lookml for views from dbt models
Last synced: 14 Dec 2025
https://github.com/omnata-labs/dbt-ml-preprocessing
A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.
bigquery dbt redshift scikit-learn snowflake
Last synced: 07 May 2025
https://github.com/slidoapp/dbt-coverage
One-stop-shop for docs and test coverage of dbt projects.
cli coverage dbt docs documentation python testing tests tool
Last synced: 29 Dec 2025
https://github.com/davidzajac1/zillacode
Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake
aws coding-interview dbt docker github-actions leetcode pandas pyspark python react snowflake spark terraform
Last synced: 04 Apr 2025
https://github.com/pragunbhutani/dbt-llm-agent
LLM based AI Agent to automate Data Analysis for dbt projects with remote MCP server
agent agentic-ai ai ai-data-analysis data-analysis data-analyst dbt llm text-to-sql
Last synced: 19 Jan 2026
https://github.com/buremba/universql
The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆
databricks dbt duckdb proxy-server snowflake sql sql-proxy sqlglot
Last synced: 12 Apr 2025
https://github.com/josephmachado/simple_dbt_project
Code for dbt tutorial
analytics datawarehouse dbt postgres tutorial
Last synced: 15 Apr 2025
https://github.com/fivetran/dbt_ad_reporting
Fivetran's ad reporting dbt package. Combine your Facebook, Google, Pinterest, LinkedIn, Twitter, Snapchat, Microsoft, TikTok, Reddit, Amazon, and Apple Search advertising metrics using this package.
ad-reporting adwords amazon-ads apple-search-ads dbt dbt-packages facebook fivetran google-ads linkedin microsoft-ads pinterest reddit-ads snapchat twitter
Last synced: 13 Apr 2025
https://github.com/bitpicky/dbt-sugar
dbt-sugar is a CLI tool that allows users of dbt to have fun and ease performing actions around dbt models
data-engineering dbt dbt-sugar documentation
Last synced: 14 Jan 2026
https://github.com/ScalefreeCOM/datavault4dbt
Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including the Staging Area, DV2.0 main entities, PITs and Snapshot Tables.
azure-synapse bigquery datavault dbt dbt-packages exasol google-bigquery hubs links pits postgresql redshift satellites scalefree snapshots snowflake sourcemarts stagingarea
Last synced: 05 May 2025
https://github.com/valmi-io/valmi-activation
⚡ valmi.io reverse ETL (data activation) is the open source ( OSS ) data activation platform to load data from warehouses into Webhooks and SaaS tools like Klaviyo, Facebook Ads, Salesforce, Braze etc. Valmi.io Customer Data Platform (CDP) helps track and ingest user activity events from websites, shopify, serverside events. https://cloud.valmi.io
airbyte cdp composable-cdp dagster dbt duckdb ecommerce email-marketing etl event-ingestion event-tracking marketing-automation modern-data-stack open-source push-notifications reverse-etl shopify shopify-app shopify-events user-behavior
Last synced: 01 May 2025
https://github.com/scalefreecom/datavault4dbt
Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including the Staging Area, DV2.0 main entities, PITs and Snapshot Tables.
azure-synapse bigquery datavault dbt dbt-packages exasol google-bigquery hubs links pits postgresql redshift satellites scalefree snapshots snowflake sourcemarts stagingarea
Last synced: 20 Feb 2026
https://github.com/emilyriederer/dbtplyr
dbt package mimicking dplyr select-helpers semantics
Last synced: 25 Feb 2026
https://github.com/dbt-labs/jaffle_shop_duckdb
Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!
Last synced: 07 Apr 2025
https://github.com/nicholasyager/dbt-loom
A dbt-core plugin to weave together multi-project dbt-core deployments
Last synced: 24 Jan 2026
https://github.com/wvlet/wvlet
A flow-style query language for SQL engines
dbt duckdb query-language sql trino
Last synced: 28 Jan 2026
https://github.com/slidoapp/dbt-superset-lineage
Make dbt docs and Apache Superset talk to one another
cli data-lineage dbt lineage superset tool
Last synced: 30 Dec 2025
https://github.com/ftupas/dbt-spotify-analytics
Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase
dbt docker-containers metabase postgres spotify-data
Last synced: 15 Apr 2025
https://github.com/tnightengale/dbt-meta-testing
A dbt SQL package for ensuring documentation and test coverage, with granular control.
dbt dbt-artifacts dbt-fundamentals dbt-macros dbt-packages dbt-tests testing
Last synced: 05 May 2025
https://github.com/sungchun12/airflow-toolkit
Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested data pipelines(DAGs) :desktop_computer: >> [ :rocket:, :ship: ]
actions airflow airflow-environments airflow-toolkit cloud cloud-composer composer dbt docker gcp google-cloud hcl kubernetes kubernetes-deployment python python3 shell-script terraform terragrunt terragrunt-deployment
Last synced: 01 Aug 2025
https://github.com/dbt-labs/terraform-provider-dbtcloud
dbt Cloud Terraform Provider
dbt dbt-cloud terraform terraform-provider
Last synced: 14 Apr 2026
https://github.com/dbt-labs/dbt-meshify
A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.
Last synced: 09 Apr 2025
https://github.com/prql/dbt-prql
dbt-prql allows writing PRQL in dbt models
Last synced: 09 Apr 2025
https://github.com/trannhatnguyen2/nyc_taxi_data_pipeline
Nyc_Taxi_Data_Pipeline - DE Project
airflow dbt debezium docker great-expectations kafka minio postgresql spark trino
Last synced: 06 Apr 2025
https://github.com/anna-geller/prefect-dataplatform
Example repository showing how to build a data platform with Prefect, dbt and Snowflake
analytics analytics-engineering automation data-engineering data-platform data-warehousing dataflow dataflow-ops dbt orchestration prefect python snowflake sql
Last synced: 25 Oct 2025
https://github.com/re-data/dbt-re-data
re_data - fix data issues before your users & CEO would discover them 😊
data-monitoring data-observability data-quality data-testing dbt dbt-packages sql
Last synced: 07 Apr 2025
https://github.com/autotraderuk/dbt-dry-run
Dry run capability for dbt projects using BigQuery
Last synced: 16 May 2025
https://github.com/star3am/hashiqube
HashiQube - The Ultimate Hands on DevOps Lab running All the HashiCorp Products in a Github Codespace or a Docker Container using Vagrant or Docker Compose
airflow ansible ansible-project boundary consul dbt devops gitlab hashicorp hashiqube jenkins minikube nomad packer terraform vagrant vault waypoint
Last synced: 14 Aug 2025
https://github.com/getindata/dbt-flink-adapter
Adapter for dbt that executes dbt pipelines on Apache Flink
apache-flink data-streaming dbt streaming-analytics
Last synced: 07 May 2025
https://github.com/godatadriven/dbt-excel
[DEPRECATED] A dbt adapter for Excel.
Last synced: 14 Dec 2025
https://github.com/yu-iskw/dbt-artifacts-parser
A dbt artifacts parser in python
dbt dbt-artifacts parser pypi-package python python-package
Last synced: 16 May 2025
https://github.com/runprism/prism
Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.
bigquery data data-analysis data-engineering data-integration data-orc data-science dbt etl etl-pipeline machine-learning orchestration pipeline postgres python redshift snowflake trino
Last synced: 19 May 2026
https://github.com/codeforkjeff/dbt-sqlite
A SQLite adapter plugin for dbt (data build tool)
Last synced: 31 Jan 2026
https://github.com/PrefectHQ/prefect-dbt
Collection of Prefect integrations for working with dbt with your Prefect flows.
Last synced: 05 May 2025
https://github.com/GClunies/Reflekt
Define, govern, and model event data for warehouse-first product analytics.
avo customer-data-platform data-modeling data-quality data-warehouse dbt dbt-package events governance product-analytics schema-registry segment segment-protocols
Last synced: 02 Sep 2025
https://github.com/gclunies/reflekt
Define, govern, and model event data for warehouse-first product analytics.
avo customer-data-platform data-modeling data-quality data-warehouse dbt dbt-package events governance product-analytics schema-registry segment segment-protocols
Last synced: 12 Apr 2025
https://github.com/dashlane/dbt-invoke
A CLI for creating, updating, and deleting dbt property files
Last synced: 19 Jun 2025
https://github.com/Dashlane/dbt-invoke
A CLI for creating, updating, and deleting dbt property files
Last synced: 05 May 2025