An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with dbt

A curated list of projects in awesome lists tagged with dbt .

https://github.com/datatalksclub/data-engineering-zoomcamp

Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.

data-engineering dbt docker kafka kestra spark

Last synced: 09 Sep 2025

https://github.com/DataTalksClub/data-engineering-zoomcamp

Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.

data-engineering dbt docker kafka kestra spark

Last synced: 14 Mar 2025

https://github.com/apache/doris

Apache Doris is an easy-to-use, high performance and unified analytics database.

bigquery database dbt delta-lake elt etl hadoop hive hudi iceberg lakehouse olap query-engine real-time redshift snowflake spark sql

Last synced: 30 Jan 2026

https://github.com/open-metadata/openmetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake

Last synced: 22 Feb 2026

https://github.com/lightdash/lightdash

Agentic BI. Analytics at the speed of code ⚡️

business-intelligence data-analytics data-visualization dbt

Last synced: 09 Jun 2026

https://github.com/open-metadata/OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake

Last synced: 15 Mar 2025

https://github.com/evidence-dev/evidence

Business intelligence as code: build fast, interactive data visualizations in SQL and markdown

analytics business-intelligence dashboard data-engineering data-science data-visualization dbt duckdb exploratory-data-analysis self-hosted sql svelte tailwindcss webassembly

Last synced: 06 Feb 2026

https://github.com/tobikodata/sqlmesh

Scalable and efficient data transformation framework - backwards compatible with dbt.

dataengineering dataops dbt elt etl python sql transformation

Last synced: 21 Jan 2026

https://github.com/TobikoData/sqlmesh

Efficient data transformation and modeling framework that is backwards compatible with dbt.

dataengineering dataops dbt elt etl python sql transformation

Last synced: 26 Mar 2025

https://github.com/elementary-data/elementary

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

analytics-engineer bigquery data-analysis data-governance data-lineage data-observability data-pipeline data-pipelines data-reliability data-warehouse dataops dbt dbt-artifacts dbt-packages lineage redshift snowflake

Last synced: 19 May 2026

https://github.com/duckdb/dbt-duckdb

dbt adapter for DuckDB

dbt duckdb

Last synced: 18 Feb 2026

https://github.com/calogica/dbt-expectations

Port(ish) of Great Expectations to dbt test macros

dbt

Last synced: 14 May 2025

https://github.com/astronomer/astronomer-cosmos

Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code

airflow airflow-operators apache-airflow dbt python workflow

Last synced: 29 Jan 2026

https://github.com/fal-ai/dbt-fal

do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.

analytics data-modeling dbt machine-learning machinelearning pandas python

Last synced: 18 Jul 2025

https://github.com/kuwala-io/kuwala

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demographics data b) Point of Interests from Open Street Map c) Google Popular Times

admin-boundaries data data-integration data-science dbt elt google-trends jupyter kuwala no-code open-data open-source population postgres pyspark python react react-flow scraping spatial-analysis

Last synced: 30 Mar 2025

https://github.com/dbt-checkpoint/dbt-checkpoint

:fishing_pole_and_fish: List of `pre-commit` hooks to ensure the quality of your `dbt` projects.

bigquery business-intelligence dbt pre-commit pre-commit-hook quality-assurance snowflake sql

Last synced: 06 Feb 2026

https://github.com/altimateai/vscode-dbt-power-user

This extension makes vscode seamlessly work with dbt™: Auto-complete, preview, column lineage, AI docs generation, health checks, cost estimation etc

ai dbt dbt-cloud dbt-core genai vscode vscode-extension

Last synced: 15 Apr 2026

https://github.com/z3z1ma/dbt-osmosis

Provides automated YAML management and a streamlit workbench. Designed to optimize dev workflows.

cli data dbt documentation editor modelling sql testing

Last synced: 14 May 2025

https://github.com/ankurchavda/streamify

A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!

airflow data-engineering dbt gcp kafka python spark

Last synced: 19 Jul 2025

https://github.com/AltimateAI/vscode-dbt-power-user

This extension makes vscode seamlessly work with dbt™: Auto-complete, preview, column lineage, AI docs generation, health checks, cost estimation etc

ai dbt dbt-cloud dbt-core genai vscode vscode-extension

Last synced: 02 May 2025

https://github.com/tconbeer/sqlfmt

sqlfmt formats your dbt SQL files so you don't have to

dbt formatter python sql

Last synced: 09 Oct 2025

https://github.com/dbt-labs/dbt-mcp

A MCP (Model Context Protocol) server for interacting with dbt.

data-analytics data-engineering dbt llm mcp mcp-server model-context-protocol

Last synced: 14 Apr 2026

https://github.com/Datavault-UK/automate-dv

A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)

data-vault dataengineering datalake datavault datavault20 datawarehouse datawarehousing dbt elt etl metadata snowflake sql

Last synced: 13 May 2025

https://github.com/datarecce/recce

The data-validation toolkit for enhanced dbt (data build tool) PR review

analytics-engineering data data-engineering data-validation dataops dbt elt

Last synced: 14 Apr 2026

https://github.com/elementary-data/dbt-data-reliability

dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

analytics analytics-engineering data data-lineage data-observability data-pipeline-monitoring data-pipelines data-reliability dbt dbt-artifacts dbt-packages dbt-tests

Last synced: 16 May 2025

https://github.com/zsvoboda/ngods-stocks

New Generation Opensource Data Stack Demo

cube dagster datahub dbt iceberg metabase python spark spark-sql trino trinodb

Last synced: 05 Apr 2025

https://github.com/gocardless/airflow-dbt

Apache Airflow integration for dbt

airflow airflow-dbt dbt

Last synced: 29 Jul 2025

https://github.com/brooklyn-data/dbt_artifacts

A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts

dbt

Last synced: 05 May 2025

https://github.com/tellery/tellery

Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.

analytics bigquery business-intelligence collaboration dashboard data-analytics data-modeling data-science data-visualization database dbt notebook self-hosted sql

Last synced: 16 May 2025

https://github.com/beehive-lab/mambo

A low-overhead dynamic binary instrumentation and modification tool for ARM (both AArch32 and AArch64 support) and RISC-V (RV64GC).

binary-analysis dbi dbm dbt instrumentation

Last synced: 29 Dec 2025

https://github.com/metriql/metriql

The metrics layer for your data. Join us at https://metriql.com/slack

analytics big-data dbt olap

Last synced: 17 Jan 2026

https://github.com/datonic/datadex

📦 Serverless and local-first Open Data Platform

dbt duckdb open-data quarto sql

Last synced: 24 Mar 2025

https://github.com/ClickHouse/dbt-clickhouse

The Clickhouse plugin for dbt (data build tool)

clickhouse database dbt

Last synced: 03 May 2025

https://github.com/clickhouse/dbt-clickhouse

The Clickhouse plugin for dbt (data build tool)

clickhouse database dbt

Last synced: 14 May 2025

https://github.com/databricks/dbt-databricks

A dbt adapter for Databricks.

databricks dbt etl sql

Last synced: 26 Feb 2026

https://github.com/datacoves/dbt-coves

CLI tool for dbt users to simplify creation of staging models (yml and sql) files

analytics bigquery datacoves dbt elt etl jinja python redshift snowflake sql

Last synced: 15 May 2025

https://github.com/dbt-labs/dbt-athena

The athena adapter plugin for dbt (https://getdbt.com)

athena dbt dbt-athena dbt-athena-community glue-catalog iceberg s3

Last synced: 16 May 2025

https://github.com/get-select/dbt-snowflake-monitoring

A dbt package from SELECT to help you monitor Snowflake performance and costs

dbt snowflake

Last synced: 28 Jan 2026

https://github.com/dbt-labs/jaffle-shop

🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.

analytics analytics-engineering data data-engineering dbt dbt-cloud

Last synced: 28 Jan 2026

https://github.com/dcaribou/transfermarkt-datasets

⚽️ Extract, prepare and publish Transfermarkt datasets.

analytics dataset dbt football football-data soccer-analytics

Last synced: 02 Apr 2025

https://github.com/GokuMohandas/data-engineering

Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.

airflow data-engineering data-warehouse dbt etl machine-learning mlops orchestration

Last synced: 11 Sep 2025

https://github.com/cnstlungu/portable-data-stack-dagster

A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset

business-intelligence dagster data-engineering data-visualization dbt duckdb python superset

Last synced: 01 May 2025

https://github.com/gokumohandas/data-engineering

Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.

airflow data-engineering data-warehouse dbt etl machine-learning mlops orchestration

Last synced: 12 May 2025

https://github.com/jacopotagliabue/post-modern-stack

Joining the modern data stack with the modern ML stack

dbt metaflow mlops recommender-system sagemaker

Last synced: 17 May 2026

https://github.com/tomasfarias/airflow-dbt-python

A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.

airflow airflow-hook airflow-operators analytics data-engineering dbt python python3

Last synced: 15 May 2025

https://github.com/lightdash/dbt2looker

Generate lookml for views from dbt models

dbt looker

Last synced: 14 Dec 2025

https://github.com/omnata-labs/dbt-ml-preprocessing

A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.

bigquery dbt redshift scikit-learn snowflake

Last synced: 07 May 2025

https://github.com/slidoapp/dbt-coverage

One-stop-shop for docs and test coverage of dbt projects.

cli coverage dbt docs documentation python testing tests tool

Last synced: 29 Dec 2025

https://github.com/davidzajac1/zillacode

Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake

aws coding-interview dbt docker github-actions leetcode pandas pyspark python react snowflake spark terraform

Last synced: 04 Apr 2025

https://github.com/pragunbhutani/dbt-llm-agent

LLM based AI Agent to automate Data Analysis for dbt projects with remote MCP server

agent agentic-ai ai ai-data-analysis data-analysis data-analyst dbt llm text-to-sql

Last synced: 19 Jan 2026

https://github.com/buremba/universql

The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆

databricks dbt duckdb proxy-server snowflake sql sql-proxy sqlglot

Last synced: 12 Apr 2025

https://github.com/fivetran/dbt_ad_reporting

Fivetran's ad reporting dbt package. Combine your Facebook, Google, Pinterest, LinkedIn, Twitter, Snapchat, Microsoft, TikTok, Reddit, Amazon, and Apple Search advertising metrics using this package.

ad-reporting adwords amazon-ads apple-search-ads dbt dbt-packages facebook fivetran google-ads linkedin microsoft-ads pinterest reddit-ads snapchat twitter

Last synced: 13 Apr 2025

https://github.com/bitpicky/dbt-sugar

dbt-sugar is a CLI tool that allows users of dbt to have fun and ease performing actions around dbt models

data-engineering dbt dbt-sugar documentation

Last synced: 14 Jan 2026

https://github.com/ScalefreeCOM/datavault4dbt

Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including the Staging Area, DV2.0 main entities, PITs and Snapshot Tables.

azure-synapse bigquery datavault dbt dbt-packages exasol google-bigquery hubs links pits postgresql redshift satellites scalefree snapshots snowflake sourcemarts stagingarea

Last synced: 05 May 2025

https://github.com/valmi-io/valmi-activation

⚡ valmi.io reverse ETL (data activation) is the open source ( OSS ) data activation platform to load data from warehouses into Webhooks and SaaS tools like Klaviyo, Facebook Ads, Salesforce, Braze etc. Valmi.io Customer Data Platform (CDP) helps track and ingest user activity events from websites, shopify, serverside events. https://cloud.valmi.io

airbyte cdp composable-cdp dagster dbt duckdb ecommerce email-marketing etl event-ingestion event-tracking marketing-automation modern-data-stack open-source push-notifications reverse-etl shopify shopify-app shopify-events user-behavior

Last synced: 01 May 2025

https://github.com/scalefreecom/datavault4dbt

Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including the Staging Area, DV2.0 main entities, PITs and Snapshot Tables.

azure-synapse bigquery datavault dbt dbt-packages exasol google-bigquery hubs links pits postgresql redshift satellites scalefree snapshots snowflake sourcemarts stagingarea

Last synced: 20 Feb 2026

https://github.com/PicnicSupermarket/dbt-score

Linter for dbt metadata

dbt dbt-core linter metadata python

Last synced: 05 May 2025

https://github.com/picnicsupermarket/dbt-score

Linter for dbt metadata

dbt dbt-core linter metadata python

Last synced: 16 May 2025

https://github.com/emilyriederer/dbtplyr

dbt package mimicking dplyr select-helpers semantics

dbt dplyr macros sql

Last synced: 25 Feb 2026

https://github.com/dbt-labs/jaffle_shop_duckdb

Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!

data dbt duckdb sql

Last synced: 07 Apr 2025

https://github.com/nicholasyager/dbt-loom

A dbt-core plugin to weave together multi-project dbt-core deployments

dbt dbt-core plugin python3

Last synced: 24 Jan 2026

https://github.com/wvlet/wvlet

A flow-style query language for SQL engines

dbt duckdb query-language sql trino

Last synced: 28 Jan 2026

https://github.com/slidoapp/dbt-superset-lineage

Make dbt docs and Apache Superset talk to one another

cli data-lineage dbt lineage superset tool

Last synced: 30 Dec 2025

https://github.com/ftupas/dbt-spotify-analytics

Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase

dbt docker-containers metabase postgres spotify-data

Last synced: 15 Apr 2025

https://github.com/tnightengale/dbt-meta-testing

A dbt SQL package for ensuring documentation and test coverage, with granular control.

dbt dbt-artifacts dbt-fundamentals dbt-macros dbt-packages dbt-tests testing

Last synced: 05 May 2025

https://github.com/sungchun12/airflow-toolkit

Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested data pipelines(DAGs) :desktop_computer: >> [ :rocket:, :ship: ]

actions airflow airflow-environments airflow-toolkit cloud cloud-composer composer dbt docker gcp google-cloud hcl kubernetes kubernetes-deployment python python3 shell-script terraform terragrunt terragrunt-deployment

Last synced: 01 Aug 2025

https://github.com/dbt-labs/dbt-server

A web API for dbt.

dbt

Last synced: 01 May 2025

https://github.com/dbt-labs/dbt-meshify

A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.

data dbt dbt-cloud dbt-core

Last synced: 09 Apr 2025

https://github.com/prql/dbt-prql

dbt-prql allows writing PRQL in dbt models

dbt prql sql

Last synced: 09 Apr 2025

https://github.com/re-data/dbt-re-data

re_data - fix data issues before your users & CEO would discover them 😊

data-monitoring data-observability data-quality data-testing dbt dbt-packages sql

Last synced: 07 Apr 2025

https://github.com/autotraderuk/dbt-dry-run

Dry run capability for dbt projects using BigQuery

bigquery dbt testing

Last synced: 16 May 2025

https://github.com/star3am/hashiqube

HashiQube - The Ultimate Hands on DevOps Lab running All the HashiCorp Products in a Github Codespace or a Docker Container using Vagrant or Docker Compose

airflow ansible ansible-project boundary consul dbt devops gitlab hashicorp hashiqube jenkins minikube nomad packer terraform vagrant vault waypoint

Last synced: 14 Aug 2025

https://github.com/getindata/dbt-flink-adapter

Adapter for dbt that executes dbt pipelines on Apache Flink

apache-flink data-streaming dbt streaming-analytics

Last synced: 07 May 2025

https://github.com/godatadriven/dbt-excel

[DEPRECATED] A dbt adapter for Excel.

dbt deprecated excel obselete

Last synced: 14 Dec 2025

https://github.com/codeforkjeff/dbt-sqlite

A SQLite adapter plugin for dbt (data build tool)

dbt elt etl sqlite

Last synced: 31 Jan 2026

https://github.com/PrefectHQ/prefect-dbt

Collection of Prefect integrations for working with dbt with your Prefect flows.

dbt prefect python

Last synced: 05 May 2025

https://github.com/dashlane/dbt-invoke

A CLI for creating, updating, and deleting dbt property files

cli dbt python

Last synced: 19 Jun 2025

https://github.com/Dashlane/dbt-invoke

A CLI for creating, updating, and deleting dbt property files

cli dbt python

Last synced: 05 May 2025