An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with analytics-engineering

A curated list of projects in awesome lists tagged with analytics-engineering .

https://github.com/StructuredLabs/preswald

Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide, DuckDB, Pandas, and Plotly, Matplotlib, etc. Build dashboards, reports, and notebooks that run offline, load fast, and share like a document.

ai analytics analytics-engineering copilot data data-applications data-infrastructure data-pipelines data-sdk data-visualization gpt llm open-source python schema-management vscode

Last synced: 11 May 2025

https://github.com/structuredlabs/preswald

Preswald is a framework for building and deploying interactive data apps, internal tools, and dashboards with Python. With one command, you can launch, share, and deploy locally or in the cloud, turning Python scripts into powerful shareable apps.

ai analytics analytics-engineering copilot data data-applications data-infrastructure data-pipelines data-sdk data-visualization gpt llm open-source python schema-management vscode

Last synced: 13 May 2025

https://github.com/raystack/optimus

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

airflow analytics analytics-engineering automation bigquery business-intelligence data-modelling data-pipelines data-transformation data-warehouse dataops elt etl golang workflows

Last synced: 16 May 2025

https://github.com/datarecce/recce

The data-validation toolkit for enhanced dbt (data build tool) PR review

analytics-engineering data data-engineering data-validation dataops dbt elt

Last synced: 14 Apr 2026

https://github.com/elementary-data/dbt-data-reliability

dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

analytics analytics-engineering data data-lineage data-observability data-pipeline-monitoring data-pipelines data-reliability dbt dbt-artifacts dbt-packages dbt-tests

Last synced: 16 May 2025

https://github.com/dbt-labs/jaffle-shop

🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.

analytics analytics-engineering data data-engineering dbt dbt-cloud

Last synced: 28 Jan 2026

https://github.com/gmyrianthous/dbt-airflow

A Python package that creates fine-grained dbt tasks on Apache Airflow

airflow analytics-engineering data-engineering dbt python

Last synced: 08 Apr 2025

https://github.com/docglow/docglow

Modern documentation site generator for dbt Core — lineage explorer, health scoring, full-text search. Live demo: https://demo.docglow.com

analytics-engineering data-catalog data-engineering dbt dbt-core documentation lineage

Last synced: 19 Apr 2026

https://github.com/sidequery/sidemantic

A universal metrics layer. Compatible with definitions in LookML, MetricFlow, Cube with DuckDB, Snowflake, Clickhouse, Bigquery & more!

ai analytics analytics-engineering bigquery clickhouse data-engineering duckdb mcp postgresql semantic-layer snowflake spark sql visualization

Last synced: 21 Feb 2026

https://github.com/mundipagg/amora-data-build-tool

Amora Data Build Tool enables analysts and engineers to transform data on the data warehouse (BigQuery) by writing Amora Models that describe the data schema using Python's "PEP484 - Type Hints" and select statements with SQLAlchemy. Amora is able to transform Python code into SQL data transformation jobs that run inside the warehouse.

analytics analytics-dashboard analytics-engineering bigquery business-intelligence data-engineering data-modeling datacleaning dataquality elt machine-learning python transformation

Last synced: 08 Sep 2025

https://github.com/gmyrianthous/dbt-dummy

A dbt (data build tool) project you can use for testing purposes or experimentation

analytics-engineering data-engineering dbt docker sql

Last synced: 10 Jun 2025

https://github.com/dbt-labs/jaffle-shop-generator

🥪🏭 A simple CLI for generating synthetic Jaffle Shop data.

analytics-engineering faker synthetic-data synthetic-data-generator

Last synced: 01 May 2025

https://github.com/mattiasthalen/arcane-insight

Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and attributes, this project reveals detailed insights into card mechanics, strengths, and trends to support BI and strategic analysis.

analytics-engineering data-engineering data-vault data-warehouse duckdb elt etl hearthstone medallion-architecture sqlmesh

Last synced: 11 Feb 2026

https://github.com/montara-io/dbt-command-center

Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.

analytics-engineering bigquery data-analysis data-catalog data-engineering data-lineage data-observability data-pipeline data-pipelines data-validation data-warehouse dataops dbt dbt-packages elt etl orchestration python redshift

Last synced: 05 May 2025

https://github.com/kestra-io/examples

Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Services

analytics-engineering automation data-engineering data-orchestration data-pipelines data-workflows orchestration

Last synced: 09 Oct 2025

https://github.com/zkan/getting-started-with-analytics-engineering

Getting Started with Analytics Engineering

analytics-engineering dbt postgresql sql

Last synced: 19 Apr 2025

https://github.com/tuva-health/provider

A dbt project that transforms messy public provider datasets into usable data for the Tuva Project.

analytics-engineering data-analytics data-governance data-lineage data-pipelines data-warehouse dbt healthcare healthcare-analysis healthcare-data open-source providers snowflake sql

Last synced: 18 Mar 2026

https://github.com/dbt-labs/jaffle-shop-mesh-finance

A ✨ meshified ✨ open source sandbox project exploring dbt workflows via a fictional sandwich shop's data. This is a domain-focused node in the mesh focused on finance models, built on the jaffle-shop-mesh-platform project.

analytics analytics-engineering data data-engineering dbt dbt-cloud

Last synced: 05 Mar 2026

https://github.com/gwenwindflower/copier-dbt

📝🖨️ A copier template for dbt projects. ⚙️🧡

analytics analytics-engineering copier-template data-engineering dbt template-project

Last synced: 15 Jun 2025

https://github.com/portovep/dbt-testing-examples

Examples of how to implement unit, component, and contract tests for dbt data apps

analytics-engineering contract-testing data-engineering dbt sql tutorial-code tutorial-exercises unit-testing

Last synced: 23 Jan 2026

https://github.com/dbt-labs/jaffle-shop-mesh-marketing

A ✨ meshified ✨ open source sandbox project exploring dbt workflows via a fictional sandwich shop's data. This is a domain-focused node in the mesh focused on marketing models, built on the jaffle-shop-mesh-platform project.

analytics analytics-engineering data data-engineering dbt dbt-cloud

Last synced: 04 Mar 2026

https://github.com/pizofreude/de-zoomcamp

This repository contains learning materials and notes for the Data Engineering Zoomcamp by Datatalks.Club.

analytics-engineering batch-processing containerization-with-docker data-engineering data-ingestion data-warehouse streaming workflow-orchestration

Last synced: 05 Apr 2026

https://github.com/aleenprd/docbt

Documentation Build Tool - Generate YAML documentation for dbt models with optional AI assistance. Built with Streamlit for an intuitive and familiar web interface.

ai analytics-engineering bigquery data data-modeling data-science dbt docker llm lmstudio ollama openai snowflake sql streamlit

Last synced: 11 Nov 2025

https://github.com/gdbecker/dbtlabslearning

Learn the foundational steps of transforming data in dbt Cloud. Start by connecting dbt Cloud to a data warehouse and Git repository, then explore key concepts like modeling, sources, testing, documentation, and deployment. Get hands-on by building a model and running tests in dbt Cloud.

analytics-engineering bigquery dbt dbt-cloud jinja macros models packages sql testing

Last synced: 02 Jan 2026

https://github.com/hvignolo87/analytics_engineer_assignment

Resolution of the Analytics Engineering assignment of Clara

airflow analytics-engineering data-engineering dbt postgresql python sql

Last synced: 30 Jul 2025

https://github.com/bchaoss/trash-wheel-pipeline

dbt data pipeline for analyzing trash wheel collection data

analytics-engineering dbt duckdb elt motherduck sql tidytuesday

Last synced: 07 Oct 2025

https://github.com/zachpinto/real-time-indicators

Streamlit-based analytics dashboard visualizing real-time economic indicators. This project uses cron jobs to provide real-time updates of common economic indicators

analytics-engineering data-analysis plotly streamlit visualization

Last synced: 07 Sep 2025

https://github.com/alyllanes/dbt-analytics-projects

A repository for all the dbt modeling projects I'm working on to practice my skills

analytics-engineering bigquery dbt sql

Last synced: 05 Oct 2025

https://github.com/chukwuemekaaham/ny_taxi_rides

Analytics engineering using Dbt and Google Cloud BigQuery

analytics-engineering bigquery dbt github

Last synced: 07 Jul 2025

https://github.com/deepakramani/dbt-bike-insights

A small scale ETL project that ingests data into the data warehouse using dbt(data build tool) to facilitate transformation and analytics.

analytics-engineering data-engineering dbt dimensional-modeling etl jinja sql

Last synced: 21 Jan 2026

https://github.com/space-lumps/healthcare-analytics-sql

Healthcare analytics SQL project focused on defining clean encounter-level cohorts, deriving patient and visit features, and validating data quality through explicit QA checks. The repository emphasizes reproducible, well-structured SQL, clear cohort logic, and diagnostic queries suitable for real-world healthcare or clinical analytics workflows.

analytics-engineering cohort-analysis csv csv-processing data-modeling data-quality data-validation duckdb etl healthcare-analytics normalization qa-validation sql

Last synced: 03 Mar 2026

https://github.com/goku007007/dbt-duckdb-incremental-starter

Tiny dbt + DuckDB project: seeds → staged model with tests → incremental fact table that only loads new rows. Clone-and-run.

analytics-engineering data-engineering dbt duckdb incremental-models seeds sql testing

Last synced: 02 Sep 2025

https://github.com/datalopes1/deliverycenter_dwh

Projeto de Data Warehouse utilizando o Dataset Delivery Center: Food & Goods orders in Brazil

analytics-engineering dbt dbt-core python

Last synced: 28 Jul 2025

https://github.com/apancoast/healthcare-deserts-and-public-transit

This dbt-based project aims to analyze the intersection of healthcare accessibility and public transit coverage in Mecklenburg County, NC.

analysis analytics-engineering data-engineer dbt healthcare hpsa hrsa public-data public-transit

Last synced: 27 Oct 2025

https://github.com/deepakramani/etl_with_dbt_dwh

A small scale ETL project that ingests data into the data warehouse using dbt(data build tool) to facilitate transformation and analytics.

analytics-engineering data-engineering dbt dimensional-modeling etl jinja sql

Last synced: 30 Mar 2025

https://github.com/pyladiesams/data-ingestion-modern-stack-apr2025

Learn how to ingest data with dlt (data load tool) from a REST API and load it into DuckDB

analytics-engineering data-engineering data-ingestion dlt duckdb workshop

Last synced: 22 Jun 2025

https://github.com/paty-oliveira/edit-analytics-eng-postgresql

Repository for Analytics Engineering Module - EDIT academy

analytics-engineering dbt dbt-core docker-compose postgresql

Last synced: 06 Jan 2026

https://github.com/zkan/analytics-engineering-with-dbt

Analytics Engineering with dbt

analytics-engineering dbt

Last synced: 20 Mar 2026

https://github.com/shahidmalik4/analytics-pipeline-fastapi-dbt

A full-stack data analytics pipeline using DBT, FastAPI, Streamlit and Postgres. Transforms raw data into modeled tables and exposes KPIs via API endpoints, with an interactive dashboard for visualization.

analytics-engineering data-cleaning-pipeline dbt dbt-core fastapi postgresql streamlit

Last synced: 08 Sep 2025

https://github.com/space-lumps/oroboro-dw-dbt

This project builds a clean, BI‑ready table user_base in BigQuery

analytics-engineering bigquery business-intelligence ci-cd data-mart data-modeling dbt elt gcp geospatial jinja metabase sql yaml

Last synced: 04 Sep 2025

https://github.com/datalopes1/olist_dw

Data Warehouse project with OLIST Dataset

analytics-engineering dbt dbt-core python

Last synced: 11 Jun 2025