An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with analytics-engineering

A curated list of projects in awesome lists tagged with analytics-engineering .

https://github.com/StructuredLabs/preswald

Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide, DuckDB, Pandas, and Plotly, Matplotlib, etc. Build dashboards, reports, and notebooks that run offline, load fast, and share like a document.

ai analytics analytics-engineering copilot data data-applications data-infrastructure data-pipelines data-sdk data-visualization gpt llm open-source python schema-management vscode

Last synced: 11 May 2025

https://github.com/structuredlabs/preswald

Preswald is a framework for building and deploying interactive data apps, internal tools, and dashboards with Python. With one command, you can launch, share, and deploy locally or in the cloud, turning Python scripts into powerful shareable apps.

ai analytics analytics-engineering copilot data data-applications data-infrastructure data-pipelines data-sdk data-visualization gpt llm open-source python schema-management vscode

Last synced: 13 May 2025

https://github.com/raystack/optimus

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

airflow analytics analytics-engineering automation bigquery business-intelligence data-modelling data-pipelines data-transformation data-warehouse dataops elt etl golang workflows

Last synced: 16 May 2025

https://github.com/elementary-data/dbt-data-reliability

dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

analytics analytics-engineering data data-lineage data-observability data-pipeline-monitoring data-pipelines data-reliability dbt dbt-artifacts dbt-packages dbt-tests

Last synced: 16 May 2025

https://github.com/dbt-labs/jaffle-shop

🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.

analytics analytics-engineering data data-engineering dbt dbt-cloud

Last synced: 28 Jan 2026

https://github.com/gmyrianthous/dbt-airflow

A Python package that creates fine-grained dbt tasks on Apache Airflow

airflow analytics-engineering data-engineering dbt python

Last synced: 08 Apr 2025

https://github.com/sidequery/sidemantic

A universal metrics layer. Compatible with definitions in LookML, MetricFlow, Cube with DuckDB, Snowflake, Clickhouse, Bigquery & more!

ai analytics analytics-engineering bigquery clickhouse data-engineering duckdb mcp postgresql semantic-layer snowflake spark sql visualization

Last synced: 21 Feb 2026

https://github.com/mundipagg/amora-data-build-tool

Amora Data Build Tool enables analysts and engineers to transform data on the data warehouse (BigQuery) by writing Amora Models that describe the data schema using Python's "PEP484 - Type Hints" and select statements with SQLAlchemy. Amora is able to transform Python code into SQL data transformation jobs that run inside the warehouse.

analytics analytics-dashboard analytics-engineering bigquery business-intelligence data-engineering data-modeling datacleaning dataquality elt machine-learning python transformation

Last synced: 08 Sep 2025

https://github.com/gmyrianthous/dbt-dummy

A dbt (data build tool) project you can use for testing purposes or experimentation

analytics-engineering data-engineering dbt docker sql

Last synced: 10 Jun 2025

https://github.com/dbt-labs/jaffle-shop-generator

🥪🏭 A simple CLI for generating synthetic Jaffle Shop data.

analytics-engineering faker synthetic-data synthetic-data-generator

Last synced: 01 May 2025

https://github.com/mattiasthalen/arcane-insight

Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and attributes, this project reveals detailed insights into card mechanics, strengths, and trends to support BI and strategic analysis.

analytics-engineering data-engineering data-vault data-warehouse duckdb elt etl hearthstone medallion-architecture sqlmesh

Last synced: 11 Feb 2026

https://github.com/montara-io/dbt-command-center

Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.

analytics-engineering bigquery data-analysis data-catalog data-engineering data-lineage data-observability data-pipeline data-pipelines data-validation data-warehouse dataops dbt dbt-packages elt etl orchestration python redshift

Last synced: 05 May 2025

https://github.com/kestra-io/examples

Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Services

analytics-engineering automation data-engineering data-orchestration data-pipelines data-workflows orchestration

Last synced: 09 Oct 2025

https://github.com/zkan/getting-started-with-analytics-engineering

Getting Started with Analytics Engineering

analytics-engineering dbt postgresql sql

Last synced: 19 Apr 2025

https://github.com/gwenwindflower/copier-dbt

📝🖨️ A copier template for dbt projects. ⚙️🧡

analytics analytics-engineering copier-template data-engineering dbt template-project

Last synced: 15 Jun 2025

https://github.com/portovep/dbt-testing-examples

Examples of how to implement unit, component, and contract tests for dbt data apps

analytics-engineering contract-testing data-engineering dbt sql tutorial-code tutorial-exercises unit-testing

Last synced: 23 Jan 2026

https://github.com/dbt-labs/jaffle-shop-mesh-finance

A ✨ meshified ✨ open source sandbox project exploring dbt workflows via a fictional sandwich shop's data. This is a domain-focused node in the mesh focused on finance models, built on the jaffle-shop-mesh-platform project.

analytics analytics-engineering data data-engineering dbt dbt-cloud

Last synced: 28 Feb 2025

https://github.com/dbt-labs/jaffle-shop-mesh-marketing

A ✨ meshified ✨ open source sandbox project exploring dbt workflows via a fictional sandwich shop's data. This is a domain-focused node in the mesh focused on marketing models, built on the jaffle-shop-mesh-platform project.

analytics analytics-engineering data data-engineering dbt dbt-cloud

Last synced: 28 Feb 2025

https://github.com/zachpinto/real-time-indicators

Streamlit-based analytics dashboard visualizing real-time economic indicators. This project uses cron jobs to provide real-time updates of common economic indicators

analytics-engineering data-analysis plotly streamlit visualization

Last synced: 07 Sep 2025

https://github.com/hvignolo87/analytics_engineer_assignment

Resolution of the Analytics Engineering assignment of Clara

airflow analytics-engineering data-engineering dbt postgresql python sql

Last synced: 30 Jul 2025

https://github.com/gdbecker/dbtlabslearning

Learn the foundational steps of transforming data in dbt Cloud. Start by connecting dbt Cloud to a data warehouse and Git repository, then explore key concepts like modeling, sources, testing, documentation, and deployment. Get hands-on by building a model and running tests in dbt Cloud.

analytics-engineering bigquery dbt dbt-cloud jinja macros models packages sql testing

Last synced: 02 Jan 2026

https://github.com/bchaoss/trash-wheel-pipeline

dbt data pipeline for analyzing trash wheel collection data

analytics-engineering dbt duckdb elt motherduck sql tidytuesday

Last synced: 07 Oct 2025

https://github.com/aleenprd/docbt

Documentation Build Tool - Generate YAML documentation for dbt models with optional AI assistance. Built with Streamlit for an intuitive and familiar web interface.

ai analytics-engineering bigquery data data-modeling data-science dbt docker llm lmstudio ollama openai snowflake sql streamlit

Last synced: 11 Nov 2025

https://github.com/datalopes1/olist_dw

Data Warehouse project with OLIST Dataset

analytics-engineering dbt dbt-core python

Last synced: 11 Jun 2025

https://github.com/space-lumps/oroboro-dw-dbt

This project builds a clean, BI‑ready table user_base in BigQuery

analytics-engineering bigquery business-intelligence ci-cd data-mart data-modeling dbt elt gcp geospatial jinja metabase sql yaml

Last synced: 04 Sep 2025

https://github.com/paty-oliveira/edit-analytics-eng-postgresql

Repository for Analytics Engineering Module - EDIT academy

analytics-engineering dbt dbt-core docker-compose postgresql

Last synced: 06 Jan 2026

https://github.com/pyladiesams/data-ingestion-modern-stack-apr2025

Learn how to ingest data with dlt (data load tool) from a REST API and load it into DuckDB

analytics-engineering data-engineering data-ingestion dlt duckdb workshop

Last synced: 22 Jun 2025

https://github.com/chukwuemekaaham/ny_taxi_rides

Analytics engineering using Dbt and Google Cloud BigQuery

analytics-engineering bigquery dbt github

Last synced: 07 Jul 2025

https://github.com/alyllanes/dbt-analytics-projects

A repository for all the dbt modeling projects I'm working on to practice my skills

analytics-engineering bigquery dbt sql

Last synced: 05 Oct 2025

https://github.com/apancoast/healthcare-deserts-and-public-transit

This dbt-based project aims to analyze the intersection of healthcare accessibility and public transit coverage in Mecklenburg County, NC.

analysis analytics-engineering data-engineer dbt healthcare hpsa hrsa public-data public-transit

Last synced: 27 Oct 2025

https://github.com/zkan/analytics-engineering-with-dbt

Analytics Engineering with dbt

analytics-engineering dbt

Last synced: 05 Apr 2025

https://github.com/deepakramani/dbt-bike-insights

A small scale ETL project that ingests data into the data warehouse using dbt(data build tool) to facilitate transformation and analytics.

analytics-engineering data-engineering dbt dimensional-modeling etl jinja sql

Last synced: 21 Jan 2026

https://github.com/goku007007/dbt-duckdb-incremental-starter

Tiny dbt + DuckDB project: seeds → staged model with tests → incremental fact table that only loads new rows. Clone-and-run.

analytics-engineering data-engineering dbt duckdb incremental-models seeds sql testing

Last synced: 02 Sep 2025

https://github.com/datalopes1/deliverycenter_dwh

Projeto de Data Warehouse utilizando o Dataset Delivery Center: Food & Goods orders in Brazil

analytics-engineering dbt dbt-core python

Last synced: 28 Jul 2025

https://github.com/deepakramani/etl_with_dbt_dwh

A small scale ETL project that ingests data into the data warehouse using dbt(data build tool) to facilitate transformation and analytics.

analytics-engineering data-engineering dbt dimensional-modeling etl jinja sql

Last synced: 30 Mar 2025

https://github.com/shahidmalik4/analytics-pipeline-fastapi-dbt

A full-stack data analytics pipeline using DBT, FastAPI, Streamlit and Postgres. Transforms raw data into modeled tables and exposes KPIs via API endpoints, with an interactive dashboard for visualization.

analytics-engineering data-cleaning-pipeline dbt dbt-core fastapi postgresql streamlit

Last synced: 08 Sep 2025