Projects in Awesome Lists tagged with analytics-engineering
A curated list of projects in awesome lists tagged with analytics-engineering .
https://github.com/StructuredLabs/preswald
Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide, DuckDB, Pandas, and Plotly, Matplotlib, etc. Build dashboards, reports, and notebooks that run offline, load fast, and share like a document.
ai analytics analytics-engineering copilot data data-applications data-infrastructure data-pipelines data-sdk data-visualization gpt llm open-source python schema-management vscode
Last synced: 11 May 2025
https://github.com/structuredlabs/preswald
Preswald is a framework for building and deploying interactive data apps, internal tools, and dashboards with Python. With one command, you can launch, share, and deploy locally or in the cloud, turning Python scripts into powerful shareable apps.
ai analytics analytics-engineering copilot data data-applications data-infrastructure data-pipelines data-sdk data-visualization gpt llm open-source python schema-management vscode
Last synced: 13 May 2025
https://github.com/zinggai/zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
analytics analytics-engineering data-science data-transformation data-transformations dataengineering datalake dataquality dedupe deduplication entity-resolution etl fuzzy-matching fuzzymatch identity identity-resolution masterdata ml modern-data-stack spark
Last synced: 14 May 2025
https://github.com/raystack/optimus
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
airflow analytics analytics-engineering automation bigquery business-intelligence data-modelling data-pipelines data-transformation data-warehouse dataops elt etl golang workflows
Last synced: 16 May 2025
https://github.com/elementary-data/dbt-data-reliability
dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
analytics analytics-engineering data data-lineage data-observability data-pipeline-monitoring data-pipelines data-reliability dbt dbt-artifacts dbt-packages dbt-tests
Last synced: 16 May 2025
https://github.com/tuva-health/tuva
Main repo including core data model, data marts, data quality tests, and terminology sets.
analytics-engineering bigquery data-analytics data-governance data-lineage data-pipelines data-warehouse dbt dbt-packages healthcare healthcare-analysis healthcare-data open-source redshift snowflake sql terminology
Last synced: 06 Feb 2026
https://github.com/dbt-labs/jaffle-shop
🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.
analytics analytics-engineering data data-engineering dbt dbt-cloud
Last synced: 28 Jan 2026
https://github.com/dbt-msft/dbt-sqlserver
dbt adapter for SQL Server and Azure SQL
analytics-engineering azure-sql azure-sql-db data-engineering dbt dbt-sqlserver microsoft microsoft-sql-server mssql sql sql-server t-sql transact-sql tsql
Last synced: 14 May 2025
https://github.com/anna-geller/dataflow-ops
Project demonstrating how to automate Prefect 2.0 deployments to AWS ECS Fargate
analytics analytics-engineering automation aws cicd data data-engineering data-engineering-infrastructure data-engineering-pipeline data-science dataflow dataflow-ops infrastructure-as-code observability orchestration pipeline prefect python serverless
Last synced: 24 Mar 2025
https://github.com/anna-geller/prefect-dataplatform
Example repository showing how to build a data platform with Prefect, dbt and Snowflake
analytics analytics-engineering automation data-engineering data-platform data-warehousing dataflow dataflow-ops dbt orchestration prefect python snowflake sql
Last synced: 25 Oct 2025
https://github.com/gmyrianthous/dbt-airflow
A Python package that creates fine-grained dbt tasks on Apache Airflow
airflow analytics-engineering data-engineering dbt python
Last synced: 08 Apr 2025
https://github.com/sidequery/sidemantic
A universal metrics layer. Compatible with definitions in LookML, MetricFlow, Cube with DuckDB, Snowflake, Clickhouse, Bigquery & more!
ai analytics analytics-engineering bigquery clickhouse data-engineering duckdb mcp postgresql semantic-layer snowflake spark sql visualization
Last synced: 21 Feb 2026
https://github.com/mundipagg/amora-data-build-tool
Amora Data Build Tool enables analysts and engineers to transform data on the data warehouse (BigQuery) by writing Amora Models that describe the data schema using Python's "PEP484 - Type Hints" and select statements with SQLAlchemy. Amora is able to transform Python code into SQL data transformation jobs that run inside the warehouse.
analytics analytics-dashboard analytics-engineering bigquery business-intelligence data-engineering data-modeling datacleaning dataquality elt machine-learning python transformation
Last synced: 08 Sep 2025
https://github.com/gmyrianthous/dbt-dummy
A dbt (data build tool) project you can use for testing purposes or experimentation
analytics-engineering data-engineering dbt docker sql
Last synced: 10 Jun 2025
https://github.com/dbt-labs/jaffle-shop-generator
🥪🏭 A simple CLI for generating synthetic Jaffle Shop data.
analytics-engineering faker synthetic-data synthetic-data-generator
Last synced: 01 May 2025
https://github.com/mattiasthalen/arcane-insight
Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and attributes, this project reveals detailed insights into card mechanics, strengths, and trends to support BI and strategic analysis.
analytics-engineering data-engineering data-vault data-warehouse duckdb elt etl hearthstone medallion-architecture sqlmesh
Last synced: 11 Feb 2026
https://github.com/montara-io/dbt-command-center
Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.
analytics-engineering bigquery data-analysis data-catalog data-engineering data-lineage data-observability data-pipeline data-pipelines data-validation data-warehouse dataops dbt dbt-packages elt etl orchestration python redshift
Last synced: 05 May 2025
https://github.com/kestra-io/examples
Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Services
analytics-engineering automation data-engineering data-orchestration data-pipelines data-workflows orchestration
Last synced: 09 Oct 2025
https://github.com/sanchitvj/sports_betting_analytics_engine
A data and analytics engineering platform designed for real-time sports betting analytics.
airflow analytics-engineering aws betflow cdc data-engineering dbt druid grafana kafka lambda-architecture olap-database snowflake spark-streaming sports-betting
Last synced: 30 Oct 2025
https://github.com/zkan/getting-started-with-analytics-engineering
Getting Started with Analytics Engineering
analytics-engineering dbt postgresql sql
Last synced: 19 Apr 2025
https://github.com/anna-geller/prefect-getting-started
Get started with Prefect by scheduling your Prefect flows with GitHub Actions
analytics-engineering automation cicd data data-engineering data-engineering-infrastructure data-engineering-pipeline data-pipeline data-science dataflow dataflow-ops github-actions orchestration pipeline prefect python scheduling serverless
Last synced: 13 Jun 2025
https://github.com/gwenwindflower/copier-dbt
📝🖨️ A copier template for dbt projects. ⚙️🧡
analytics analytics-engineering copier-template data-engineering dbt template-project
Last synced: 15 Jun 2025
https://github.com/portovep/dbt-testing-examples
Examples of how to implement unit, component, and contract tests for dbt data apps
analytics-engineering contract-testing data-engineering dbt sql tutorial-code tutorial-exercises unit-testing
Last synced: 23 Jan 2026
https://github.com/dbt-labs/jaffle-shop-mesh-finance
A ✨ meshified ✨ open source sandbox project exploring dbt workflows via a fictional sandwich shop's data. This is a domain-focused node in the mesh focused on finance models, built on the jaffle-shop-mesh-platform project.
analytics analytics-engineering data data-engineering dbt dbt-cloud
Last synced: 28 Feb 2025
https://github.com/agutiernc/nyc-citi-bike-insights
Data Engineering Project using NYC Citi Bike data for years 2019, 2020, and 2023
analytics-engineering batch-processing data-engineering data-lake data-warehouse dbt dlt etl-pipeline google-cloud-storage python sql terraform
Last synced: 25 Feb 2025
https://github.com/mta-tech/seeknal
Seeknal is an all-in-one platform for data and AI/ML engineering
analytics-engineering data-engineering data-science duckdb feature-engineering feature-management feature-store machine-learning mlops
Last synced: 26 Feb 2026
https://github.com/paty-oliveira/dbt-playground
Repository for data modelling with dbt
analytics-engineering bigquery dbt docker-compose jinja postgresql python sql
Last synced: 10 Feb 2026
https://github.com/alhankeser/antren-dbt
analytics-engineering data-engineering dbt dbt-core docker sql
Last synced: 27 Mar 2025
https://github.com/moj-analytical-services/data-and-analytics-engineering-tech-radar
Visualizing our technology choices
analytics-engineering data-engineering
Last synced: 09 Apr 2025
https://github.com/dbt-labs/jaffle-shop-mesh-marketing
A ✨ meshified ✨ open source sandbox project exploring dbt workflows via a fictional sandwich shop's data. This is a domain-focused node in the mesh focused on marketing models, built on the jaffle-shop-mesh-platform project.
analytics analytics-engineering data data-engineering dbt dbt-cloud
Last synced: 28 Feb 2025
https://github.com/zachpinto/real-time-indicators
Streamlit-based analytics dashboard visualizing real-time economic indicators. This project uses cron jobs to provide real-time updates of common economic indicators
analytics-engineering data-analysis plotly streamlit visualization
Last synced: 07 Sep 2025
https://github.com/hvignolo87/analytics_engineer_assignment
Resolution of the Analytics Engineering assignment of Clara
airflow analytics-engineering data-engineering dbt postgresql python sql
Last synced: 30 Jul 2025
https://github.com/gdbecker/dbtlabslearning
Learn the foundational steps of transforming data in dbt Cloud. Start by connecting dbt Cloud to a data warehouse and Git repository, then explore key concepts like modeling, sources, testing, documentation, and deployment. Get hands-on by building a model and running tests in dbt Cloud.
analytics-engineering bigquery dbt dbt-cloud jinja macros models packages sql testing
Last synced: 02 Jan 2026
https://github.com/bchaoss/trash-wheel-pipeline
dbt data pipeline for analyzing trash wheel collection data
analytics-engineering dbt duckdb elt motherduck sql tidytuesday
Last synced: 07 Oct 2025
https://github.com/aleenprd/docbt
Documentation Build Tool - Generate YAML documentation for dbt models with optional AI assistance. Built with Streamlit for an intuitive and familiar web interface.
ai analytics-engineering bigquery data data-modeling data-science dbt docker llm lmstudio ollama openai snowflake sql streamlit
Last synced: 11 Nov 2025
https://github.com/datalopes1/olist_dw
Data Warehouse project with OLIST Dataset
analytics-engineering dbt dbt-core python
Last synced: 11 Jun 2025
https://github.com/space-lumps/oroboro-dw-dbt
This project builds a clean, BI‑ready table user_base in BigQuery
analytics-engineering bigquery business-intelligence ci-cd data-mart data-modeling dbt elt gcp geospatial jinja metabase sql yaml
Last synced: 04 Sep 2025
https://github.com/paty-oliveira/edit-analytics-eng-postgresql
Repository for Analytics Engineering Module - EDIT academy
analytics-engineering dbt dbt-core docker-compose postgresql
Last synced: 06 Jan 2026
https://github.com/pyladiesams/data-ingestion-modern-stack-apr2025
Learn how to ingest data with dlt (data load tool) from a REST API and load it into DuckDB
analytics-engineering data-engineering data-ingestion dlt duckdb workshop
Last synced: 22 Jun 2025
https://github.com/0x3d-site/awesome-analytics-0x3d-faqs-collection
Awesome Collection of 0x3d's Analytics FAQ's - Frequently Asked Questions
analytics analytics-engineering analytics-platform awesome awesome-list awesome-lists awesome-readme awesome-resources faqs resource resources
Last synced: 17 Apr 2025
https://github.com/chukwuemekaaham/ny_taxi_rides
Analytics engineering using Dbt and Google Cloud BigQuery
analytics-engineering bigquery dbt github
Last synced: 07 Jul 2025
https://github.com/alyllanes/dbt-analytics-projects
A repository for all the dbt modeling projects I'm working on to practice my skills
analytics-engineering bigquery dbt sql
Last synced: 05 Oct 2025
https://github.com/apancoast/healthcare-deserts-and-public-transit
This dbt-based project aims to analyze the intersection of healthcare accessibility and public transit coverage in Mecklenburg County, NC.
analysis analytics-engineering data-engineer dbt healthcare hpsa hrsa public-data public-transit
Last synced: 27 Oct 2025
https://github.com/zkan/analytics-engineering-with-dbt
Analytics Engineering with dbt
Last synced: 05 Apr 2025
https://github.com/deepakramani/dbt-bike-insights
A small scale ETL project that ingests data into the data warehouse using dbt(data build tool) to facilitate transformation and analytics.
analytics-engineering data-engineering dbt dimensional-modeling etl jinja sql
Last synced: 21 Jan 2026
https://github.com/goku007007/dbt-duckdb-incremental-starter
Tiny dbt + DuckDB project: seeds → staged model with tests → incremental fact table that only loads new rows. Clone-and-run.
analytics-engineering data-engineering dbt duckdb incremental-models seeds sql testing
Last synced: 02 Sep 2025
https://github.com/rafaeljurkfitz/dbt-jaffle-shop
DBT project with postgres using best practices.
analytics analytics-engineering best-practices data data-engineering dbt dbt-core etl postgresql sql transform
Last synced: 15 Jun 2025
https://github.com/datalopes1/deliverycenter_dwh
Projeto de Data Warehouse utilizando o Dataset Delivery Center: Food & Goods orders in Brazil
analytics-engineering dbt dbt-core python
Last synced: 28 Jul 2025
https://github.com/deepakramani/etl_with_dbt_dwh
A small scale ETL project that ingests data into the data warehouse using dbt(data build tool) to facilitate transformation and analytics.
analytics-engineering data-engineering dbt dimensional-modeling etl jinja sql
Last synced: 30 Mar 2025
https://github.com/shahidmalik4/analytics-pipeline-fastapi-dbt
A full-stack data analytics pipeline using DBT, FastAPI, Streamlit and Postgres. Transforms raw data into modeled tables and exposes KPIs via API endpoints, with an interactive dashboard for visualization.
analytics-engineering data-cleaning-pipeline dbt dbt-core fastapi postgresql streamlit
Last synced: 08 Sep 2025