An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with dimensional-modeling

A curated list of projects in awesome lists tagged with dimensional-modeling .

https://github.com/mattiasthalen/adventure-works

Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principles on Adventure Works. Features programmatic model generation, event-enhanced Puppini bridges, and temporal resolution across DAS/DAB/DAR layers.

analytical-data-storage-system data-architecture data-engineering data-modeling data-warehouse dimensional-modeling duckdb hook-methodology iceberg lakehouse serverless sqlmesh unified-star-schema

Last synced: 11 Feb 2026

https://github.com/mchenryspagg/hng-hire-data-model

The project involves creating a data model for HNG Hire, implementing it in MySQL, and building a Power BI dashboard to display hiring statistics.

dashboard data database datamodeling dimensional-modeling mysql mysql-database powerbi starschema

Last synced: 11 Feb 2026

https://github.com/datalopes1/machine_stop

Projeto end-to-end de data analytics.

data-warehouse dbt dimensional-modeling duckdb postgresql python

Last synced: 16 Apr 2026

https://github.com/mchenryspagg/creating-a-dimensional-data-model

This project involves creating a dimensional data model using MySQL Workbench for a car repair shop’s operations in western Canada by examining a sample invoice,

dataanalysis dimensional-modeling erdiagram mysql mysql-database salesanalysis

Last synced: 08 Oct 2025

https://github.com/hase3b/end-to-end-dwh-pipeline

This repository contains the end-to-end pipeline for building a data warehouse for a real estate management company. The pipeline includes data generation, ETL process, creation of star schema dimensions and fact table, visualization using Power BI, and automation with Pabbly Connect.

api automation dashboard data-engineering data-generation data-pipeline data-warehousing database-schema dimensional-modeling eda erd etl-pipeline mockaroo pabbly powerbi relational-database star-schema workflow

Last synced: 04 Apr 2025

https://github.com/deepakramani/dbt-bike-insights

A small scale ETL project that ingests data into the data warehouse using dbt(data build tool) to facilitate transformation and analytics.

analytics-engineering data-engineering dbt dimensional-modeling etl jinja sql

Last synced: 21 Jan 2026

https://github.com/subhamay-bhattacharyya/customer360-snowflake-pipeline

End-to-end Snowflake data lake for a fictional retail bank — ingests 25,000 nested JSON customer records through a RAW → SILVER → GOLD pipeline, builds fact and dimension tables, and surfaces a Customer 360 & risk analytics dashboard via Streamlit in Snowflake.

banking-analytics cloud-championship cortex customer-360 data-engineering data-lake data-warehouse dimensional-modeling financial-services fintech lateral-flatten medallion-architecture python snowflake sql star-schema star-schemas streamlit streamlit-app

Last synced: 15 Apr 2026

https://github.com/maxinexiong/cloud-data-warehousing-with-aws-redshift

This project builds a cloud-based ETL pipeline for Sparkify to move data to a cloud data warehouse. It extracts song and user activity data from AWS S3, stages it in Redshift, and transforms it into a star-schema data model with fact and dimension tables, enabling efficient querying to answer business questions.

aws-boto3 aws-redshift aws-s3 cloud-data-warehouse data-engineering data-warehouse data-warehousing dimensional-model dimensional-modeling etl etl-pipeline extract-transform-load infrastructure-as-code postgresql postgresql-database redshift-cluster

Last synced: 27 Feb 2026

https://github.com/rihua-tech/nyc-311-service-requests-lakehouse

Azure-first medallion lakehouse for NYC 311 service request analytics with implemented bronze, silver, and gold processing, reusable data quality checks, dimensional modeling, and reporting marts.

azure-data-factory azure-data-lake-storage data-engineering databricks delta-lake dimensional-modeling lakehouse medallion-architecture power-bi pyspark

Last synced: 14 May 2026

https://github.com/lupusruber/music_analytics

This project processes real-time music event data using Kafka, Apache Spark on Google Cloud Dataproc, and stores the transformed data in BigQuery for analytics, all orchestrated by Airflow and managed with Terraform.

bigquery data-proc dimensional-modeling gcp-project kafka spark-structured-streaming

Last synced: 01 May 2026

https://github.com/husskhosravi/elt-data-warehouse-snowflake

Design and implement a full ELT data pipeline using Snowflake and S3, featuring star schema modelling, SCD Type 1 & 2 handling, and incremental load automation

aws-s3 data-engineering data-warehouse dimensional-modeling scd-type-1 scd-type-2 snowflake sql star-schema

Last synced: 03 Feb 2026

https://github.com/deepakramani/etl_with_dbt_dwh

A small scale ETL project that ingests data into the data warehouse using dbt(data build tool) to facilitate transformation and analytics.

analytics-engineering data-engineering dbt dimensional-modeling etl jinja sql

Last synced: 30 Mar 2025

https://github.com/gades-dataeng/webinar

Code, scripts, and resources for the Data Engineering Fundamentals Course Webinar, covering Python, data pipelines, Apache Airflow, and more.

apache-airflow data-engineering data-orchestration data-orchestrator data-pipelines dimensional-modeling python sql

Last synced: 31 Mar 2025