Projects in Awesome Lists tagged with datawarehousing
A curated list of projects in awesome lists tagged with datawarehousing .
https://github.com/cynkra/dm
Working with relational data models in R
data-model data-warehousing datawarehousing dbi dbplyr r relational-databases
Last synced: 14 May 2025
https://github.com/Datavault-UK/automate-dv
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
data-vault dataengineering datalake datavault datavault20 datawarehouse datawarehousing dbt elt etl metadata snowflake sql
Last synced: 13 May 2025
https://github.com/datawithbaraa/sql-data-warehouse-project
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
data-analysis data-analytics data-cleaning data-engineering data-lakehouse data-science data-warehouse data-warehousing datalake datascience datawarehouse datawarehousing etl etl-job etl-pipeline medallion-architecture sql sql-query sql-server sqlserver
Last synced: 06 Apr 2025
https://github.com/mara/mara-schema
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
data-governance data-modeling datawarehousing metadata python
Last synced: 30 Apr 2025
https://github.com/mohamedhmini/tweetsolaping
implementing an end-to-end tweets ETL/Analysis pipeline.
analysis api-client cube-analysis datawarehouse datawarehousing etl-pipeline google-api-client multi-dimensional-analysis multithreading powerbi-report ssas-multidimensional ssis tweets tweets-classification tweets-scraper twitter-api web-crawling
Last synced: 10 Jun 2025
https://github.com/praveendecode/youtube-data-harvesting-warehousing
Efficient YouTube data harvesting and warehousing with Python, SQL, MongoDB, and Streamlit, enabling seamless analysis and visualization for insightful decision-making in content management and audience engagement strategies
api apiintegration dataanalysis dataharvesting datawarehousing eda mongodb postgres python sql
Last synced: 16 Aug 2025
https://github.com/kstrassheim/datawarehouse-crawler
This is a content and schema crawler tool to receive, update and import various kinds of data into a Onprem or Cloud based SQLServer or Azure-Synapse-Analysis (Azure Datawarehouse SQLServer). As source it supports SQLServer Tables, ODATA Endpoints, CSV Files or Excel Files. For multiple sources it can run in parallel mode where it would make a thread for each connection. The speciality of this crawler is that it creates the target tables by himself using the additional info from source.json. In case of Azure-Synapse-Analysis it would estimate the distribution type and keys. The syncing works completely without SQL Transactions by using a consistency correction algorithm for very frequent fact tables. There are 5 Syncing Algorithms (see Manual/Insert) which can be selected as well as one Update Algorithm.
azure-data-warehouse azure-synapse-analytics business-intelligence crawler csv data-import data-science datawarehouse datawarehousing docker dotnet-core-2 excel integration-testing odata parallel-computing sql
Last synced: 28 Apr 2026
https://github.com/aessing/demo-mdwh
Modern Dataware House Demos with Azure Databricks, Azure Data Factory & Azure Dedicated SQL pool (formerly SQL DW)
azure azure-data-factory azure-databricks data data-engineering data-science databricks databricks-notebooks datafactory datalake datawarehouse datawarehousing delta-lake demos etl machine-learning mdwh ml modern-data-warehouse spark
Last synced: 26 Jun 2025
https://github.com/jaehyeon-kim/iceberg-etl-demo
Data Warehousing ETL Demo with Apache Iceberg on EMR Local Environment
datawarehousing emr etl iceberg scd spark
Last synced: 17 Aug 2025
https://github.com/nouraalgohary/gravity-books-etl-and-data-warehouse
datawarehousing dwh etl mssql-database sqlserver ssas ssis
Last synced: 07 Mar 2026
https://github.com/salma-mamdoh/datawarehouse_project
Our project for Datawarehouse Course taken during fall 2024 semester
analytics dashboard datawarehousing deploy elt kpis powerbi sql ssis
Last synced: 25 Jan 2026
https://github.com/danlsn/causality
A Personal Data Platform and the culmination of years of curiosity and learning in the Data Engineering space.
data data-engineering datawarehousing personal-data quantified-self
Last synced: 06 Mar 2026
https://github.com/artempyanykh/data-zero-to-cloud
Slides and code for my talk 'Data pipelines. From zero to cloud scale'
data-engineering data-processing datawarehousing etl google-cloud google-dataflow
Last synced: 27 Mar 2025
https://github.com/firaskahlaoui/retail-data-warehouse
This project demonstrates the creation of a Data Warehouse using SQL Server 2022. It includes the design of dimension and fact tables, ETL processes for data integration, Python scripts for synthetic data generation, and SQL queries for KPI analysis to support business decision-making.
analytics database databasedesign datawarehousing etl kpi python sqlserver-2022
Last synced: 08 Jul 2025
https://github.com/ranagaballah/datawarehouse_ssis
SSIS (SQL Server Integration Services) project for building a data warehouse solution
database datawarehousing sql ssis tsql
Last synced: 09 Jun 2026
https://github.com/thomasshikalepo/sql-data-warehouse-project
Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics
data-analysis data-cleaning data-engineering data-lakehouse data-science data-warehouse data-warehousing datascience datawarehousing etl-pipeline medallion-architecture sql sql-query sql-server
Last synced: 02 Aug 2025
https://github.com/jnbdz/data-warehouse-quickstarts
Data warehouse (DW) quickstarts! :minidisc:
data-warehouse data-warehousing datawarehouse datawarehousing dw dwh enterprise-data-warehouse quickstart quickstarts
Last synced: 19 Mar 2026
https://github.com/phelipe-sempreboni/informations
Repository for tutorials, information and notes on technology in general.
amazon-web-services datahub datalake datamart datawarehouse datawarehousing etl modelagem-de-dados olap oltp oracle-database pl-sql pl-sql-script powerbi-desktop powerbi-service rds-database sql sqlserver
Last synced: 19 Apr 2026
https://github.com/cqllum/schema2dwh
⚡ Automatically produce a data model on your database using its information schema using GenAI.
ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design
Last synced: 13 Mar 2025
https://github.com/mehassanhmood/dataengineering-project
A data engineering project.
cognos-dashboard datawarehousing etl-pipeline mongodb tableau
Last synced: 29 Apr 2026
https://github.com/melinteflxrin/softserve-bigdata-project
End-to-end data warehousing project integrating APIs, ETL workflows, and PostgreSQL for analytics and reporting.
analytics api bigdata data datawarehousing externalapi pipeline postgres postgresql python warehouse
Last synced: 26 Jan 2026
https://github.com/yrehim7/data_warehouse_project
A complete, easy-to-follow guide on building a modern data warehouse with SQL Server. Learn how to design ETL processes, create effective data models, and leverage analytics for better insights.
data-cleaning data-lakehouse database datawarehouse datawarehousing etl medallion-architecture sql sql-query sql-server
Last synced: 27 Apr 2026
https://github.com/praveendecode/iss-data-warehouse-mongodb-sql-project
Space Exploration Data Fusion : Unleashing the International Space Station Insights with MongoDB and SQL Integration
database dataharvesting datawarehousing etl-pipeline iss mongodb postgresql python sql streamlit-webapp
Last synced: 12 Apr 2026
https://github.com/kmohamedalie/data-warehouse-ibm
Data Warehouse IBM 🔡🔢🏭
business-intelligence coursera dataengineering datawarehousing ibm
Last synced: 14 Jun 2026
https://github.com/muthukumar0908/final_retail_sales_forecast
In this project Utilizing advanced time series forecasting models, successfully predicted department-wide sales for each store for the upcoming year and Visualizing the data in streamlit GUI.
datawarehousing eda modelbuilding python pythonscraping streamlit-webapp
Last synced: 16 May 2026
https://github.com/bayoadejare/dw-optimization-insurance
Insurance Data Warehouse Optimization Project
datawarehousing index insurance optimization sql
Last synced: 28 Jun 2025
https://github.com/gerardo1909/tp_final_pulseras_inteligentes
Trabajo práctico final de la materia "Base de Datos" de la Licenciatura en Ciencia de Datos (UNSAM). 1C-2025
data-science database database-management datawarehousing etl etl-pipeline mongodb neo4j nosql nosql-database postgresql python supabase
Last synced: 11 Apr 2026
https://github.com/anuragmudgal96/data-warehouse-project
Designing and implementing a modern data warehouse on SQL Server, covering ETL pipelines, dimensional modeling, and analytical reporting.
data-analysis data-engineering data-warehouse datawarehousing etl etl-job etl-pipeline sql sql-server
Last synced: 09 Oct 2025
https://github.com/dina-hosny/retail-store-data-modeling-and-analysis-using-datastage
The project implements a star-schema data warehousing flow, then utilize IBM InfoSphere DataStage to develop efficient ETL pipelines to create data marts and perform some analysis on them.
data-analysis datastage datawarehousing etl extract ibm load transform
Last synced: 06 Mar 2026
https://github.com/ahmed-aquarius/data-warehousing-assignment
Solution of DW assignment with SSIS. Question2: type-2 slowly changing dimension with incremental loading. Quesiton3: Versioning
datawarehousing mssqlserver slowly-changing-dimensions sql ssis-packages versioning
Last synced: 18 Jun 2025
https://github.com/anniefib/otherprojects
Powering Data Dreams: From Orchestration to Analytics with Cloud Precision
airflow-etl-orchestration aws cloud-native-data-solutions data-analysis data-visualization database datamodelling datawarehousing eda end-to-end-data-pipelines machine-learning-models pgadmin4 spark-analytics sql
Last synced: 07 May 2026
https://github.com/yrehim7/data-warehouse-project
A complete, easy-to-follow guide on building a modern data warehouse with SQL Server. Learn how to design ETL processes, create effective data models, and leverage analytics for better insights.
data-cleaning data-lakehouse database datawarehouse datawarehousing etl medallion-architecture sql sql-query sql-server
Last synced: 01 Mar 2025
https://github.com/salma-mamdoh/ejada-internship-project
My Project at my Summer Internship At Data Management Team At EJADA
dashboard data-visualization dataanalysis dataengineering datawarehousing internship-project powerbi reports ssis
Last synced: 27 Jan 2026
https://github.com/sugumarsrinivasan/sql-datawarehouse-project
Building Mordern datawarehouse with SQL Server, including ETL Processes, data modeling, and data analytics.
data-analysis data-analytics data-engineering data-lake data-science data-warehouse datawarehousing etl etl-pipeline medallion-architecture sql sql-query sql-server
Last synced: 24 Oct 2025
https://github.com/zainea-bogdan/data_engineer_project_wowcinema
WoWCinema is a project based on a fictional scenario where I stepped into the role of a Data Engineer, designing and building an end-to-end Data Infrastructure. A ETL pipeline ingests data from multiple sources, transforms it, and loads it into a centralized PostgreSQL data warehouse to power analytics, KPI tracking, and reporting
analytics big-data data datawarehousing etl-pipeline postgres python sql
Last synced: 19 May 2026
https://github.com/abu14/data_warehousing_bi_data_pipelines
An OLAP system that contains integrated data and enable faster analytics.
business-intelligence databases datawarehouse datawarehousing olap oltp-databases sql
Last synced: 17 Mar 2026
https://github.com/dina-hosny/analyze-and-model-airline-system
Analyzing Airline System and Building Data Warehouse Model to Store the Data and Answer Some Business Questions
data-analysis data-modeling data-warehouse datawarehousing dwh plsql sql
Last synced: 05 Mar 2026
https://github.com/mariann95/sql_data_warehouse_and_analytics_project
Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics. This repository also contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
data-analysis data-analytics data-cleaning data-engineering data-lakehouse data-science data-science-portfolio data-warehouse data-warehousing datalake datawarehouse datawarehousing etl etl-job etl-pipeline medallion-architecture sql sql-query sql-server sqlserver
Last synced: 06 Jun 2026
https://github.com/moshora99/sql-data-warehouse-project
Build modern data warehouse with mysql, Including ETL processes, data modeling and analytics
data-analysis data-engineering data-science database datawarehouse datawarehousing etl scheme sql sql-query sql-server
Last synced: 27 Apr 2026