An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with medallion-architecture

A curated list of projects in awesome lists tagged with medallion-architecture.

https://github.com/mattiasthalen/arcane-insight

Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and attributes, this project reveals detailed insights into card mechanics, strengths, and trends to support BI and strategic analysis.

analytics-engineering data-engineering data-vault data-warehouse duckdb elt etl hearthstone medallion-architecture sqlmesh

Last synced: 16 Apr 2025

https://github.com/narius2030/molisa-data-warehouse-integration

Extracts data from multiple databases in the Labor, Invalids and Social Affairs sectors, converts it to an appropriate structure and format, and uploads it to a shared data warehouse and data mart. This lets staff at state agencies easily retrieve and analyze data from the compiled data warehouse.

apache-airflow apache-spark api-rest data-pipeline data-warehousing medallion-architecture postgresql

Last synced: 14 Feb 2025

https://github.com/hq969/olympics-dataset-analysis

The Olympic Dataset Analysis project involves exploring and analyzing historical Olympic Games data, including athletes, countries, medals, sports, and event performance. Using data analytics and visualization techniques, this project uncovers patterns in Olympic history, top-performing countries, and athlete achievements.

dataanalytics interaction medallion-architecture timeseries-analysis

Last synced: 17 Mar 2025

https://github.com/narius2030/datalake-solution-imcp

This project involved the development and implementation of a Data Lake architecture to support an AI model capable of generating image captions. The architecture was designed to efficiently ingest, process, and centrally store large volumes of image and text data.

data-lake docker-container etl-pipeline fastapi medallion-architecture mlops nosql-database object-storage

Last synced: 25 Mar 2025

https://github.com/jibbs1703/tickit-data-lake

This repository demonstrates the creation of a robust, 3-tier data lake using an Orchestrator and AWS resources. It collects data from an on-premises NoSQL database and loads it into a SQL database in AWS.

aws-glue aws-glue-crawler aws-glue-data-catalog aws-redshift aws-s3 boto3 data-lake database etl-pipeline medallion-architecture mongodb precommit-hooks

Last synced: 20 Mar 2025

https://github.com/najmaelboutaheri/patents_analysis

This repository contains code and resources for analyzing patents using Apache Spark, Python, and AWS services. The objective of this project is to extract insights and trends from patent data to inform business decisions and intellectual property strategies.

azure azure-databricks azuredatafactory deltalake gui medallion-architecture patents-analysis powerbi-report pyspark

Last synced: 24 Feb 2025

https://github.com/vmrchs/medallion-architecture-pipeline

This Brewery Data Pipeline demonstrates a practical implementation of the medallion architecture (bronze, silver, gold) for data processing. It extracts brewery information from the Open Brewery DB API, transforms the data through multiple stages, and prepares it for analysis.

api docker etl-pipeline medallion-architecture

Last synced: 06 Apr 2025
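
As a rough illustration of the bronze, silver, and gold stages named in the entry above, here is a minimal Python sketch. It is not taken from the linked repository: the sample records, the field names (`id`, `name`, `brewery_type`, `state`), and the functions `to_bronze`, `to_silver`, and `to_gold` are all hypothetical, and in-memory data stands in for a real API call.

```python
from collections import Counter

# Hypothetical raw records standing in for an API response.
# The field names are assumptions made for this sketch.
RAW = [
    {"id": "a1", "name": " Alpha Ales ", "brewery_type": "Micro", "state": "Oregon"},
    {"id": "a1", "name": " Alpha Ales ", "brewery_type": "Micro", "state": "Oregon"},
    {"id": "b2", "name": "Beta Brewing", "brewery_type": "brewpub", "state": "texas"},
    {"id": None, "name": "No Id", "brewery_type": "micro", "state": "Texas"},
    {"id": "c3", "name": "Gamma Beer", "brewery_type": "micro", "state": "Texas"},
]


def to_bronze(raw):
    """Bronze: keep the data as received, copying each record unchanged."""
    return [dict(record) for record in raw]


def to_silver(bronze):
    """Silver: drop records without an id, remove duplicates, normalize text."""
    seen = set()
    cleaned = []
    for record in bronze:
        record_id = record.get("id")
        if not record_id or record_id in seen:
            continue
        seen.add(record_id)
        cleaned.append({
            "id": record_id,
            "name": record["name"].strip(),
            "brewery_type": record["brewery_type"].lower(),
            "state": record["state"].title(),
        })
    return cleaned


def to_gold(silver):
    """Gold: aggregate to an analysis-ready count per (state, brewery_type)."""
    return dict(Counter((r["state"], r["brewery_type"]) for r in silver))


bronze = to_bronze(RAW)
silver = to_silver(bronze)
gold = to_gold(silver)
```

Each stage reads only from the one before it, so the raw bronze copy stays available if the cleaning or aggregation rules change later.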

https://github.com/yrehim7/data-warehouse-project

A complete, easy-to-follow guide on building a modern data warehouse with SQL Server. Learn how to design ETL processes, create effective data models, and leverage analytics for better insights.

data-cleaning data-lakehouse database datawarehouse datawarehousing etl medallion-architecture sql sql-query sql-server

Last synced: 01 Mar 2025

https://github.com/mariann95/sql_data_warehouse_and_analytics_project

Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics. This repository also contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.

data-analysis data-analytics data-cleaning data-engineering data-lakehouse data-science data-science-portfolio data-warehouse data-warehousing datalake datawarehouse datawarehousing etl etl-job etl-pipeline medallion-architecture sql sql-query sql-server sqlserver

Last synced: 01 Mar 2025

https://github.com/yrehim7/data_warehouse_project

A complete, easy-to-follow guide on building a modern data warehouse with SQL Server. Learn how to design ETL processes, create effective data models, and leverage analytics for better insights.

data-cleaning data-lakehouse database datawarehouse datawarehousing etl medallion-architecture sql sql-query sql-server

Last synced: 10 Apr 2025

https://github.com/nxion/sql-data-warehouse-project

Building a modern data warehouse with MS SQL Server, ETL processes, data modeling, and analytics.

data data-analysis data-analytics data-engineering data-lakehouse data-warehouse datalake datascience etl etl-job medallion-architecture ms mssql sql sql-query sql-server

Last synced: 03 Mar 2025

https://github.com/duyanh711/robustdatapipelineformoderndataengineering

In this project, we set up end-to-end data engineering using Apache Spark, Azure Databricks, and Data Build Tool (DBT), with Azure as our cloud provider.

apache-spark azure databricks dbt medallion-architecture

Last synced: 06 Mar 2025

https://github.com/mekwiset/medallion_datalakehouse

Building a Data Lakehouse using the Medallion architecture.

azure databricks datafactory dbt medallion-architecture

Last synced: 19 Apr 2025