https://github.com/airscholar/modern-data-eng-dbt-databricks-azure
In this project, we set up an end-to-end data engineering pipeline using Apache Spark, Azure Databricks, and Data Build Tool (DBT), with Azure as our cloud provider.
- Host: GitHub
- URL: https://github.com/airscholar/modern-data-eng-dbt-databricks-azure
- Owner: airscholar
- Created: 2023-12-18T15:00:39.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2023-12-18T15:35:38.000Z (over 2 years ago)
- Last Synced: 2025-03-24T02:21:58.898Z (about 1 year ago)
- Topics: apache-spark, azure, databricks, dbt, modern-data-engineering
- Homepage: https://youtu.be/divjURi-low
- Size: 118 KB
- Stars: 27
- Watchers: 3
- Forks: 13
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Modern Data Engineering with Medallion Architecture using DBT, Databricks, Spark and Azure Cloud
In this project, we set up an end-to-end data engineering pipeline using Apache Spark, Azure Databricks, and Data Build Tool (DBT), with Azure as our cloud provider. The project illustrates data ingestion into the lakehouse, data integration with Azure Data Factory (ADF), and data transformation with Databricks and DBT.
## System Architecture

## Commands
Try running the following commands:
- `dbt run`: runs the models
- `dbt test`: runs the tests
- `dbt snapshot`: takes snapshots, used for slowly changing dimensions
- `dbt docs generate`: generates the documentation
- `dbt docs serve`: serves a local preview of the documentation
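
The commands above can also be chained from a small script. The following is a minimal sketch, not part of the repository: the names `DBT_STEPS` and `run_dbt_steps` are hypothetical, and it assumes `dbt` is on your `PATH` and that the script is started from the dbt project directory. `dbt docs serve` is left out because it starts a local web server and does not return until it is stopped.

```python
import shlex
import subprocess

# The dbt commands listed in this README, in the order given.
# `dbt docs serve` is omitted on purpose: it blocks while serving the docs.
DBT_STEPS = [
    "dbt run",
    "dbt test",
    "dbt snapshot",
    "dbt docs generate",
]


def run_dbt_steps(steps=DBT_STEPS, dry_run=True):
    """Run each dbt command in order, stopping at the first failure.

    With dry_run=True the commands are only printed, so the function can be
    tried without dbt installed. Returns the argument vectors it processed.
    """
    processed = []
    for step in steps:
        argv = shlex.split(step)
        print("$", step)
        if not dry_run:
            # check=True raises CalledProcessError on a non-zero exit code,
            # which prevents the remaining steps from running.
            subprocess.run(argv, check=True)
        processed.append(argv)
    return processed


if __name__ == "__main__":
    # Dry run by default; pass dry_run=False to actually invoke dbt.
    run_dbt_steps(dry_run=True)
```

Calling `run_dbt_steps(dry_run=False)` executes the steps for real. Stopping at the first failure is a design choice in this sketch, so that documentation is not generated from models that failed to build or test.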
## Resources:
* [Medium Article](https://medium.com/@yusuf.ganiyu/robust-data-pipelines-with-databricks-spark-dbt-and-azure-data-engineering-project-e5780fbc07a6)
* [DBT](https://docs.getdbt.com/guides)
* [Databricks](https://docs.databricks.com/)
* [Azure](https://docs.microsoft.com/en-us/azure/?product=featured)
* [Azure Data Factory](https://docs.microsoft.com/en-us/azure/data-factory/)
* [Azure Data Lake Storage Gen2](https://docs.microsoft.com/en-us/azure/storage/blobs/data-lake-storage-introduction)
### YouTube Video
[Watch the video on YouTube](https://youtu.be/divjURi-low)