An open API service indexing awesome lists of open source software.

https://github.com/redgerd/databricks-e2e-data-lakehouse

End-to-end data engineering pipeline on Azure Databricks. It covers data ingestion with Autoloader and Structured Streaming, ETL with PySpark, dimensional modeling (star schema), and Slowly Changing Dimensions (SCD). The entire process is orchestrated using Delta Live Tables (DLT), showcasing a robust and production-ready data solution.
https://github.com/redgerd/databricks-e2e-data-lakehouse

Last synced: 9 months ago
JSON representation

End-to-end data engineering pipeline on Azure Databricks. It covers data ingestion with Autoloader and Structured Streaming, ETL with PySpark, dimensional modeling (star schema), and Slowly Changing Dimensions (SCD). The entire process is orchestrated using Delta Live Tables (DLT), showcasing a robust and production-ready data solution.

Awesome Lists containing this project