An open API service indexing awesome lists of open source software.

https://github.com/ayeujjawalsingh/databricks


https://github.com/ayeujjawalsingh/databricks

change-data-capture change-data-feed data-engineer data-pipeline data-processing databricks deduplication delta-lake delta-live-tables delta-table medallion-architecture

Last synced: 7 months ago
JSON representation

Awesome Lists containing this project

README

          

# πŸ—ΊοΈ Databricks Learning Roadmap

Welcome to the **Databricks Knowledge Base**!

---

## πŸš€ Getting Started

### πŸ”Ή Databricks: Introduction and Architecture
- πŸ“„ [Introduction and Architecture](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks%3A%20Introduction%20and%20Architecture/Readme.md)

---

## 🧰 Utilities

### πŸ”Ή Databricks Services
- πŸ“„ [Services Overview](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks-Services/Readme.md)
- πŸ“„ [DBFS, DBUTILS, Magic Commands, and Widgets](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks-Services/DBFS%2C%20DBUTILS%2C%20Magic%20Commands%2C%20and%20Widgets/Readme.md)

---

## 🧱 Lakehouse & Delta Lake

### πŸ”Ή Delta Lake
- πŸ“„ [Overview](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta-Lake/Readme.md)
- πŸ“„ [DML (INSERT, UPDATE, DELETE, MERGE)](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta-Lake/DML(INSERT%2C%20UPDATE%2C%20DELETE%2C%20and%20MERGE)/Readme.md)
- πŸ“„ [Data Warehouse vs Lake vs Lakehouse](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta-Lake/Data%20Warehouse%20vs%20Data%20Lake%20vs%20Data%20Lakehouse/Readme.md)
- πŸ“„ [Delta Table Types and Features](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta-Lake/Delta%20Table%20Types(Managed%20and%20External%20Table)%20(Creation%2C%20Partitioning%2C%20Schema%20Evolution%2C%20Optimization)/Readme.md)
- πŸ“„ [Delta Transaction Logs](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta-Lake/Delta%20Transaction%20Logs/Readme.md)
- πŸ“„ [History](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta-Lake/History/Readme.md)
- πŸ“„ [Incremental ETL](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta-Lake/Incremental%20ETL%20with%20Delta/Readme.md)
- πŸ“„ [Medallion Lakehouse Architecture](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta-Lake/Medallion%20Lakehouse%20Architecture/Readme.md)
- πŸ“„ [OPTIMIZE and Z-ORDER](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta-Lake/OPTIMIZE%20and%20Z-ORDER/Readme.md)
- πŸ“„ [Parquet Table to Delta Table](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta-Lake/Parquet%20Table%20to%20Delta%20Table/Readme.md)
- πŸ“„ [Pitfalls of Data Lakes](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta-Lake/Pitfalls%20of%20Data%20Lakes/Readme.md)
- πŸ“„ [Time Travel](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta-Lake/Time-Travel/Readme.md)
- πŸ“„ [Vacuum](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta-Lake/Vacuum/Readme.md)

---

## 🧩 Data Management

### πŸ”Ή Modelling Data Management Solutions
- πŸ“„ [Autoload into Multiplex Bronze](https://github.com/ayeujjawalsingh/Databricks/blob/main/Modelling%20Data%20Management%20Solutions/Autoload%20into%20Multiplex%20Bronze/Readme.md)
- πŸ“„ [Bronze Ingestion Patterns](https://github.com/ayeujjawalsingh/Databricks/blob/main/Modelling%20Data%20Management%20Solutions/Bronze%20Ingestion%20Patterns/Readme.md)
- πŸ“„ [Quality Enforcement](https://github.com/ayeujjawalsingh/Databricks/tree/main/Modelling%20Data%20Management%20Solutions/Quality%20Enforcement/Readme.md)
- πŸ“„ [Slowly Changing Dimensions (SCD)](https://github.com/ayeujjawalsingh/Databricks/blob/main/Modelling%20Data%20Management%20Solutions/Slowly%20Changing%20Dimensions%20(SCD)/Readme.md)
- πŸ“„ [Streaming Deduplication](https://github.com/ayeujjawalsingh/Databricks/blob/main/Modelling%20Data%20Management%20Solutions/Streaming%20Deduplication/Readme.md)
- πŸ“„ [Streaming from Multiplex Bronze](https://github.com/ayeujjawalsingh/Databricks/blob/main/Modelling%20Data%20Management%20Solutions/Streaming%20from%20Multiplex%20Bronze/Reamd.md)
- πŸ“„ [Type 2 SCD](https://github.com/ayeujjawalsingh/Databricks/blob/main/Modelling%20Data%20Management%20Solutions/Type%202%20SCD/Readme.md)

---

## βš™οΈ Data Engineering & Processing

### πŸ”Ή Data Processing
- πŸ“„ [Change Data Capture](https://github.com/ayeujjawalsingh/Databricks/blob/main/Data%20Processing/Change%20Data%20Capture/Readme.md)
- πŸ“„ [Materialized Gold Tables](https://github.com/ayeujjawalsingh/Databricks/blob/main/Data%20Processing/Materialized%20Gold%20Tables/Readme.md)
- πŸ“„ [Stream-Stream and Stream-Static Joins](https://github.com/ayeujjawalsingh/Databricks/blob/main/Data%20Processing/Stream-Stream%20and%20Stream-Static%20Joins/Readme.md)

### πŸ”Ή Delta Live Tables (DLT)
- πŸ“„ [Introduction to DLT](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta%20Live%20Table/Introductions/Readme.md)
- πŸ“„ [Define Pipeline, Tables, and Views](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta%20Live%20Table/Define%20Pipeline%2C%20Tables%2C%20and%20Views%20with%20DLT/Readme.md)
- πŸ“„ [Incremental Ingest Using SQL + Auto Loader](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta%20Live%20Table/Ingest%20Raw%20Data%20Incrementally%20Using%20SQL%20%2B%20Auto%20Loader%20in%20DLT/Readme.md)
- πŸ“„ [Test and Deploy DLT](https://github.com/ayeujjawalsingh/Databricks/blob/main/Delta%20Live%20Table/Test%20and%20Deploy%20the%20DLT/Readme.md)

---

## 🧠 SQL Analytics

### πŸ”Ή Databricks SQL
- πŸ“„ [Build Visualizations (Dashboards)](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks%20SQL/Build%20Visualizations%20(Dashboards)/Readme.md)
- πŸ“„ [Configuring SQL Warehouse](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks%20SQL/Configuring%20SQL%20Warehouse/Readme.md)
- πŸ“„ [Creating Queries and Parameterized Queries](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks%20SQL/Creating%20Queries%20and%20Parameterized%20Queries/Readme.md)
- πŸ“„ [Databricks SQL - CLI](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks%20SQL/Databricks%20SQL%20-%20CLI/Readme.md)
- πŸ“„ [Databricks SQL – Components](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks%20SQL/Databricks%20SQL%20%E2%80%93%20Components/Readme.md)
- πŸ“„ [SQL Alerts](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks%20SQL/SQL%20Alerts/Readme.md)
- πŸ“„ [Transfer Ownership](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks%20SQL/Transfer%20Ownership/Readme.md)

---

## πŸ›‘οΈ Data Governance & Security

### πŸ”Ή Data Governance (Manage Permission)
- πŸ“„ [Unity Catalog](https://github.com/ayeujjawalsingh/Databricks/blob/main/Data%20Governance%20(Manage%20Permission)/Unity%20Catalog/Readme.md)
- πŸ“„ [Users, Groups, Access Control, and Delta Sharing](https://github.com/ayeujjawalsingh/Databricks/blob/main/Data%20Governance%20(Manage%20Permission)/Users%2C%20Groups%2C%20Access%20Control%2C%20and%20Delta%20Sharing/Readme.md)

### πŸ”Ή Databricks Security
- πŸ“„ [Security Overview](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks-Security/Readme.md)

---

## πŸ”§ Tooling & DevOps

### πŸ”Ή Databricks Tooling
- πŸ“„ [Databricks Job Configuration](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks%20Tooling/Databricks%20Job%20Configuration/Readme.md)
- πŸ“„ [Databricks CLI](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks%20Tooling/Databricks-CLI/Readme.md)
- πŸ“„ [Orchestration and Scheduling with Multi-Task Jobs](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks%20Tooling/Orchestration%20and%20Scheduling%20with%20Multi-Task%20Jobs/Readme.md)
- πŸ“„ [REST API](https://github.com/ayeujjawalsingh/Databricks/blob/main/Databricks%20Tooling/Rest-API/Readme.md)

### πŸ”Ή Testing and Deployment
- πŸ“„ [Testing & CI/CD](https://github.com/ayeujjawalsingh/Databricks/blob/main/Testing%20and%20Deployment/Readme.md)

---
## πŸ“Š BI & Data Visualization

### πŸ”Ή BI Tools Integration
- πŸ“„ [BI Tools Integration Overview](https://github.com/ayeujjawalsingh/Databricks/blob/main/BI%20Tools%20Integration/Readme.md)

---

## 🧩 Advanced Patterns

### πŸ”Ή High Availability and Disaster Recovery
- πŸ“„ [HA and DR](https://github.com/ayeujjawalsingh/Databricks/blob/main/High%20Availability%20%20and%20Disaster%20Recovery/Readme.md)

---