https://github.com/thaitechtales/databricks
This repository is dedicated to showcasing projects built on Databricks, focusing on big data analytics, data engineering, and machine learning workflows.
https://github.com/thaitechtales/databricks
apache-spark big-data cloud-data-platform data-analytics data-engineering databricks databricks-notebooks etl machine-learning
Last synced: about 2 months ago
JSON representation
This repository is dedicated to showcasing projects built on Databricks, focusing on big data analytics, data engineering, and machine learning workflows.
- Host: GitHub
- URL: https://github.com/thaitechtales/databricks
- Owner: ThaiTechTales
- Created: 2025-01-04T06:52:59.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-02-10T06:08:01.000Z (over 1 year ago)
- Last Synced: 2025-03-06T18:50:35.669Z (over 1 year ago)
- Topics: apache-spark, big-data, cloud-data-platform, data-analytics, data-engineering, databricks, databricks-notebooks, etl, machine-learning
- Homepage:
- Size: 25.4 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Databricks Projects Repository
## Purpose
This repository is dedicated to showcasing projects built on Databricks, focusing on big data analytics, data engineering, and machine learning workflows. It contains notebooks, scripts, and other resources that demonstrate the capabilities of Databricks in managing and processing large datasets.
## Intention
- To explore the capabilities of Databricks in managing and processing large datasets.
- To build end-to-end data engineering pipelines and demonstrate expertise in Spark and Delta Lake.
- To provide examples of using Databricks for AI and ML model training and deployment.
## Projects
- Notebooks: Databricks notebooks for learning queries, writing data, transformations, functions, and data processing.