https://github.com/ossinova/awesome-databricks

A collection of useful and awesome Databricks resources


# Awesome Databricks
[![Links](https://github.com/ossinova/awesome-databricks/actions/workflows/links.yml/badge.svg)](https://github.com/ossinova/awesome-databricks/actions/workflows/links.yml) [![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome)

## Contents
- [Awesome Databricks](#awesome-databricks)
- [Technology](#technology)
  - [Apache Spark](#apache-spark)
  - [Delta Lake](#delta-lake)
  - [Delta Sharing](#delta-sharing)
  - [MLflow](#mlflow)
  - [Koalas](#koalas)
- [Cloud](#cloud)
  - [AWS](#aws)
  - [Azure](#azure)
  - [GCP](#gcp)
- [External Resources](#external-resources)
  - [Blogs](#blogs)
  - [YouTube](#youtube)
  - [Repos](#repos)
  - [Influencers](#influencers)
  - [Courses](#courses)
  - [Books](#books)
- [Internal Resources](#internal-resources)
  - [Databricks Blog](#databricks-blog)
  - [Databricks Documentation](#databricks-documentation)
  - [Databricks Certifications](#databricks-certifications)
  - [Databricks Academy](#databricks-academy)
  - [Databricks Community Edition](#databricks-community-edition)
  - [Databricks Community Forum](#databricks-community-forum)
  - [Databricks Demos](#databricks-demos)
  - [Databricks Release Notes](#databricks-release-notes)
- [Contributing](#contributing)
- [License](#license)

## Technology
### Apache Spark
* [Apache Spark](https://spark.apache.org/) - Official site
* [Docs](https://spark.apache.org/docs/latest/) - Documentation
* [Awesome Spark](https://github.com/awesome-spark/awesome-spark#readme) - Awesome list for Apache Spark
### Delta Lake
* [Delta Lake](https://delta.io/) - Official site
* [Docs](https://docs.delta.io/latest/index.html) - Documentation
### Delta Sharing
* [Delta Sharing](https://delta.io/sharing/) - Official site
* [Docs](https://docs.databricks.com/en/data-sharing/index.html) - Documentation
### MLflow
* [MLflow](https://mlflow.org/) - Official site
* [Docs](https://mlflow.org/docs/latest/index.html) - Documentation
### Koalas
* [Koalas](https://koalas.readthedocs.io/en/latest/index.html) - Official site
* [Docs](https://koalas.readthedocs.io/en/latest/user_guide/index.html) - Documentation
---

## Cloud
### AWS
* [Awesome AWS](https://github.com/donnemartin/awesome-aws#readme) - Awesome AWS list
### Azure
* [Awesome Azure](https://github.com/dem108/Awesome-Azure-Advanced-Analytics) - Awesome Azure (Advanced Analytics) list
* [Awesome Azure Databricks](https://github.com/tfayyaz/awesome-azure-databricks) - Awesome Azure Databricks list
### GCP
* [Awesome GCP](https://github.com/GoogleCloudPlatform/awesome-google-cloud) - Awesome Google Cloud list

---

## External Resources
### Blogs
* [Medium](https://medium.com/tag/databricks) - Databricks-tagged blog articles
* [Towards Data Science](https://towardsdatascience.com/tagged/databricks) - Databricks-tagged blog articles
* [LinkedIn](https://www.linkedin.com/search/results/content/?contentType=%22documents%22&heroEntityKey=urn%3Ali%3Aorganization%3A3477522&keywords=databricks&origin=FACETED_SEARCH&position=0&searchId=a66016f8-2e52-4aaa-871b-660fdab798fd&sid=dPV) - Databricks-tagged posts that include documents
### YouTube
* [Databricks](https://www.youtube.com/@Databricks) - Official Databricks YouTube channel
* [Advancing Analytics](https://www.youtube.com/@AdvancingAnalytics) - Videos on Spark, Databricks, Azure, and more
### Repos
* [Delta Lake](https://github.com/delta-io/delta) - Storage layer with ACID transactions.
* [quinn](https://github.com/mrpowers/quinn) - PySpark methods to enhance developer productivity.
* [chispa](https://github.com/mrpowers/chispa) - PySpark test helper methods with beautiful error messages
* [mack](https://github.com/mrpowers/mack) - Delta Lake helper methods in PySpark
* [spark-style-guide](https://github.com/mrpowers/spark-style-guide) - Spark style guide
* [scala-style-guide](https://github.com/databricks/scala-style-guide) - Databricks Scala Coding Style Guide
* [nutter](https://github.com/microsoft/nutter) - Testing framework for Databricks notebooks
### Influencers
People to follow on LinkedIn:
* [Ali Ghodsi](https://www.linkedin.com/in/alighodsi/) - Databricks Co-founder & CEO
* [Matthew Powers](https://www.linkedin.com/in/matthew-powers-cfa/) - Databricks Developer Advocate & creator of quinn, mack, chispa
* [Denny Lee](https://www.linkedin.com/in/dennyglee/) - Databricks Developer Advocate & Delta Lake expert
* [Oscar Dyremyhr](https://www.linkedin.com/in/oscar-dyremyhr/) - Databricks Champion & creator of awesome-databricks
* [Simon Whiteley](https://www.linkedin.com/in/simon-whiteley-uk/) - Public Speaker & creator of Advancing Analytics YT content
* [Youssef Mrini](https://www.linkedin.com/in/yousseefmrini/) - Databricks Solution Engineer & blogger
### Courses

TBD

### Books

TBD

---

## Internal Resources
### Databricks Blog

* [Official Blog](https://www.databricks.com/blog/)

Recent blog posts:
* [Introducing Databricks Assistant for Help](https://www.databricks.com/blog/introducing-databricks-assistant-help)
* [Data Intelligence Platforms](https://www.databricks.com/blog/what-is-a-data-intelligence-platform)
* [Lakehouse as a Common Language Across The Enterprise](https://www.databricks.com/blog/lakehouse-common-language-across-enterprise)

### Databricks Documentation

Databricks offers comprehensive documentation for all three major supported cloud vendors:
* [AWS](https://docs.databricks.com/en/index.html)
* [Azure](https://learn.microsoft.com/en-us/azure/databricks/)
* [GCP](https://docs.gcp.databricks.com/en/index.html)

### Databricks Certifications

Databricks offers a variety of certifications for Data Engineers, Data Analysts, and ML/Data Scientists.

* [All Certifications](https://www.databricks.com/learn/certification)
* [Data Analyst](https://www.databricks.com/learn/certification#data-analyst)
* [Data Engineer](https://www.databricks.com/learn/certification#data-engineer)
* [ML/Data Scientist](https://www.databricks.com/learn/certification#data-scientist)
* [Specialty Badges](https://www.databricks.com/learn/certification#specialty)
* [FAQ](https://www.databricks.com/learn/certification/faq)

### Databricks Academy

* [Academy Learning Platform](https://www.databricks.com/learn/training/home)
* [Learning paths](https://www.databricks.com/learn/training/learning-paths)

**Mobile App**: Download [Go.Learn for Android](https://play.google.com/store/apps/details?id=com.docebo.eolo.staging&hl=en&gl=US&pli=1) or [Go.Learn for iOS](https://apps.apple.com/us/app/go-learn/id1362542319)

### Databricks Community Edition

[Databricks Community Edition](https://docs.databricks.com/en/getting-started/community-edition.html) is a limited but free version of Databricks for personal and training use.

### Databricks Community Forum

A [community forum](https://www.databricks.com/discover/introduction-to-data-analysis-workshop-series#:~:text=Customer%20Support-,Community,-Training%20and%20Certification), similar to Stack Overflow, dedicated to Databricks-related questions.

### Databricks Demos

Databricks offers [dbdemos](https://www.databricks.com/resources/demos/tutorials), a one-line installation tool. Install demos in your workspace to quickly access best practices for data ingestion, governance, security, data science and data warehousing.

Categories:
* [Lakehouse Platform](https://www.databricks.com/resources/demos/tutorials#lakehouse-platform)
* [Data Engineering](https://www.databricks.com/resources/demos/tutorials#data-engineering)
* [Data Science and AI](https://www.databricks.com/resources/demos/tutorials#data-science%20and%20ai)
* [Data Warehousing and BI](https://www.databricks.com/resources/demos/tutorials#data-warehouse%20and%20bi)
* [Governance and Unity Catalog](https://www.databricks.com/resources/demos/tutorials#governance-and%20unity%20catalog)

### Databricks Release Notes

Follow the roadmap and latest news by subscribing to or watching the [Databricks Release Notes](https://docs.databricks.com/en/release-notes/index.html).

Topics:
* [Platform](https://docs.databricks.com/en/release-notes/product/index.html)
* [Runtime](https://docs.databricks.com/en/release-notes/runtime/index.html)
* [Databricks SQL](https://docs.databricks.com/en/sql/release-notes/index.html)
* [Developer Tools](https://docs.databricks.com/en/release-notes/dev-tools/index.html)
* [Databricks Connect](https://docs.databricks.com/en/release-notes/dbconnect/index.html)
* [Delta Live Tables](https://docs.databricks.com/en/release-notes/delta-live-tables/index.html)

---

## Contributing
Contributions are always welcome; just follow [the rules](https://github.com/ossinova/awesome-databricks/blob/master/CONTRIBUTING.md)!

## License
This work is licensed under a Creative Commons Attribution 4.0 International License.