Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ossinova/awesome-databricks
A collection of useful and awesome Databricks resources
https://github.com/ossinova/awesome-databricks
List: awesome-databricks
awesome awesome-list databricks guide lists resources
Last synced: 16 days ago
JSON representation
A collection of useful and awesome Databricks resources
- Host: GitHub
- URL: https://github.com/ossinova/awesome-databricks
- Owner: ossinova
- License: cc0-1.0
- Created: 2023-12-21T01:32:06.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2023-12-21T05:33:18.000Z (about 1 year ago)
- Last Synced: 2024-04-21T11:15:20.457Z (8 months ago)
- Topics: awesome, awesome-list, databricks, guide, lists, resources
- Homepage:
- Size: 48.8 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE-OF-CONDUCT.md
Awesome Lists containing this project
- ultimate-awesome - awesome-databricks - A collection of useful and awesome Databricks resources. (Other Lists / PowerShell Lists)
README
# Awesome Databricks
[![Links](https://github.com/ossinova/awesome-databricks/actions/workflows/links.yml/badge.svg)](https://github.com/ossinova/awesome-databricks/actions/workflows/links.yml) [![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome)## Contents
- [Awesome Databricks](#awesome-databricks)
- [Technology](#technology)
- [Apache Spark](#apache-spark)
- [Delta Lake](#delta-lake)
- [Delta Sharing](#delta-sharing)
- [MLflow](#mlflow)
- [Koalas](#koalas)
- [Cloud](#cloud)
- [AWS](#aws)
- [Azure](#azure)
- [GCP](#gcp)
- [External Resources](#external-resources)
- [Blogs](#blogs)
- [YouTube](#youtube)
- [Repos](#repos)
- [Influencers](#influencers)
- [Courses](#courses)
- [Books](#books)
- [Internal Resources](#internal-resources)
- [Databricks Blog](#databricks-blog)
- [Databricks Documentation](#databricks-documentation)
- [Databricks Certifications](#databricks-certifications)
- [Databricks Academy](#databricks-academy)
- [Databricks Community Edition](#databricks-community-edition)
- [Databricks Community Forum](#databricks-community-forum)
- [Databricks Demos](#databricks-demos)
- [Databricks Release Notes](#databricks-release-notes)
- [Contributing](#contributing)
- [License](#license)## Technology
### Apache Spark
* [Apache Spark](https://spark.apache.org/) - Official site
* [Docs](https://spark.apache.org/docs/latest/) - Documentation
* [Awesome Spark](https://github.com/awesome-spark/awesome-spark#readme) - Awesome list for Apache Spark
### Delta Lake
* [Delta Lake](https://delta.io/) - Official site
* [Docs](https://docs.delta.io/latest/index.html) - Documentation
### Delta Sharing
* [Delta Sharing](https://delta.io/sharing/) - Official site
* [Docs](https://docs.databricks.com/en/data-sharing/index.html) - Documentation
### MLflow
* [MLflow](https://mlflow.org/) - Official site
* [Docs](https://mlflow.org/docs/latest/index.html) - Documentation
### Koalas
* [Koalas](https://koalas.readthedocs.io/en/latest/index.html) - Official site
* [Docs](https://koalas.readthedocs.io/en/latest/user_guide/index.html) - Documentation
---## Cloud
### AWS
* [Awesome AWS](https://github.com/donnemartin/awesome-aws#readme) - Awesome AWS list
### Azure
* [Awesome Azure](https://github.com/dem108/Awesome-Azure-Advanced-Analytics) - Awesome Azure (Advanced Analytics) list
* [Awesome Azure Databricks](https://github.com/tfayyaz/awesome-azure-databricks) - Awesome Azure Databricks list
### GCP
* [Awesome GCP](https://github.com/GoogleCloudPlatform/awesome-google-cloud) - Awesome Google Cloud list---
## External Resources
### Blogs
* [Medium](https://medium.com/tag/databricks) - Databricks tagged blog articles
* [Towards Data Science](https://towardsdatascience.com/tagged/databricks) - Databricks tagged blog articles
* [LinkedIn](https://www.linkedin.com/search/results/content/?contentType=%22documents%22&heroEntityKey=urn%3Ali%3Aorganization%3A3477522&keywords=databricks&origin=FACETED_SEARCH&position=0&searchId=a66016f8-2e52-4aaa-871b-660fdab798fd&sid=dPV) - Databricks tagged posts with document
### YouTube
* [Databricks](https://www.youtube.com/@Databricks) - Official Databricks Youtube channel
* [Advancing Analytics](https://www.youtube.com/@AdvancingAnalytics) - You'll find a whole ton of spark, Databricks, Azure and more.
### Repos
* [Delta Lake](https://github.com/delta-io/delta) - Storage layer with ACID transactions.
* [quinn](https://github.com/mrpowers/quinn) - PySpark methods to enhance developer productivity.
* [chispa](https://github.com/mrpowers/chispa) - PySpark test helper methods with beautiful error messages
* [mack](https://github.com/mrpowers/mack) - Delta Lake helper methods in PySpark
* [spark-style-guide](https://github.com/mrpowers/spark-style-guide) - Spark style guide
* [scala-style-guide](https://github.com/databricks/scala-style-guide) - Databricks Scala Coding Style Guide
* [nutter](https://github.com/microsoft/nutter) - Testing framework for Databricks notebooks
### Influencers
People to follow on LinkedIn:
* [Ali Ghodsi](https://www.linkedin.com/in/alighodsi/) - Databricks Founder & CEO
* [Matthew Powers](https://www.linkedin.com/in/matthew-powers-cfa/) - Databricks Developer Advocate & creator of quinn, mack, chispa
* [Danny Lee](https://www.linkedin.com/in/dennyglee/) - Databricks Developer Advocate & deltalake expert
* [Oscar Dyremyhr](https://www.linkedin.com/in/oscar-dyremyhr/) - Databricks Champion & creator of awesome-databricks
* [Simon Whiteley](https://www.linkedin.com/in/simon-whiteley-uk/) - Public Speaker & creator of Advancing Analytics YT content
* [Youssef Mrini](https://www.linkedin.com/in/yousseefmrini/) - Databricks Solution Engineer & blogger
### CoursesTBD
### Books
TBD
---
## Internal Resources
### Databricks Blog* [Official Blog](https://www.databricks.com/blog/)
Recent blog posts:
* [Introducing Databricks Assistant for Help](https://www.databricks.com/blog/introducing-databricks-assistant-help)
* [Data Intelligence Platforms](https://www.databricks.com/blog/what-is-a-data-intelligence-platform)
* [Lakehouse as a Common Language Across The Enterprise](https://www.databricks.com/blog/lakehouse-common-language-across-enterprise)### Databricks Documentation
Databricks offers comprehensive documentation for all three major supported cloud vendors:
* [AWS](https://docs.databricks.com/en/index.html)
* [Azure](https://learn.microsoft.com/en-us/azure/databricks/)
* [GCP](https://docs.gcp.databricks.com/en/index.html)
### Databricks CertificationsDatabricks offers a variety of certifications for Data Engineers, Data Analysts, and ML/Data Scientist.
* [All Certifications](https://www.databricks.com/learn/certification)
* [Data Analyst](https://www.databricks.com/learn/certification#data-analyst)
* [Data Engineer](https://www.databricks.com/learn/certification#data-engineer)
* [ML/Data Scientist](https://www.databricks.com/learn/certification#data-scientist)
* [Speciality Badges](https://www.databricks.com/learn/certification#specialty)
* [FAQ](https://www.databricks.com/learn/certification/faq)### Databricks Academy
* [Academy Learning Platform](https://www.databricks.com/learn/training/home)
* [Learning paths](https://www.databricks.com/learn/training/learning-paths)
**Mobile App**: Download [Go.Learn for Android](https://play.google.com/store/apps/details?id=com.docebo.eolo.staging&hl=en&gl=US&pli=1) or [Go.Learn for iOS](https://apps.apple.com/us/app/go-learn/id1362542319)### Databricks Community Edition
[Databricks Community Edition](https://docs.databricks.com/en/getting-started/community-edition.html) is a limited but free version of Databricks for personal and training use.
### Databricks Community Forum
A [community forum](https://www.databricks.com/discover/introduction-to-data-analysis-workshop-series#:~:text=Customer%20Support-,Community,-Training%20and%20Certification) similar to that of StackOverflow just for Databricks related queries.
### Databricks Demos
Databricks offers [dbdemos](https://www.databricks.com/resources/demos/tutorials), a one-line installation tool. Install demos in your workspace to quickly access best practices for data ingestion, governance, security, data science and data warehousing.
Categories:
* [Lakehouse Platform](https://www.databricks.com/resources/demos/tutorials#lakehouse-platform)
* [Data Engineering](https://www.databricks.com/resources/demos/tutorials#data-engineering)
* [Data Scienece and AI](https://www.databricks.com/resources/demos/tutorials#data-science%20and%20ai)
* [Data warehousing and BI](https://www.databricks.com/resources/demos/tutorials#data-warehouse%20and%20bi)
* [Governance and Unity Catalog](https://www.databricks.com/resources/demos/tutorials#governance-and%20unity%20catalog)### Databricks Release Notes
Follow the roadmap and latest news by subscribing or watching [Databricks Release Notes](https://docs.databricks.com/en/release-notes/index.html)
Topic:
* [Platform](https://docs.databricks.com/en/release-notes/product/index.html)
* [Runtime](https://docs.databricks.com/en/release-notes/runtime/index.html)
* [Databricks SQL](https://docs.databricks.com/en/sql/release-notes/index.html)
* [Developer Tools](https://docs.databricks.com/en/release-notes/dev-tools/index.html)
* [Databricks Connect](https://docs.databricks.com/en/release-notes/dbconnect/index.html)
* [Databricks Live Tables](https://docs.databricks.com/en/release-notes/delta-live-tables/index.html)
---
Contributing
====
Your contributions are always welcome, just follow [the rules](https://github.com/ossinova/awesome-databricks/blob/master/CONTRIBUTING.md)!License
====
This work is licensed under a Creative Commons Attribution 4.0 International License.