An open API service indexing awesome lists of open source software.

https://github.com/databricks/notebook-best-practices

An example showing how to apply software engineering best practices to Databricks notebooks.
https://github.com/databricks/notebook-best-practices

Last synced: about 1 month ago
JSON representation

An example showing how to apply software engineering best practices to Databricks notebooks.

Awesome Lists containing this project

README

          

# Software engineering best practices for Databricks notebooks

This repository is a companion for the example article "Software engineering best practices for Databricks notebooks" ([AWS](https://docs.databricks.com/notebooks/best-practices.html) | [Azure](https://docs.microsoft.com/azure/databricks/notebooks/best-practices) | [GCP](https://docs.gcp.databricks.com/notebooks/best-practices.html)).

Going through the example, you will:

* Add notebooks to Databricks Repos for version control.
* Extracts portions of code from one of the notebooks into a shareable component.
* Test the shared code.
* Automatically run notebooks in git on a schedule using a Databricks job.
* Optionally, apply CI/CD to the notebooks and the shared code.

The example is hands-on. We recommend working it step-by-step to learn how to apply these techniques to your own Databricks notebooks.