https://github.com/ingenii-solutions/azure-data-platform-databricks-runtime
Python package and custom runtime to use in Azure Databricks as part of Ingenii's Data Platform
https://github.com/ingenii-solutions/azure-data-platform-databricks-runtime
azure data-platform
Last synced: 2 months ago
JSON representation
Python package and custom runtime to use in Azure Databricks as part of Ingenii's Data Platform
- Host: GitHub
- URL: https://github.com/ingenii-solutions/azure-data-platform-databricks-runtime
- Owner: ingenii-solutions
- License: mpl-2.0
- Created: 2021-08-04T08:34:46.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2025-08-15T09:51:06.000Z (8 months ago)
- Last Synced: 2025-08-15T11:23:30.828Z (8 months ago)
- Topics: azure, data-platform
- Language: Python
- Size: 277 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project
README
# Ingenii Databricks Platform
[](https://ingenii.dev/)
[](https://github.com/ingenii-solutions/terraform-azurerm-key-vault/blob/main/LICENSE)
[](https://github.com/ingenii-solutions/data-platform-databricks-runtime/blob/main/CONTRIBUTING.md)
## Details
* Base image: [databricksruntime/standard:9.x](https://hub.docker.com/layers/databricksruntime/standard/9.x/images/sha256-cb414c7ab3c18e529b5e9cada0af996d8912ea7c3ea8087c68da0bb2768c03ab?context=explore)
* Registry: ingeniisolutions
* Repository: databricks-runtime
* Version: 0.7.1
### Intermediate Images
* Base OS Repository: databricks-runtime-base-os
* Base OS Version: 0.2.0
* Base Python Repository: databricks-runtime-base-python
* Base Python Version: 0.2.0
## Overview
This image is used with Databricks' [Container Services](https://docs.databricks.com/clusters/custom-containers.html) to customise the cluster runtime in the engineering cluster of in the [Ingenii Data Platform](https://ingenii.dev/). This contains an installation of [dbt](https://www.getdbt.com/) and [Ingenii's python package for data engineering](https://github.com/ingenii-solutions/azure-data-platform-data-engineering).
## Data Pipeline Overview
For an overview of the data pipeline and the stages it goes through, please refer to the [Data Pipeline documentation](docs/user/DATAPIPELINE.md)
## dbt Integration
For reading files and testing data we use [dbt](https://www.getdbt.com/) as a framework. For an explanation on how we use dbt and how to set up your own data sources, please refer to the [Ingenii Data Engineering Example repository](https://github.com/ingenii-solutions/azure-data-platform-data-engineering-example).
## Contributions
- [dbt-spark](https://github.com/dbt-labs/dbt-spark) - [retrying when any connections fail](https://github.com/dbt-labs/dbt-spark/pull/194)