{"id":18400687,"url":"https://github.com/databricks/databricks-asset-bundles-dais2023","last_synced_at":"2025-09-22T17:31:39.122Z","repository":{"id":182848093,"uuid":"657223604","full_name":"databricks/databricks-asset-bundles-dais2023","owner":"databricks","description":null,"archived":false,"fork":false,"pushed_at":"2024-07-15T15:12:09.000Z","size":1040,"stargazers_count":50,"open_issues_count":11,"forks_count":56,"subscribers_count":4,"default_branch":"main","last_synced_at":"2025-01-07T08:10:25.151Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/databricks.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":".github/support/databricks/__init__.py","governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-06-22T15:27:48.000Z","updated_at":"2024-12-14T20:59:35.000Z","dependencies_parsed_at":"2024-11-06T03:14:30.426Z","dependency_job_id":null,"html_url":"https://github.com/databricks/databricks-asset-bundles-dais2023","commit_stats":null,"previous_names":["databricks/databricks-asset-bundles-dais2023"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/databricks%2Fdatabricks-asset-bundles-dais2023","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/databricks%2Fdatabricks-asset-bundles-dais2023/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/databricks%2Fdatabricks-asset-bundles-dais2023/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/databricks%2Fdatabricks-asset-bundles-dais2023/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/databricks","download_url":"https://codeload.github.com/databricks/databricks-asset-bundles-dais2023/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":233869029,"owners_count":18743094,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-06T02:36:02.514Z","updated_at":"2025-09-22T17:31:38.587Z","avatar_url":"https://github.com/databricks.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Databricks Asset Bundles\n\n_Write code once, deploy to lakehouses everywhere..._\n\n\u003cimg src='./data/bundle.jpeg' width=400\u003e\n\n_Click [here](https://www.youtube.com/watch?v=9HOgYVo-WTM) to watch the talk on Databricks Asset Bundles at Data \u0026 AI Summit 2023._\n\n_The slides from the presentation are available [here](https://docs.google.com/presentation/d/1bnnTR19j_nZhB0bDCMoGga-8Sq6eBjhBAom-6NJ6F0I/edit?usp=sharing)._\n\n## Introduction\n_Databricks Asset Bundles_, also known simply as bundles, enable you to programmatically validate, deploy, and run the projects you are working on in Databricks via the [Databricks CLI](https://github.com/databricks/cli).  A bundle is a collection of one or more related files that contain:\n\n* Any local artifacts (such as source code) to deploy to a remote Databricks workspace prior to running any related Databricks workflows.\n\n* The declarations and settings for the Databricks jobs, Delta Live Tables pipelines, or [MLOps Stacks](https://github.com/databricks/mlops-stack) that act upon the artifacts that were deployed into the workspace.\n\nFor more information on bundles, please see the following pages in Databricks documentation:\n\n#### Tutorials\n* [Bundle development tasks](https://docs.databricks.com/dev-tools/bundles/work-tasks.html)\n* [How to use Bundles with Databricks Workflows (aka Jobs)](https://docs.databricks.com/workflows/jobs/how-to/use-bundles-with-jobs.html)\n* [Automate a DLT pipeline with DABs](https://docs.databricks.com/delta-live-tables/tutorial-bundles.html)\n* [Run A CI/CD process with DABs and GitHub Actions](https://docs.databricks.com/dev-tools/bundles/ci-cd.html)\n\n#### Reference \n* [bundle settings reference](https://docs.databricks.com/dev-tools/bundles/settings.html)\n* [bundle command group reference](https://docs.databricks.com/dev-tools/cli/bundle-commands.html)\n\n## Analyzing Databricks Medium Posts from Field Engineering\nIn this repo you'll find a simple project consisting of:\n\n1. A CSV containing URLs of Medium.com blogs written by Field Engineers at Databricks.\n2. A Delta Live Tables pipeline to ingest and process that data, including logic to scrape Medium.com for the number of claps and reading time.\n3. A notebook report that reads the processed data and visualizes it.\n4. A Databricks Workflow with two tasks - the first to refresh the DLT pipeline and the second to execute the notebook report.\n\nThese data assets are represented in the `bundle.yml` file in the project root directory.  \n\n#### Deploying and running this repo\nMake sure you have the Databricks CLI installed, then you can use the `databricks bundle` commands.  You'll also want to edit the `bundle.yml` and specify the Databricks Workspace that you plan to deploy to.  Once you've got that sorted out, you can deploy and run the project using the following commands:\n\n```\ndatabricks bundle deploy\ndatabricks bundle run fe_medium_metrics\n```\n\n## Questions?\nPlease email dabs-preview@databricks.com if you have questions on DABs or if you have questions on the code in this repo, please email rafi.kurlansik@databricks.com\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdatabricks%2Fdatabricks-asset-bundles-dais2023","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdatabricks%2Fdatabricks-asset-bundles-dais2023","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdatabricks%2Fdatabricks-asset-bundles-dais2023/lists"}