https://github.com/alexott/databricks-playground
Code samples, etc. for Databricks
https://github.com/alexott/databricks-playground
databricks pyspark
Last synced: 4 months ago
JSON representation
Code samples, etc. for Databricks
- Host: GitHub
- URL: https://github.com/alexott/databricks-playground
- Owner: alexott
- Created: 2020-10-29T08:25:50.000Z (about 5 years ago)
- Default Branch: main
- Last Pushed: 2025-05-30T12:32:24.000Z (7 months ago)
- Last Synced: 2025-05-30T17:24:28.597Z (7 months ago)
- Topics: databricks, pyspark
- Language: Python
- Homepage:
- Size: 202 KB
- Stars: 64
- Watchers: 4
- Forks: 18
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
This repository contains different code samples & other examples related to the Databricks platform & Spark:
* [airflow-dags](airflow-dags) - Examples of Airflow DAGs for Databricks.
* [database-diagram-builder](database-diagram-builder) - tool to generate UML diagram(s) for tables in Databricks/Spark database.
* [dbconnect-maven](dbconnect-maven) - skeleton of the Maven project for simple Spark job, and instructions on how to run it via databricks-connect.
* [dbconnect-package-versions-check](dbconnect-package-versions-check) - tool to checks compatibility of local Databricks connect environment with Databricks cluster.
* [dbconnect-sbt](dbconnect-sbt) - skeleton of the SBT project for simple Spark job, and instructions on how to run it via databricks-connect.
* [dbsql-with-aad-token](dbsql-with-aad-token) - example of querying data on Databricks using [python-sql-connector](https://docs.databricks.com/dev-tools/python-sql-connector.html) library. Authentication to Databricks is performed using Azure Active Directory tokens issued for Azure Service Principal.
* [dbutils-in-jar](dbutils-in-jar) - example of using Databricks dbutils in JVM-based code compiled into `.jar`.
* [ip-access-list-analyzer](ip-access-list-analyzer) - analyzer/fix tool for Databricks IP Access Lists.
* [kafka-eventhubs-aad-auth](kafka-eventhubs-aad-auth) - library to support Azure Active Directory authentication for Spark Kafka & EventHubs connectors accessing Event Hubs.
* [pyspark-snippets](pyspark-snippets) - functions that simplify development of PySpark code for Databricks
* [simba-jdbc-aad-token](simba-jdbc-aad-token) - example of querying data on Databricks using JDBC protocol. Authentication to Databricks is performed using Azure Active Directory tokens issued for Azure Service Principal.
* [spring-jdbc-dbsql](spring-jdbc-dbsql) - example of querying Databricks via JDBC using Spring JDBC.
You can also find more examples of Spark code in the other repositories:
* [spark-playground](https://github.com/alexott/spark-playground) - general Spark code
* [cassandra-dse-playground](https://github.com/alexott/cassandra-dse-playground) - Spark code specific for use with Cassandra/DataStax Enterprise (DSE)