Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/yokawasa/databricks-notebooks

Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )
https://github.com/yokawasa/databricks-notebooks

azure azuredatabricks databricks elt python spark streaming

Last synced: 3 months ago
JSON representation

Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )

Awesome Lists containing this project

README

        

# databricks-notebook

Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )

## Sample Notebooks

| Notebook | Description | Lang
| ------------- | ------------- | ----------- |
| [File Operations Sample](notebooks/file-operations-python.ipynb) | Various file operations sample such as Azure Blob Storage mount & umount, ls/rm/cp/mv, read CSV file, etc| Python |
| [ELT Sample: Azure Blob Stroage - Databricks - CosmosDB](notebooks/elt-blob-storage-cosmosdb-python.ipynb)| In this notebook, you extract data from Azure Blob Storage into Databricks cluster, run transformations on the data in Databricks cluster, and then load the transformed data into Azure Cosmos DB | Python |
| [ELT Sample: Azure Blob Stroage - Databricks - SQLDW](notebooks/elt-blob-storage-sqldw-python.ipynb) | In this notebook, you extract data from Azure Blob Storage into Databricks cluster, run transformations on the data in Databricks cluster, and then load the transformed data into Azure SQL Data Warehouse | Python |
| [Streaming Sample: Cosmos DB ChangeFeed - Databricks](notebooks/tweet-streaming-cosmosdb-python.ipynb) | In this notebook, you read a live stream of tweets that stored in Cosmos DB by leveraging Apache Spart to read the Cosmos DB's Change Feed, and run transformations on the data in Databricks cluster | Python |
| [Streaming Sample: Azure Event Hub - Databricks](notebooks/tweet-streaming-eventhub-python.ipynb) | In this notebook, you connect Azure Event Hub (hyper-scale data ingestion service) with Azure Databricks to stream data into an Apache Spark cluster in near real-time | Python |

## Contributing

Bug reports and pull requests are welcome on GitHub at https://github.com/yokawasa/databricks-notebook