https://github.com/yokawasa/databricks-notebooks
Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )
- Host: GitHub
- URL: https://github.com/yokawasa/databricks-notebooks
- Owner: yokawasa
- License: mit
- Created: 2018-06-04T11:00:17.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2018-06-16T23:46:05.000Z (over 6 years ago)
- Last Synced: 2024-04-17T14:28:50.975Z (10 months ago)
- Topics: azure, azuredatabricks, databricks, elt, python, spark, streaming
- Language: Jupyter Notebook
- Size: 15.6 KB
- Stars: 75
- Watchers: 9
- Forks: 65
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# databricks-notebooks
Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )
## Sample Notebooks
| Notebook | Description | Lang |
| ------------- | ------------- | ----------- |
| [File Operations Sample](notebooks/file-operations-python.ipynb) | Sample file operations: mounting and unmounting Azure Blob Storage, ls/rm/cp/mv, reading a CSV file, etc. | Python |
| [ELT Sample: Azure Blob Storage - Databricks - CosmosDB](notebooks/elt-blob-storage-cosmosdb-python.ipynb) | In this notebook, you extract data from Azure Blob Storage into a Databricks cluster, run transformations on the data in the cluster, and then load the transformed data into Azure Cosmos DB | Python |
| [ELT Sample: Azure Blob Storage - Databricks - SQLDW](notebooks/elt-blob-storage-sqldw-python.ipynb) | In this notebook, you extract data from Azure Blob Storage into a Databricks cluster, run transformations on the data in the cluster, and then load the transformed data into Azure SQL Data Warehouse | Python |
| [Streaming Sample: Cosmos DB ChangeFeed - Databricks](notebooks/tweet-streaming-cosmosdb-python.ipynb) | In this notebook, you read a live stream of tweets stored in Cosmos DB by using Apache Spark to read the Cosmos DB Change Feed, and run transformations on the data in a Databricks cluster | Python |
| [Streaming Sample: Azure Event Hub - Databricks](notebooks/tweet-streaming-eventhub-python.ipynb) | In this notebook, you connect Azure Event Hubs (a hyper-scale data ingestion service) with Azure Databricks to stream data into an Apache Spark cluster in near real time | Python |

## Contributing
Bug reports and pull requests are welcome on GitHub at https://github.com/yokawasa/databricks-notebooks