https://github.com/Azure/azure-cosmosdb-spark

Apache Spark Connector for Azure Cosmos DB

apache-spark azure-cosmos-db azure-databricks changefeed connector cosmos-db databricks databricks-notebooks jupyter-notebook lambda-architecture pyspark spark

# Azure Cosmos DB Spark Connector for Spark 3

## NOTE: The connector for Spark 2.4 is no longer supported.

The Azure Cosmos DB Spark Connector for Spark 2.4 has been deprecated and is no longer supported. Start new applications on the Spark 3 connector, and migrate existing applications to it.
Use these links to get started building new applications with the Azure Cosmos DB Spark Connector or to migrate your existing Spark 2.4 applications.
- Download the [Azure Cosmos DB Spark 3 Connector source code](https://github.com/Azure/azure-sdk-for-java/tree/main/sdk/cosmos/azure-cosmos-spark_3_2-12).
- Use the [Spark 3 quick start introduction](https://aka.ms/azure-cosmos-spark-3-quickstart) to get a sample running quickly.
- Use the [Spark 3 End-to-end samples](https://aka.ms/azure-cosmos-spark-3-sample-nyc-taxi-data/01_Batch.ipynb) to see the full set of features.
- Keep the [Spark 3 Config Reference](https://aka.ms/azure-cosmos-spark-3-config) handy as a reference when tuning the connector.
- The [Spark 2.4 to Spark 3 migration guide](https://aka.ms/azure-cosmos-spark-3-migration) can help you upgrade your applications.
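
As a rough orientation before following the links above, here is a minimal PySpark sketch of what a read through the Spark 3 connector typically looks like. It assumes the `cosmos.oltp` data source name and the `spark.cosmos.*` option keys documented in the config reference; check the exact names against that reference for your connector version. The helper names `cosmos_config` and `read_container`, and all account values, are hypothetical placeholders.

```python
from typing import Dict


def cosmos_config(endpoint: str, key: str, database: str, container: str) -> Dict[str, str]:
    """Build the option map passed to the Spark 3 connector (hypothetical helper)."""
    return {
        "spark.cosmos.accountEndpoint": endpoint,
        "spark.cosmos.accountKey": key,
        "spark.cosmos.database": database,
        "spark.cosmos.container": container,
    }


def read_container(spark, cfg: Dict[str, str]):
    """Read a container into a DataFrame.

    `spark` is an existing SparkSession that already has the
    azure-cosmos-spark connector JAR on its classpath (an assumption
    of this sketch; nothing here installs it).
    """
    return spark.read.format("cosmos.oltp").options(**cfg).load()


# Placeholder values; substitute your own account details.
cfg = cosmos_config(
    endpoint="https://<account>.documents.azure.com:443/",
    key="<account-key>",
    database="SampleDB",
    container="SampleContainer",
)
```

In a notebook with the connector attached, `read_container(spark, cfg).show()` would then display the container's documents. Keeping the options in one dictionary makes it easier to compare them against the migration guide when porting a Spark 2.4 job.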