https://github.com/rbhatia46/dbscan-on-spark
Small example demonstrating how to run DBSCAN per group (via a groupby) in a distributed manner across multiple worker nodes, using a pandas UDF in Spark.
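
The pattern named in the description can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the column names (`group_id`, `x`, `y`), the output schema, and the DBSCAN parameters (`eps=0.5`, `min_samples=2`) are all assumptions made for the example. It uses `groupBy(...).applyInPandas(...)`, which needs Spark 3.0 or later with PyArrow and scikit-learn available on the workers; the repository may instead use the older `pandas_udf` grouped-map decorator.

```python
import pandas as pd
from sklearn.cluster import DBSCAN

# Hypothetical column names and schema, chosen for illustration only.
GROUP_COL = "group_id"
FEATURE_COLS = ["x", "y"]
OUTPUT_SCHEMA = "group_id string, x double, y double, cluster long"


def cluster_group(pdf: pd.DataFrame) -> pd.DataFrame:
    """Run DBSCAN on the rows of one group, received as a pandas DataFrame."""
    # eps and min_samples are assumed values; tune them for real data.
    model = DBSCAN(eps=0.5, min_samples=2)
    labels = model.fit_predict(pdf[FEATURE_COLS].to_numpy())
    out = pdf[[GROUP_COL] + FEATURE_COLS].copy()
    out["cluster"] = labels.astype("int64")  # -1 marks noise points
    return out


def cluster_per_group(spark_df):
    """Apply cluster_group to each group of a Spark DataFrame.

    Spark sends each group to a worker as a pandas DataFrame, so groups
    are clustered independently and in parallel.
    """
    return spark_df.groupBy(GROUP_COL).applyInPandas(
        cluster_group, schema=OUTPUT_SCHEMA
    )
```

Given an active `SparkSession` and a Spark DataFrame `sdf` with these columns, `cluster_per_group(sdf)` returns a new DataFrame with a `cluster` column. Cluster labels are only meaningful within a group, since each group is fitted separately, and every group must fit in a single worker's memory.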
- Host: GitHub
- URL: https://github.com/rbhatia46/dbscan-on-spark
- Owner: rbhatia46
- Created: 2020-08-20T20:03:42.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2020-08-20T20:43:34.000Z (about 5 years ago)
- Last Synced: 2025-03-25T00:07:11.438Z (7 months ago)
- Language: Jupyter Notebook
- Size: 187 KB
- Stars: 4
- Watchers: 3
- Forks: 1
- Open Issues: 0