https://github.com/endymecy/algorithmsonspark
Some popular algorithms(dbscan,knn,fm etc.) on spark
https://github.com/endymecy/algorithmsonspark
dbscan factorization-machines knn spark
Last synced: about 1 year ago
JSON representation
Some popular algorithms(dbscan,knn,fm etc.) on spark
- Host: GitHub
- URL: https://github.com/endymecy/algorithmsonspark
- Owner: endymecy
- Created: 2016-10-17T07:32:15.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2018-05-29T03:31:14.000Z (about 8 years ago)
- Last Synced: 2025-03-25T21:12:33.222Z (about 1 year ago)
- Topics: dbscan, factorization-machines, knn, spark
- Language: Scala
- Homepage: https://github.com/endymecy/AlgorithmsOnSpark
- Size: 123 KB
- Stars: 32
- Watchers: 7
- Forks: 16
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Distributed Algorithms On Spark
This project implement some popular algorithms on spark.You can read the papers of them to see their details.
Currently it support the following algorithms and I will add some other algorithms in the future.
- Distributed KNN
- Down Sampling
- Over Sampling
- Affinity Propagation
- Distributed t-SNE
- Factorization Machines
- Multi-view Machines
- Block Structures Factorization Machines
- Timeseries models
- DBSCAN
This project support spark 2.x
## reference
- https://github.com/viirya/SparkAffinityPropagation
- https://github.com/saurfang/spark-tsne
- https://github.com/cloudml/zen
- https://github.com/sryza/spark-timeseries
- https://github.com/irvingc/dbscan-on-spark
- http://mlwiki.org/index.php/Metric_Trees