An open API service indexing awesome lists of open source software.

https://github.com/endymecy/algorithmsonspark

Some popular algorithms(dbscan,knn,fm etc.) on spark
https://github.com/endymecy/algorithmsonspark

dbscan factorization-machines knn spark

Last synced: about 1 year ago
JSON representation

Some popular algorithms(dbscan,knn,fm etc.) on spark

Awesome Lists containing this project

README

          

# Distributed Algorithms On Spark

This project implement some popular algorithms on spark.You can read the papers of them to see their details.

Currently it support the following algorithms and I will add some other algorithms in the future.

- Distributed KNN
- Down Sampling
- Over Sampling
- Affinity Propagation
- Distributed t-SNE
- Factorization Machines
- Multi-view Machines
- Block Structures Factorization Machines
- Timeseries models
- DBSCAN

This project support spark 2.x

## reference

- https://github.com/viirya/SparkAffinityPropagation
- https://github.com/saurfang/spark-tsne
- https://github.com/cloudml/zen
- https://github.com/sryza/spark-timeseries
- https://github.com/irvingc/dbscan-on-spark
- http://mlwiki.org/index.php/Metric_Trees