An open API service indexing awesome lists of open source software.

https://github.com/oertl/bagminhash

BagMinHash - Minwise Hashing Algorithm for Weighted Sets
https://github.com/oertl/bagminhash

jaccard-similarity jaccard-similarity-estimation locality-sensitive-hashing minhash minwise-hashing minwise-hashing-algorithm weighted-sets

Last synced: 3 months ago
JSON representation

BagMinHash - Minwise Hashing Algorithm for Weighted Sets

Awesome Lists containing this project

README

          

# BagMinHash - Minwise Hashing Algorithm for Weighted Sets

For details see

* [arXiv preprint](https://arxiv.org/abs/1802.03914)
* [KDD 2018 paper](http://www.kdd.org/kdd2018/accepted-papers/view/bagminhash-minwise-hashing-algorithm-for-weighted-sets)

For non-streaming setups there is a faster alternative in the meantime: [TreeMinHash](https://github.com/oertl/treeminhash)