https://github.com/oertl/bagminhash
BagMinHash - Minwise Hashing Algorithm for Weighted Sets
https://github.com/oertl/bagminhash
jaccard-similarity jaccard-similarity-estimation locality-sensitive-hashing minhash minwise-hashing minwise-hashing-algorithm weighted-sets
Last synced: 3 months ago
JSON representation
BagMinHash - Minwise Hashing Algorithm for Weighted Sets
- Host: GitHub
- URL: https://github.com/oertl/bagminhash
- Owner: oertl
- Created: 2018-02-07T09:32:25.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2020-08-26T05:43:40.000Z (about 5 years ago)
- Last Synced: 2025-04-13T16:45:34.366Z (6 months ago)
- Topics: jaccard-similarity, jaccard-similarity-estimation, locality-sensitive-hashing, minhash, minwise-hashing, minwise-hashing-algorithm, weighted-sets
- Language: C++
- Homepage:
- Size: 1.02 MB
- Stars: 26
- Watchers: 2
- Forks: 6
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# BagMinHash - Minwise Hashing Algorithm for Weighted Sets
For details see
* [arXiv preprint](https://arxiv.org/abs/1802.03914)
* [KDD 2018 paper](http://www.kdd.org/kdd2018/accepted-papers/view/bagminhash-minwise-hashing-algorithm-for-weighted-sets)For non-streaming setups there is a faster alternative in the meantime: [TreeMinHash](https://github.com/oertl/treeminhash)