Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/JorenSix/TarsosLSH

A Java library implementing practical nearest neighbour search algorithm for multidimensional vectors that operates in sublinear time. It implements Locality-sensitive Hashing (LSH) and multi index hashing for hamming space.

java lsh multi-dimensional-hashing nearest-neighbor-search

Last synced: 05 Jun 2024

https://github.com/ekzhu/datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

data-sketches data-summary hnsw hyperloglog jaccard-similarity locality-sensitive-hashing lsh lsh-ensemble lsh-forest minhash python search top-k weighted-quantiles

Last synced: 05 Jun 2024

https://github.com/davidsvy/Neural-Scam-Artist

Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.

dataset deduplication fine-tuning fraud gpt2 huggingface lsh minhash nlp pytorch readability scam transformer web-scraping

Last synced: 13 May 2024

https://github.com/FALCONN-LIB/FALCONN

FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing)

cosine-similarity falconn fast-lookups locality-sensitive-hashing lsh nearest-neighbor-search sketches

Last synced: 13 May 2024

https://github.com/DRSY/MoTIS

Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP). Accepted at NAACL 2022.

ai clip cross-modal image-search ios-swift k-means k-means-clustering knn knowledge-distillation lsh naacl random-projection retrieval semantic-search vector-search

Last synced: 20 Apr 2024

https://github.com/src-d/minhashcuda

Weighted MinHash implementation on CUDA (multi-gpu).

cuda lsh machine-learning minhash

Last synced: 19 Apr 2024

https://github.com/ritchie46/lsh-rs

Locality Sensitive Hashing in Rust with Python bindings

cosine-similarity l2-distance lsh lsh-algorithm rust

Last synced: 01 Apr 2024