awesome-vector-search
A collection of vector-search-related libraries, services, and research papers
https://github.com/currentslab/awesome-vector-search
-
Awesome Vector Search Engine
-
Cloud Service
- Zilliz Cloud - Cloud-native service for Milvus
- Relevance AI - Vector Platform From Experimentation To Deployment
- MyScale - A managed vector database based on ClickHouse
- Epsilla Cloud - A fully managed serverless vector database, billed as 10X faster, cheaper and better.
- DataStax Astra Vector - Multi-cloud, serverless vector DBaaS
-
Library
- ScaNN - A library for efficient vector similarity search at scale.
- FLANN - A library written in C++ with bindings for C, MATLAB, Python, and Ruby
- LangStream - LangStream is an open-source project that combines the best of event-based architectures with the latest Gen AI technologies.
- CassIO - CassIO is the ultimate solution for seamlessly integrating Apache Cassandra® with generative artificial intelligence and other machine learning workloads
- JVector - A pure Java, zero dependency, embedded vector search engine used by some of the advanced distributed databases such as DataStax Astra DB & Apache Cassandra™
- Faiss - A library for efficient similarity search and clustering of dense vectors
- Distributed Faiss - Work with FAISS indexes which don't fit into a single server memory
- Autofaiss - Automatically create Faiss knn indices
- NMSLIB - Non-Metric Space Library, an efficient similarity search library for generic non-metric spaces
- Annoy - C++ library with Python bindings to search for points in space that are close to a given query point
- LLM App - Open-source Python library for real-time KNN (K-Nearest Neighbors) data indexing
- MRPT - Fast nearest neighbor search with random projection
- RPForest - Python library for approximate nearest neighbours search
- pgvector - Open-source vector similarity search extension for Postgres
- PASE - Ultra-High-Dimensional approximate nearest neighbor search extension for Postgres
- Pyserini - Toolkit for reproducible information retrieval research with sparse and dense representations
- NGT - Provides commands and a library for performing high-speed approximate nearest neighbor searches
- NearPy - Approximate search using different locality-sensitive hashing methods
- TOROS N2 - Lightweight approximate nearest neighbor library
- PUFFINN - Parameterless and Universal Fast FInding of Nearest Neighbors
- SPTAG - A distributed approximate nearest neighborhood search (ANN) library
- PyNNDescent - A Python implementation of nearest neighbor descent for approximate k-nearest neighbors
- TarsosLSH - A Java library implementing a practical nearest neighbour search algorithm for multidimensional vectors
- TorchPQ - Efficient implementations of Product Quantization and its variants using PyTorch and CUDA
- Granne - Graph-based retrieval of approximate nearest neighbors written in Rust
- Hora - Efficient approximate nearest neighbor search algorithm collections library written in Rust
- Voy - A WASM vector similarity search engine written in Rust
- Chroma - The open-source embedding database for building LLM apps in Python or JavaScript with memory
- USearch - Smaller & Faster Vector Search Engine for C++, Python, JavaScript, Rust, Java, GoLang, Wolfram
- Golang vector stores collection - Chroma, PGVector interfaces
- Scalable Vector Search (SVS) - A performance library for vector similarity search
-
Research Papers
- SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search - NeurIPS 2021
- Revisiting the Inverted Indices for Billion-Scale Approximate Nearest Neighbors - ECCV 2018
- Accelerating Large-Scale Inference with Anisotropic Vector Quantization
- Billion-scale similarity search with GPUs
- Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs
- Optimization of Indexing Based on k-Nearest Neighbor Graph for Proximity Search in High-dimensional Data
- On Approximately Searching for Similar Word Embeddings - ACL 2016
- 
- Marqo - A semantic search engine which supports tensor search (sequence of vectors)
- txtai - Build semantic search applications and workflows
- Semantra - A multipurpose tool for semantically searching documents.
- VQLite - Simple and Lightweight Vector Search Engine
- Vexvault - 100% browser based, open source, scalable, simple, zero-cost vector search
- JVector - A pure Java, zero dependency, embedded vector search engine used by DataStax Astra DB and Apache Cassandra.
- Weaviate - A cloud-native, real-time vector search engine
- Aquila DB - Distribution focused k-NN search algorithm
- Jina - Jina allows you to build deep learning-powered search-as-a-service.