Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-vector-database
A curated list of awesome works related to high dimensional structure/vector search & database
https://github.com/dangkhoasdc/awesome-vector-database
- Google Vector Search (Vertex AI)
- Pinecone
- Weaviate - started-with-weaviate-a-beginners-guide-to-search-with-vector-databases-14bbb9285839)]
- Vespa
- txtai
- marqo
- vectara
- Epsilla
- algolia
- nucliadb
- OpenSearch
- MyScale
- QdrantCloud
- zilliz
- OpenSearch's AlibabaCloud
- Typesense's Cloud
- MongoDB Atlas Vector Search
- SuperDuperDB
- KBD.AI
- From Vespa
- Vector DB Comparison by VectorHub
- Vector DB Feature Matrix
- Faiss
- Typesense
- Qdrant
- Video tutorial
- annoy
- NGT
- pgvector
- Chroma
- LlamaIndex
- Epsilla
- jvector
- RAFT
- Vald
- Voyager
- tinyvector
- USearch
- vearch
- MRPT
- milvus
- infinity
- havenask
- chromem-go
- OasysDB - iJRL5XyL7?usp=sharing)]
- PISA
- Tantivy
- sonic
- SimSIMD
- ANN Benchmarks
- Billion-scale ANNS Benchmarks
- 2021 Result
- BEIR
- VectorDBBench - A Vector Database Benchmark Tool
- Qdrant's Vector Database Benchmarks
- MyScale's Vector Database Benchmark
- Approximate nearest neighbor search on high dimensional data—experiments, analyses, and improvement - 1488.
- Foundations of Multidimensional and Metric Data Structures
- Introduction to Information Retrieval
- Deep Learning for Search
- Foundations of Vector Retrieval
- VLDB
- [slides - tutorial-summary.pdf)]
- Image Retrieval in the Wild (CVPR20)
- Haystack
- Neural Search In Action
- Effective and Efficient: Toward Open-world Instance Re-identification
- [Slides
- [Slides
- [Slides
- SISAP Indexing Challenge
- 2023 Competition
- 2024 Competition
- Search Optimization with Query Likelihood Boosting and Two-Level Approximate Search for Edge Devices
- High-Dimensional Approximate Nearest Neighbor Search: with Reliable and Efficient Distance Comparison Operations. - 27.
- Approximate Nearest Neighbor Search in Recommender Systems
- Accelerating vector search on the GPU with RAPIDS RAFT
- CAPS: A Practical Partition Index for Filtered Similarity Search
- RTNN: accelerating neighbor search using hardware ray tracing - research/rtnn)]
- Physical vs. Logical Indexing with {IDEA}: Inverted {Deduplication-Aware} Index
- Taking two Birds with one k-NN Cache
- Ascent Similarity Caching With Approximate Indexes
- Improving approximate nearest neighbor search through learned adaptive early termination
- Deann: Speeding up kernel-density estimation using approximate nearest neighbor search
- Distance Comparison Operators for Approximate Nearest Neighbor Search: Exploration and Benchmark
- Vector search with small radiuses
- Efficient Proximity Search in Time-accumulating High-dimensional Data using Multi-level Block Indexing
- LeanVec: Search your vectors faster by making them fit.
- Approximate Nearest Neighbour Search on Dynamic Datasets: An Investigation
- Characterizing the Dilemma of Performance and Index Size in Billion-Scale Vector Search and Breaking It with Second-Tier Memory
- [Class Notes
- [Website
- [Paper
- Recent Approaches and Trends in Approximate Nearest Neighbor Search
- [Paper
- [Paper - dinosauria/Rayuela.jl), [nanopq](https://github.com/matsui528/nanopq)]
- [Paper
- [Paper
- [Homepage - us/research/wp-content/uploads/2013/11/pami13opq.pdf), [Code](https://kaiminghe.github.io/cvpr13/matlab_OPQ_release_v1.1.rar), [nanopq](https://github.com/matsui528/nanopq)]
- [Paper - research/faiss-quickeradc)]
- [Paper
- [Paper
- [Paper - research/google-research/tree/master/scann), [Julia Training/Inference](https://github.com/AxelvL/AHPQ.jl)]
- [Paper - Quantization)]
- [Paper
- [Paper
- [Paper
- Similarity search in the blink of an eye with compressed indices.
- Residual Quantization with Implicit Neural Codebooks
- Adanns: A framework for adaptive semantic search
- Nearest neighbor search with compact codes: A decoder perspective
- Projective Clustering Product Quantization
- Product quantizer aware inverted index for scalable nearest neighbor search
- Jointly optimizing query encoder and product quantization to improve retrieval performance
- DeltaPQ: lossless product quantization code compression for high dimensional similarity search - 3616.
- Generalized product quantization network for semi-supervised image retrieval
- Differentiable product quantization for end-to-end embedding compression
- Learning Discrete Document Representations in Web Search
- Efficient Multi-vector Dense Retrieval with Bit Vectors
- Graph-and Tree-based Indexes for High-dimensional Vector Similarity Search: Analyses, Comparisons, and Future Directions - 21.
- [Paper
- Graph based nearest neighbor search: Promises and failures
- [Paper - cv/hnsw), [Go Version](https://github.com/coder/hnsw)]
- [Paper
- [Paper
- [Paper
- [Paper
- BANG: Billion-Scale Approximate Nearest Neighbor Search using a Single GPU.
- Cagra: Highly parallel graph construction and approximate nearest neighbor search for gpus.
- Theoretical and Empirical Analysis of Adaptive Entry Point Selection for Graph-based Approximate Nearest Neighbor Search.
- General and practical tuning method for off-the-shelf graph-based index: Sisap indexing challenge report by team utokyo.
- Starling: An I/O-Efficient Disk-Resident Graph Index Framework for High-Dimensional Vector Similarity Search on Data Segment
- ParlayANN: Scalable and Deterministic Parallel Graph-Based Approximate Nearest Neighbor Search Algorithms
- An Efficient and Robust Framework for Approximate Nearest Neighbor Search with Attribute Constraint
- Pecann: Parallel efficient clustering with graph-based approximate nearest neighbor search
- ELPIS: Graph-Based Similarity Search for Scalable Data Science - 1559.
- Worst-case performance of popular approximate nearest neighbor search implementations: Guarantees and limitations
- Optimizing Graph-based Approximate Nearest Neighbor Search: Stronger and Smarter.
- Graph-based Approximate NN Search: A Revisit
- Speed-ANN: Low-Latency and High-Accuracy Nearest Neighbor Search via Intra-Query Parallelism
- HVS: hierarchical graph structure based on voronoi diagrams for solving approximate nearest neighbor search - 258. [[Code](https://github.com/chuanxiao1983/HVS)]
- Revisiting $ k $-Nearest Neighbor Graph Construction on High-Dimensional Data: Experiments and Analyses
- Understanding and Generalizing Monotonic Proximity Graphs for Approximate Nearest Neighbor Search
- Unleashing Graph Partitioning for Large-Scale Nearest Neighbor Search
- Freshdiskann: A fast and accurate graph-based ann index for streaming similarity search
- Large-Scale Approximate k-NN Graph Construction on GPU
- ACORN: Performant and Predicate-Agnostic Search Over Vector Embeddings and Structured Data
- SeRF: Segment Graph for Range-Filtering Approximate Nearest Neighbor Search - 26.
- Diskann: Fast accurate billion-point nearest neighbor search on a single node.
- Constructing Tree-based Index for Efficient and Effective Dense Retrieval.
- Approximate Nearest Neighbor Search with Window Filters
- ProMIPS: Efficient high-dimensional C-approximate maximum inner product search with a lightweight index
- GTS: GPU-based Tree Index for Fast Similarity Search
- AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval
- Awesome Papers on Learning to Hash
- [Paper
- [Paper
- [Paper - to-hash/blob/master/itq.py), [Matlab code](https://github.com/dangkhoasdc/sah/tree/master/itq)]
- Binary Embedding-based Retrieval at Tencent
- Binary code based hash embedding for web-scale applications
- Unsupervised Online Hashing with Multi-Bit Quantization
- Point-to-hyperplane nearest neighbor search beyond the unit hypersphere
- Fast Search on Binary Codes by Weighted Hamming Distance
- Fast top-K cosine similarity search through XOR-friendly binary quantization on GPUs
- PM-LSH: A fast and accurate LSH framework for high-dimensional approximate NN search - 655.
- Scalable Nearest Neighbor Search with Compact Codes
- Locality-sensitive hashing scheme based on longest circular co-substring
- Spann: Highly-efficient billion-scale approximate nearest neighbor search
- Index-based, high-dimensional, cosine threshold querying with optimality guarantees - 83.
- Semi-convex hull tree: Fast nearest neighbor queries for large scale data on GPUs
- Practical near neighbor search via group testing - 9962. [[Supplement](https://proceedings.neurips.cc/paper_files/paper/2021/file/5248e5118c84beea359b6ea385393661-Supplemental.pdf)]
- iDEC: indexable distance estimating codes for approximate nearest neighbor search
- VHP: approximate nearest neighbor search via virtual hypersphere partitioning - 1455.
- Maze: A Cost-Efficient Video Deduplication System at Web-scale
- LANNS: a web-scale approximate nearest neighbor lookup system
- OneSparse: A Unified System for Multi-index Vector Search
- [Paper
- What is a Vector Database?
- What makes each one different?
- eBay’s Blazingly Fast Billion-Scale Vector Similarity Engine
- Computer Vision Meetup: Computer Vision Applications at Scale with Vector Databases
- How to choose your vector database in 2023?
- Do we really need a specialized vector database?
- Vector database is not a separate database category
- Vector Databases: A First-Principles Approach
- Vector Search RAG Tutorial – Combine Your Data with LLMs with Advanced Search
- Efficient Vector Similarity Search in Recommender Workflows Using Milvus with NVIDIA Merlin
- Vector Databases: A Beginner’s Guide!
- Vector Database and Spring IA
- How to handle a Million Vector Embeddings in the RAG Applications
- How Meilisearch Updates a Millions Vector Embeddings Database in Under a Minute
- Common Pitfalls To Avoid When Using Vector Databases
- Getting Started With Vector Databases
- Awesome Vector Search Engine
Keywords
vector-search
12
search-engine
10
nearest-neighbor-search
9
approximate-nearest-neighbor-search
7
similarity-search
7
information-retrieval
7
machine-learning
6
search
5
vector-database
5
semantic-search
5
hnsw
5
rag
4
python
4
database
4
transformers
3
retrieval-augmented-generation
3
llms
3
knn-search
3
vector-search-engine
3
embeddings
3
llm
3
full-text-search
2
vector
2
deep-learning
2
knn
2
golang
2
pytorch
2
nearest-neighbors
2
clustering
2
java
2
recommender-system
2
nlp
2
neural-search
2
mlops
2
large-language-models
2
image-search
2
rust
2
hybrid-search
2
simd
2
building-blocks
1
cuda
1
text-search
1
distance
1
gpu
1
linear-algebra
1
neighborhood-methods
1
primitives
1
random-sampling
1
solvers
1
sparse
1