awesome-vector-database
A curated list of awesome works related to high dimensional structure/vector search & database
https://github.com/dangkhoasdc/awesome-vector-database
Last synced: 8 days ago
JSON representation
-
Other Approaches
-
Others
- Billion-scale ANNS Benchmarks
- Qdrant's Vector Database Benchmarks
- Foundations of Multidimensional and Metric Data Structures
- Introduction to Information Retrieval
- Deep Learning for Search
- VLDB
- [slides - tutorial-summary.pdf)]
- Image Retrieval in the Wild (CVPR20)
- Neural Search In Action
- Effective and Efficient: Toward Open-world Instance Re-identification
- [Slides
- [Slides
- [Slides
- Search Optimization with Query Likelihood Boosting and Two-Level Approximate Search for Edge Devices
- Foundations of Vector Retrieval
- 2021 Result
- Approximate Nearest Neighbor Search in Recommender Systems
- Accelerating vector search on the GPU with RAPIDS RAFT
- Approximate nearest neighbor search on high dimensional data—experiments, analyses, and improvement - 1488.
- CAPS: A Practical Partition Index for Filtered Similarity Search
- RTNN: accelerating neighbor search using hardware ray tracing - research/rtnn)]
- Physical vs. Logical Indexing with {IDEA}: Inverted {Deduplication-Aware} Index
- Taking two Birds with one k-NN Cache
- Ascent Similarity Caching With Approximate Indexes
- Improving approximate nearest neighbor search through learned adaptive early termination
- Deann: Speeding up kernel-density estimation using approximate nearest neighbor search
- SISAP Indexing Challenge
- 2023 Competition
- 2024 Competition
- Distance Comparison Operators for Approximate Nearest Neighbor Search: Exploration and Benchmark
- Vector search with small radiuses
- Efficient Proximity Search in Time-accumulating High-dimensional Data using Multi-level Block Indexing
- Vector search with small radiuses
- LeanVec: Search your vectors faster by making them fit.
- High-Dimensional Approximate Nearest Neighbor Search: with Reliable and Efficient Distance Comparison Operations. - 27.
- Approximate Nearest Neighbour Search on Dynamic Datasets: An Investigation
- Characterizing the Dilemma of Performance and Index Size in Billion-Scale Vector Search and Breaking It with Second-Tier Memory
- CANDY: A Benchmark for Continuous Approximate Nearest Neighbor Search with Dynamic Data Ingestion.
- Approximate nearest neighbor search on high dimensional data—experiments, analyses, and improvement - 1488.
- CAPS: A Practical Partition Index for Filtered Similarity Search
- RTNN: accelerating neighbor search using hardware ray tracing - research/rtnn)]
- Ascent Similarity Caching With Approximate Indexes
- SimSIMD
- BEIR
- MyScale's Vector Database Benchmark
- VectorHub - source learning website for people (software developers to senior ML architects) interested in adding vector retrieval to their ML stack.
- Distance Comparison Operators for Approximate Nearest Neighbor Search: Exploration and Benchmark
- Worst-case Performance of Popular Approximate Nearest Neighbor Search Implementations: Guarantees and Limitations
- Results of the Big ANN: NeurIPS'23 competition.
- IntelLabs's Vector Search Datasets
- Vector Database Group @ NTU - NTU)]
- Foundations of Multidimensional and Metric Data Structures
- vector-io
- ANN Benchmarks
- Taking two Birds with one k-NN Cache
- Worst-case Performance of Popular Approximate Nearest Neighbor Search Implementations: Guarantees and Limitations
- Operational Advice for Dense and Sparse Retrievers: HNSW, Flat, or Inverted Indexes?.
- Pacmann: Efficient Private Approximate Nearest Neighbor Search.
- Learning Cluster Representatives for Approximate Nearest Neighbor Search.
- VLDB
- VectorDBZ
-
Quantization
- [Paper
- [Paper
- [Paper
- [Paper
- [Paper - research/google-research/tree/master/scann), [Julia Training/Inference](https://github.com/AxelvL/AHPQ.jl)]
- [Paper - Quantization)]
- [Paper
- [Paper
- Similarity search in the blink of an eye with compressed indices.
- Residual Quantization with Implicit Neural Codebooks
- Adanns: A framework for adaptive semantic search
- Projective Clustering Product Quantization
- Product quantizer aware inverted index for scalable nearest neighbor search
- DeltaPQ: lossless product quantization code compression for high dimensional similarity search - 3616.
- Generalized product quantization network for semi-supervised image retrieval
- Differentiable product quantization for end-to-end embedding compression
- Efficient Multi-vector Dense Retrieval with Bit Vectors
- [Paper - research/faiss-quickeradc)]
- Nearest neighbor search with compact codes: A decoder perspective
- Jointly optimizing query encoder and product quantization to improve retrieval performance
- RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search.
- [Paper
- [Paper
- [Paper
- Projective Clustering Product Quantization
- Efficient Multi-vector Dense Retrieval with Bit Vectors
- [Paper - research/google-research/tree/master/scann), [Julia Training/Inference](https://github.com/AxelvL/AHPQ.jl)]
- [Paper
- [Paper
- Similarity search in the blink of an eye with compressed indices.
- Residual Quantization with Implicit Neural Codebooks
- DeltaPQ: lossless product quantization code compression for high dimensional similarity search - 3616.
- Differentiable product quantization for end-to-end embedding compression
- Learning Discrete Document Representations in Web Search
- Practical and Asymptotically Optimal Quantization of High-Dimensional Vectors in Euclidean Space for Approximate Nearest Neighbor Search.
- Incremental IVF Index Maintenance for Streaming Vector Search.
- Fast High-dimensional Approximate Nearest Neighbor Search with Efficient Index Time and Space.
- Learned Data Compression: Challenges and Opportunities for the Future.
- Qinco2: Vector Compression and Search with Improved Implicit Neural Codebooks.
- Cracking Vector Search Indexes
- Efficient Dynamic Indexing for Range Filtered Approximate Nearest Neighbor Search
- OrchANN: A Unified I/O Orchestration Framework for Skewed Out-of-Core Vector Search.
- Curator: Efficient Vector Search with Low-Selectivity Filters.
- Pyramid Product Quantization for Approximate Nearest Neighbor Search.
- Quantization Meets Projection: A Happy Marriage for Approximate k-Nearest Neighbor Search.
- Multiscale quantization for fast similarity search.
- BBC: Improving Large-k Approximate Nearest Neighbor Search with a Bucket-based Result Collector.
- A real-time adaptive multi-stream gpu system for online approximate nearest neighborhood search.
- EIVF: Efficient IVFPQ Search for On-Device ARM Processors. - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2026.
- Large-Scale Data Parallelization of Product Quantization and Inverted Indexing Using Dask.
- TTVI: A Two-Tier Vector Index for Low-WAL Approximate Nearest Neighbor Search in Databases.
- CS-PQ: Cache-Friendly SIMD Product Quantization for Large-Scale ANNS Index Construction.
-
Related Lists
-
Survey
- [Paper
- [Paper
- Recent Approaches and Trends in Approximate Nearest Neighbor Search
- Recent Approaches and Trends in Approximate Nearest Neighbor Search
- Experimental comparison of graph-based approximate nearest neighbor search algorithms on edge devices.
- Cloud-Native Vector Search: A Comprehensive Performance Analysis.
- Disk-Resident Vector Similarity Search: A Survey.
- A Survey on Query Processing in Vector Databases.
-
Systems
- Maze: A Cost-Efficient Video Deduplication System at Web-scale
- LANNS: a web-scale approximate nearest neighbor lookup system
- OneSparse: A Unified System for Multi-index Vector Search
- LANNS: a web-scale approximate nearest neighbor lookup system
- CoTra: Towards Efficient and Scalable Distributed Vector Search with RDMA. - 27.
- LindormVector: A Distributed Vector Engine on a Cloud-Native Multi-Model NoSQL Database.
-
Texts
-
🎄Tree-based Methods
- Diskann: Fast accurate billion-point nearest neighbor search on a single node.
- Approximate Nearest Neighbor Search with Window Filters
- GTS: GPU-based Tree Index for Fast Similarity Search
- AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval
- ProMIPS: Efficient high-dimensional C-approximate maximum inner product search with a lightweight index
- AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval
- Constructing Tree-based Index for Efficient and Effective Dense Retrieval.
- Efficient Index Layout and Search Strategy for Large-scale High-dimensional Vector Similarity Search. - 27.
-
Tree-based Methods
-
Uncategorized
-
Uncategorized
- Google Vector Search (Vertex AI)
- Pinecone
- Vespa
- algolia
- nucliadb
- OpenSearch
- MyScale
- QdrantCloud
- zilliz
- OpenSearch's AlibabaCloud
- MongoDB Atlas Vector Search
- KBD.AI
- Weaviate - started-with-weaviate-a-beginners-guide-to-search-with-vector-databases-14bbb9285839)]
- txtai
- marqo
- Meilisearch
- vectara
- Denser Retriever
- MyScale
-
Programming Languages
Categories
Graph-based Methods
69
Others
62
Quantization
53
📰 Articles & Talks
48
Multidimensional data / Vectors
42
Hashing
20
Uncategorized
19
Other Approaches
17
🎄Tree-based Methods
8
Survey
8
Systems
6
Courses
4
Texts
3
Comparisons
3
Tree-based Methods
1
Related Lists
1
:chart_with_upwards_trend: Evaluation & Metrics
1
Sub Categories
Keywords
vector-search
16
search-engine
13
vector-database
11
similarity-search
10
nearest-neighbor-search
9
approximate-nearest-neighbor-search
9
rag
9
machine-learning
9
llm
8
information-retrieval
7
hnsw
7
search
6
embeddings
6
rust
5
semantic-search
4
image-search
4
knn-search
4
ai
4
nearest-neighbors
4
vector
4
weaviate
3
benchmark
3
python
3
vector-search-engine
3
milvus
3
simd
3
pinecone
3
qdrant
3
recommender-system
3
vector-store
3
clustering
3
ann
3
anns
2
cuda
2
embedding
2
golang
2
distance
2
llms
2
full-text-search
2
gpu
2
neighborhood-methods
2
chromadb
2
chroma
2
retrieval-augmented-generation
2
vectordb
2
nlp
2
vectordatabase
2
retrieval
2
database
2
vector-similarity
2