awesome-vector-databases
A curated list of vector database solutions, libraries, and resources for AI applications - https://vectordb.works
https://github.com/ever-works/awesome-vector-databases
Last synced: 6 days ago
JSON representation
-
Core Vector Databases
- NucliaDB - NucliaDB is a versatile vector database designed for data scientists and machine learning experts working with HuggingFace and other data pipeline platforms. Built on Tantivy in Rust and Python, it efficiently indexes large datasets with multi-tenant support. ([Read more](/details/nucliadb.md)) `Open Source` `Rust` `Multi Tenant`
- SuperDuperDB - Open-source database that turns any DB into a vector DB with AI capabilities. ([Read more](/details/superduperdb.md)) `Open Source` `Ai Native` `Flexible`
- TiDB Vector Search - Open-source distributed SQL database with integrated vector search for storing embeddings alongside relational data, offering strong SQL-based filtering, hybrid search, and high scalability for production RAG and AI applications. ([Read more](/details/tidb-vector-search.md)) `Open Source` `Hybrid Search` `Distributed` `Sql`
- TileDB Vector Search - TileDB Vector Search is a scalable open-source vector database that stores and performs approximate nearest neighbor searches on high-dimensional dense and sparse vectors using TileDB's multi-dimensional array storage for petabyte-scale data. Key features include Vamana graph and IVF-PQ indexing, metadata filtering, multi-tenancy, serverless scalability on object stores like S3, and APIs in Python/C++ with gRPC support. Suited for RAG pipelines, recommendation systems, and anomaly detection; excels in sparse vector efficiency and cost savings compared to Milvus or Pinecone, while scaling better than Faiss for large production deployments. ([Read more](/details/tiledb-vector-search.md)) `Open Source` `Scalable ANN` `2026 Production` `Production Use` `2026 Ready`
- Vector.ai - Vector.ai is a managed vector search platform that provides an API for creating, managing, and searching vector indices. It is designed to handle large volumes of high-dimensional data for efficient similarity search in machine learning and AI applications. ([Read more](/details/vectorai.md)) `Managed` `Autoscaling` `Ml Integration`
- VelesDB - Embedded vector + graph + columnar database with HNSW indexing. ([Read more](/details/velesdb.md)) `Embedded` `Rust` `Open Source` `Graph`
- Vexvault - Vexvault is a 100% browser-based document storage system designed to make files and data accessible to AI applications like ChatGPT while ensuring user privacy and security. It aims to be easy to integrate and use. ([Read more](/details/vexvault.md)) `Open Source` `Browser Based` `Privacy Focused`
-
curated-resource-lists
- Vertex AI Embeddings - Google Cloud’s managed embeddings service that generates text and multimodal vector representations for search, retrieval, and other AI applications. Frequently used alongside vector databases or vector search services to populate and update vector indexes. ([Read more](/details/vertex-ai-embeddings.md))
- Vertex AI Pipelines - A serverless ML orchestration service on Google Cloud used to build automated pipelines that can generate embeddings and create or update vector search indexes, supporting MLOps workflows for vector database–backed search and recommendation systems. ([Read more](/details/vertex-ai-pipelines.md))
-
Curated Resource Lists
- Awesome papers and technical blogs on vector DB - A curated collection of papers and technical blogs focused on vector databases, semantic-based vector search, and approximate nearest neighbor search (ANN Search). These resources are essential for understanding and building large-scale information retrieval systems and vector databases. ([Read more](/details/awesome-papers-and-technical-blogs-on-vector-db.md)) `vector databases` `research` `blogs` `ANN` `semantic search`
- awesome-vector-database - A curated awesome list compiling resources, tools, vector databases, and research relevant to vector search and storage. Serves as a meta-resource for exploring the vector database ecosystem. ([Read more](/details/awesome-vector-database.md)) `vector databases` `resources` `tools` `awesome list`
- awesome-vector-search - A curated collection of libraries, services, and research papers focused on vector search, including vector database technologies and related resources. ([Read more](/details/awesome-vector-search.md)) `vector search` `libraries` `resources` `papers`
- IntelLabs's Vector Search Datasets - A collection of datasets curated by Intel Labs specifically for evaluating and benchmarking vector search algorithms and databases. ([Read more](/details/intellabss-vector-search-datasets.md)) `datasets` `vector search` `benchmark` `evaluation`
- Kinomoto.Mag AI - Kinomoto.Mag AI is a blog focused on AI tools, news, and tutorials, including curated lists of vector databases for AI applications. It serves as a resource hub for those interested in the latest innovations in vector databases and AI technologies. ([Read more](/details/kinomotomag-ai.md)) `blog` `AI` `resources` `vector databases`
- KShivendu/awesome-vector-search - A curated list of awesome projects and research related to vector search, including dedicated vector databases, vector search libraries, performance benchmarks, and cost analysis resources. ([Read more](/details/kshivenduawesome-vector-search.md)) `awesome list` `vector search` `resources` `open-source`
- LibHunt Vector Database Projects - A curated collection of open-source vector database projects, providing a centralized list for exploring and comparing solutions designed for vector search and AI applications. ([Read more](/details/libhunt-vector-database-projects.md)) `open-source` `vector databases` `resources` `AI`
- vector-search-papers - A curated GitHub repository of research papers and technical blogs focused on vector search, approximate nearest neighbor search (ANN Search), and vector databases. This resource serves as a comprehensive directory for foundational and cutting-edge research, making it highly relevant for anyone building or exploring vector database technologies. ([Read more](/details/vector-search-papers.md)) `vector search` `research` `papers` `ANN` `vector databases`
- VectorDB.Works - A web-based directory of vector database solutions, libraries, and resources for AI applications, serving as an accessible resource for exploring and comparing vector databases. ([Read more](/details/vectordbworks.md)) `resources` `vector databases` `directory` `AI`
- VectorHub - VectorHub is a resource and learning platform for developers and ML architects interested in integrating vector retrieval and search capabilities into their machine learning stacks, directly supporting vector database adoption and usage. ([Read more](/details/vectorhub.md)) `resources` `vector search` `learning` `open-source`
- weaviate-examples - Examples and resources for Weaviate, a popular open-source vector database optimized for storing and searching vector embeddings at scale. ([Read more](/details/weaviate-examples.md)) `Weaviate` `examples` `resources` `vector embeddings`
- XiaomingX/awesome-vector-database - A curated directory of resources, tools, tutorials, and libraries dedicated to vector databases, focusing on efficient data retrieval, similarity search, and machine learning applications. ([Read more](/details/xiaomingxawesome-vector-database.md)) `vector databases` `resources` `tutorials` `similarity search`
- Mastering Multimodal RAG - A course focused on mastering multimodal Retrieval Augmented Generation (RAG) and embeddings, which are fundamental components often stored and managed by vector databases. ([Read more](/details/mastering-multimodal-rag.md)) `RAG` `multimodal` `embeddings` `tutorials`
- OpenAI Cookbook - A collection of examples and guides from OpenAI, including best practices for working with embeddings, which are fundamental to vector search and vector database applications. ([Read more](/details/openai-cookbook.md)) `OpenAI` `embeddings` `resources`
- VectorDB.Works - A web-based directory of vector database solutions, libraries, and resources for AI applications, serving as an accessible resource for exploring and comparing vector databases. ([Read more](/details/vectordbworks.md)) `resources` `vector databases` `directory` `AI`
- MongoDB Vector Search - MongoDB Vector Search turns MongoDB into a full-featured vector database, enabling approximate and exact nearest neighbor search over vector embeddings stored alongside operational data. It supports semantic similarity search, retrieval-augmented generation (RAG) for AI applications, and lets you combine vector search with full‑text search and structured filters in the same query. Available on supported MongoDB Atlas clusters, it integrates with popular AI frameworks and services for building intelligent, agentic systems. ([Read more](/details/mongodb-vector-search.md))
- Survey of Vector Database Management Systems - A comprehensive 2023 survey that systematically analyzes the design, architecture, indexing techniques, and system implementations of modern vector database management systems, serving as a foundational reference for understanding the vector database ecosystem used in AI applications. ([Read more](/details/survey-of-vector-database-management-systems.md))
- Vector DB Feature Matrix - A collaboratively maintained Google Sheets matrix comparing features, capabilities, and characteristics of many vector databases and approximate nearest neighbor libraries, useful for selecting solutions for AI and similarity search applications. ([Read more](/details/vector-db-feature-matrix.md))
- Algolia Vector Search - Algolia’s vector search capability that augments its search-as-a-service platform with semantic and similarity search using embeddings. ([Read more](/details/algolia-vector-search.md))
- Chroma - Chroma is an open-source AI-native vector database that provides semantic, full-text, and regex search as a memory layer for LLM and RAG applications. ([Read more](/details/chroma.md))
- chromem-go - chromem-go is a Go client and implementation for Chroma-like vector database functionality, enabling embedding storage and similarity search in Go applications. ([Read more](/details/chromem-go.md))
- Databricks Vector Search - Databricks Vector Search is a managed vector search capability in Databricks that lets you create and maintain vector search indexes over Delta tables. It supports multiple modes for providing vector embeddings, including Databricks-computed embeddings (Delta Sync Index with managed embeddings), self-managed precomputed embeddings (Delta Sync Index with self-managed embeddings), and Direct Vector Access Index where clients directly manage vector updates via REST APIs. It is designed for AI and RAG-style applications built on top of the Databricks Lakehouse, enabling similarity search with metadata filters and tight integration with Unity Catalog and Delta Lake. ([Read more](/details/databricks-vector-search.md))
- Efficient Multi-vector Dense Retrieval with Bit Vectors (emvb) - emvb is an open-source implementation of the "Efficient Multi-vector Dense Retrieval with Bit Vectors" method, providing a specialized vector-search index for multi-vector dense retrieval using compact bit-vector representations to accelerate ANN search and reduce memory usage in vector database and retrieval systems. ([Read more](/details/efficient-multi-vector-dense-retrieval-with-bit-vectors-emvb.md))
- Foundations of Vector Retrieval - A comprehensive survey/tutorial paper that formalizes the principles, models, and system designs for vector retrieval, offering theoretical and practical foundations for modern vector databases and vector search engines. ([Read more](/details/foundations-of-vector-retrieval.md))
- Image Retrieval in the Wild - A CVPR 2020 tutorial on large-scale image retrieval in unconstrained environments, including methods and system considerations for vector-based image search relevant to vector database and ANN applications. ([Read more](/details/image-retrieval-in-the-wild.md)) `tutorials` `multimodal` `vector search`
- Implement two-tower retrieval for large-scale candidate generation - A Google Cloud reference architecture demonstrating an end-to-end two-tower retrieval system for large-scale candidate generation that uses Vertex AI and vector similarity search concepts to learn and serve semantic similarity between entities. ([Read more](/details/implement-two-tower-retrieval-for-large-scale-candidate-generation.md)) `RAG` `semantic search` `architectures`
- Introduction to Information Retrieval - Foundational IR textbook that includes content on vector‑space models and retrieval, providing essential background for understanding vector search and hybrid retrieval in modern vector databases. ([Read more](/details/introduction-to-information-retrieval.md)) `resources` `search` `learning`
- Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search - Research paper proposing lossless compression techniques for vector identifiers in approximate nearest neighbor (ANN) search systems, aiming to reduce memory footprint and improve efficiency in large-scale vector databases and similarity search engines. ([Read more](/details/lossless-compression-of-vector-ids-for-approximate-nearest-neighbor-search.md))
- Mosaic AI Vector Search - Mosaic AI Vector Search is Databricks’ managed vector database and similarity search service for AI applications, providing high‑capacity, high‑performance vector indexing and querying with configurable endpoint types, including standard and storage‑optimized endpoints that scale to over one billion 768‑dimensional vectors. ([Read more](/details/mosaic-ai-vector-search.md))
- MyScale Vector Database Benchmark - Benchmark framework and results from MyScale for comparing vector database and ANN index performance using large‑scale datasets and common query workloads relevant to AI applications. ([Read more](/details/myscale-vector-database-benchmark.md))
- Neural Search in Action - A CVPR 2023 tutorial that demonstrates neural search systems in practice, including vector representations, similarity search, and scalable retrieval architectures closely related to vector databases. ([Read more](/details/neural-search-in-action.md)) `tutorials` `neural search` `vector search`
- Oracle AI Vector Search - Oracle AI Vector Search is Oracle’s integrated vector search capability within Oracle AI Database 26ai, enabling storage and querying of vector embeddings alongside traditional business data. It introduces a native VECTOR data type and supports high‑dimensional semantic similarity search for AI workloads such as chatbots, recommendation systems, anomaly detection, and multimedia search, while allowing embeddings to be used directly with Oracle machine learning algorithms. ([Read more](/details/oracle-ai-vector-search.md))
- Passing the Baton: High Throughput Distributed Disk-Based Vector Search with BatANN - BatANN is a distributed, disk-based vector search system designed for high-throughput approximate nearest neighbor queries at scale, providing an architecture and methods applicable to large-scale vector databases that need efficient storage beyond memory. ([Read more](/details/passing-the-baton-high-throughput-distributed-disk-based-vector-search-with-batann.md))
- PDX: A Data Layout for Vector Similarity Search - PDX is a proposed data layout optimized for vector similarity search, focusing on memory and access efficiency for high-dimensional embeddings, making it relevant for the internal storage design of vector databases and ANN indexes. ([Read more](/details/pdx-a-data-layout-for-vector-similarity-search.md))
- Understanding and Applying Text Embeddings (Vertex AI Short Course) - Short course by DeepLearning.AI and Google Cloud that teaches how to generate and use text embeddings with the Vertex AI Embeddings API for semantic search, classification, and question-answering systems, providing foundational knowledge for working with vector databases and retrieval. ([Read more](/details/understanding-and-applying-text-embeddings-vertex-ai-short-course.md))
- Vector Database Cloud - Vector Database Cloud is a managed cloud platform and ecosystem for building, deploying, and operating applications that use vector databases such as Qdrant and Milvus. It provides APIs, dashboards, and tooling tailored for AI and embedding-based workloads, enabling use cases like content recommendation and real-time fraud detection. ([Read more](/details/vector-database-cloud.md))
- Vector Search - Vector Search is Google Cloud Vertex AI’s managed vector search engine built on the ScaNN algorithm. It provides scalable, high‑performance vector similarity search for semantic search, recommendations, and generative AI applications, offering enterprise‑grade availability and the same underlying technology used in Google products like Search, YouTube, and Google Play. ([Read more](/details/vector-search.md))
- Vector Search and Embeddings (Google Cloud Skills Boost Course) - Google Cloud Skills Boost course that covers the fundamentals of vector search and text embeddings and shows how to build a vector search application on Vertex AI, including conceptual lessons, demos, and a practice lab. ([Read more](/details/vector-search-and-embeddings-google-cloud-skills-boost-course.md))
- vector-io - Comprehensive vector data tooling library focused on working with vector embeddings and ANN data, useful for building, evaluating, and managing datasets and pipelines for vector databases and similarity search systems. ([Read more](/details/vector-io.md))
- Vertex AI Embeddings - Google Cloud’s managed embeddings service that generates text and multimodal vector representations for search, retrieval, and other AI applications. Frequently used alongside vector databases or vector search services to populate and update vector indexes. ([Read more](/details/vertex-ai-embeddings.md))
- Vertex AI Feature Store - A managed feature store on Google Cloud that serves real-time feature data, often used alongside vector search to enrich or filter results returned from vector indexes in production recommendation and search systems. ([Read more](/details/vertex-ai-feature-store.md))
- Vertex AI Pipelines - A serverless ML orchestration service on Google Cloud used to build automated pipelines that can generate embeddings and create or update vector search indexes, supporting MLOps workflows for vector database–backed search and recommendation systems. ([Read more](/details/vertex-ai-pipelines.md))
- Vertex AI Search ranking API - A Google Cloud API that reranks documents based on semantic relevance using pretrained language models. It complements vector search by improving result ordering for content retrieved from vector databases or vector indexes. ([Read more](/details/vertex-ai-search-ranking-api.md))
- WARP: An Efficient Engine for Multi-Vector Retrieval - WARP is a research engine for efficient multi-vector retrieval, designed to improve performance of systems that store and search multiple embeddings per document—such as modern vector databases for RAG and semantic search workloads. ([Read more](/details/warp-an-efficient-engine-for-multi-vector-retrieval.md))
- VLDB - New Trends in High-D Vector Similarity Search (Tutorial) - A VLDB conference tutorial focused on new trends and techniques for high-dimensional vector similarity search, covering core algorithms and system designs that underpin modern vector databases and large-scale ANN search. ([Read more](/details/vldb-new-trends-in-high-d-vector-similarity-search-tutorial.md))
- MongoDB Vector Search - MongoDB Vector Search turns MongoDB into a full-featured vector database, enabling approximate and exact nearest neighbor search over vector embeddings stored alongside operational data. It supports semantic similarity search, retrieval-augmented generation (RAG) for AI applications, and lets you combine vector search with full‑text search and structured filters in the same query. Available on supported MongoDB Atlas clusters, it integrates with popular AI frameworks and services for building intelligent, agentic systems. ([Read more](/details/mongodb-vector-search.md))
- Building Applications with Vector Databases - DeepLearning.AI course teaching six practical vector database applications using Pinecone, including RAG for LLMs, recommender systems, and hybrid search combining images and text. ([Read more](/details/building-applications-with-vector-databases.md)) `Learning` `Tutorials` `Rag`
- Vector Database Market Trends 2026 - Comprehensive overview of vector database evolution in 2026, including the shift to vectors as data types, PostgreSQL dominance, 400% adoption surge, and $10.6B projected market by 2032. ([Read more](/details/vector-database-market-trends-2026.md)) `Market` `Trends` `Survey`
- Embedding Model Selection Guide - Comprehensive guide to choosing embedding models covering performance, cost, domain specialization, multilingual support, and trade-offs between general-purpose and specialized models. ([Read more](/details/embedding-model-selection-guide.md)) `Embeddings` `Models` `selection`
- GraphAcademy Knowledge Graph and GraphRAG Course - Free online courses from Neo4j GraphAcademy teaching how to build RAG systems on knowledge graphs. Covers fundamentals of combining graph databases with vector search for more accurate and explainable AI applications. ([Read more](/details/graphacademy-knowledge-graph-and-graphrag-course.md)) `Learning` `Tutorials` `Knowledge Graph`
- LangChain & Vector Databases in Production - Free comprehensive course from Activeloop with 60+ lessons and 10+ practical projects, teaching production-ready LLM applications with vector databases, trusted by 10,000+ engineers. ([Read more](/details/langchain-vector-databases-in-production.md)) `Learning` `Langchain` `Rag`
- Vector Database Benchmarking - Comprehensive guide to benchmarking vector databases covering performance testing methodologies, standard benchmarks like ANN-Benchmarks, and best practices for evaluating throughput, latency, and accuracy. ([Read more](/details/vector-database-benchmarking.md)) `benchmarking` `Performance` `Testing`
- Vector Database Cost Optimization - Strategies for reducing vector database costs including quantization, dimension reduction, efficient indexing, storage tiering, and choosing cost-effective deployment options. ([Read more](/details/vector-database-cost-optimization.md)) `Cost Optimization` `economics` `storage`
- Vector Database Fundamentals (Coursera) - IBM's comprehensive specialization providing job-ready vector database skills in one month, covering foundational knowledge for LLM-powered AI similarity searches, available for free enrollment. ([Read more](/details/vector-database-fundamentals-coursera.md)) `Learning` `Tutorials` `Certification`
- Vector Database Observability - Comprehensive guide to monitoring vector databases including key metrics, logging strategies, tracing, alerting, and debugging techniques for production vector search systems. ([Read more](/details/vector-database-observability.md)) `Monitoring` `Observability` `Operations`
- Awesome-Context-Engineering - A comprehensive curated survey on Context Engineering covering the progression from prompt engineering to production-grade AI systems. The repository contains hundreds of papers, frameworks, and implementation guides for LLMs and AI agents, serving as a centralized reference for researchers and practitioners. ([Read more](/details/awesome-context-engineering.md)) `Github` `Context Engineering` `Llm`
-
Data Integration & Migration
- Attu - Attu is a graphical user interface (GUI) tool for managing and administering Milvus vector databases. It simplifies tasks such as data exploration, schema management, and monitoring, making Milvus more accessible for a wide range of users. ([Read more](/details/attu.md)) `GUI` `management` `Milvus` `open-source`
- Birdwatcher - Birdwatcher is a system debugging tool designed for the Milvus vector database. It provides advanced diagnostics to help developers and operators understand and troubleshoot Milvus deployments, ensuring robust vector search operations. ([Read more](/details/birdwatcher.md)) `debugging` `Milvus` `management` `open-source`
- Milvus Backup Tool - Milvus Backup Tool provides backup and restore functionalities for Milvus vector databases, ensuring data safety and disaster recovery capabilities. Also referred to as Milvus Backup. ([Read more](/details/milvus-backup-tool.md)) `Milvus` `backup` `restore` `disaster recovery`
- Milvus CDC - Milvus CDC (Change Data Capture) is a component of the Milvus ecosystem that enables data synchronization between Milvus and other systems. It is useful for maintaining up-to-date vector data pipelines and supporting real-time vector search applications. ([Read more](/details/milvus-cdc.md)) `Milvus` `data synchronization` `real-time` `vector databases`
- Milvus Connectors - Milvus Connectors, such as the Spark-Milvus Connector, enable seamless integration of Milvus vector databases with third-party tools like Apache Spark for machine learning and data processing workflows. ([Read more](/details/milvus-connectors.md)) `Milvus` `integration` `machine learning` `Apache Spark`
- MindsDB Milvus Integration - MindsDB provides an integration with Milvus, enabling users to connect and manage vector data using SQL-like queries. This integration brings federated AI query capabilities across structured and unstructured data with Milvus as the vector database backend. ([Read more](/details/mindsdb-milvus-integration.md)) `Milvus` `integration` `AI` `SQL`
- Vector Transport Service (VTS) - Vector Transport Service (VTS) is a tool for transporting vector data efficiently between Milvus clusters or environments, supporting large-scale data migration and synchronization. Vector Transmission Services (VTS) are tools for transferring data between Milvus and various data sources (like Zilliz clusters, Elasticsearch, Postgres/PgVector, or other Milvus instances), facilitating vector data migration and integration. ([Read more](/details/vector-transport-service-vts.md)) `vector data` `migration` `integration` `Milvus`
- MindsDB Milvus Integration - MindsDB provides an integration with Milvus, enabling users to connect and manage vector data using SQL-like queries. This integration brings federated AI query capabilities across structured and unstructured data with Milvus as the vector database backend. ([Read more](/details/mindsdb-milvus-integration.md)) `Milvus` `integration` `AI` `SQL`
- Airbyte Milvus Connector - The Airbyte Milvus connector lets users sync data from various Airbyte-supported sources into Milvus as a destination, enabling low-code vector data ingestion pipelines. ([Read more](/details/airbyte-milvus-connector.md)) `integration` `migration` `vector data`
- Kafka Connect Milvus Connector - The Kafka Connect Milvus Connector is a plugin for Kafka Connect that streams data into and out of Milvus, supporting real-time vector data ingestion pipelines. ([Read more](/details/kafka-connect-milvus-connector.md)) `integration` `real-time` `vector data`
- Milvus Destination for Fivetran - The Milvus destination in Fivetran enables automated ELT pipelines that load data into Milvus as a vector database, supporting AI and similarity search workloads. ([Read more](/details/milvus-destination-for-fivetran.md)) `integration` `ETL` `vector data`
- Spark-Milvus Connector - The Spark-Milvus Connector is an integration that allows Apache Spark jobs to read from and write to Milvus, enabling scalable ETL and analytics workflows for vector data. ([Read more](/details/spark-milvus-connector.md)) `integration` `Apache Spark` `vector data`
- VTS (Vector Transfer Service) - VTS is a data migration and connector service for Milvus that simplifies moving and synchronizing vector data between Milvus instances and external systems. ([Read more](/details/vts-vector-transfer-service.md)) `migration` `data synchronization` `Milvus`
- Aryn DocParse - A compound AI system for parsing, chunking, enriching, and storing unstructured documents at scale, trained on 80k+ enterprise documents and delivering up to 6x better accuracy and 5x cost savings compared to alternative systems. ([Read more](/details/aryn-docparse.md)) `Document Parsing` `Rag` `Data Preparation`
- Kanister for Vector Database Backup - Open-source CNCF Sandbox project enabling efficient and secure backup and restore strategies for vector databases on Kubernetes with cloud-native integration. ([Read more](/details/kanister-for-vector-database-backup.md)) `Backup` `Kubernetes` `Disaster Recovery`
- LlamaHub - Open-source repository with 160+ community-created data loaders, readers, tools, and connectors for LlamaIndex applications, covering formats from PDFs to Notion databases. ([Read more](/details/llamahub.md)) `Data Integration` `Loaders` `Open Source`
- Sycamore - An open-source, LLM-powered document processing engine for ETL, RAG, and analytics on unstructured data, featuring a DocSet abstraction similar to Apache Spark and delivering 6x more accurate data chunking with 2x improved recall for hybrid search. ([Read more](/details/sycamore.md)) `Document Processing` `Etl` `Open Source`
- VectorETL - Powerful and flexible ETL framework designed to streamline the process of extracting data from various sources, transforming it into vector embeddings, and loading these embeddings into a range of vector databases. Requires no code to execute end-to-end processes. ([Read more](/details/vectoretl.md)) `Etl` `No Code` `Open Source`
- MindsDB Milvus Integration - MindsDB provides an integration with Milvus, enabling users to connect and manage vector data using SQL-like queries. This integration brings federated AI query capabilities across structured and unstructured data with Milvus as the vector database backend. ([Read more](/details/mindsdb-milvus-integration.md)) `Milvus` `Integration` `Ai` `Sql`
- Firecrawl - Web data API that scrapes, crawls, and extracts structured LLM-ready data from any website. Covers 96% of the web including JavaScript-heavy pages with sub-1-second response times. ([Read more](/details/firecrawl.md)) `Web Scraping` `Data Extraction` `Llm Ready`
-
Data Processing
- NVIDIA cuDF - Open-source Python GPU DataFrame library that accelerates popular data engines like Apache Spark, pandas, and Polars on NVIDIA AI infrastructure. Built on Apache Arrow, it utilizes GPU parallelism and memory bandwidth to accelerate data processing and analytics workflows, serving as the data-processing foundation for the Sirius GPU-accelerated database project. ([Read more](/details/nvidia-cudf.md)) `GPU-accelerated` `Dataframe` `Apache Arrow`
- PageIndex - Open-source tool by VectifyAI for pagewise document indexing that converts PDF pages into image representations for downstream multimodal embedding and retrieval. Designed to support late-interaction-based retrieval approaches like ColPali by preserving original document layout and visual structure. ([Read more](/details/pageindex.md)) `Open Source` `Multimodal` `Document Parsing`
- SmallPond - A distributed data processing framework for vector data operations, providing lightweight parallel processing capabilities for embedding pipelines and data preparation workflows. ([Read more](/details/smallpond.md)) `Distributed` `Data Processing` `Embedding Pipeline` `Parallel` `Workflows`
- ruvector-scipix - Rust OCR engine for scientific documents, extracting text and mathematical equations to LaTeX, MathML, or plain text. Supports batch processing, content detection for equations/tables/diagrams, confidence scoring, and PDF support. Includes TypeScript client (@ruvector/scipix) and CLI (scipix-cli). ([Read more](/details/ruvector-scipix.md)) `ocr` `Rust` `scientific` `Open Source`
-
Developer Tools & Benchmarks
- BenchmarkQED - BenchmarkQED standardizes QPS/latency/accuracy evaluations for RAG pipelines including vector DB retrieval on diverse datasets. Features comparable methodologies for fair benchmarking of full RAG stacks. Essential for selecting production vector DBs in RAG; emphasizes retrieval fairness unlike ANN-Benchmarks indexing focus or VectorDBBench system-level throughput tests. ([Read more](/details/benchmarkqed.md)) `Benchmarking` `Performance Evaluation` `Rag Benchmark`
-
Edge Database
- pgEdge - Distributed extension of PostgreSQL designed for multi-region and edge deployments, enabling data to be processed closer to users while maintaining consistency across geographically distributed nodes. ([Read more](/details/pgedge.md)) `Distributed` `Postgresql Extension` `Multi Region`
-
Embedded & Edge Vector Databases
- Sonic - Fast in-memory backend with HNSW vector support for low-latency on-device hybrid full-text + vector search, ideal for edge performance-critical apps. Sub-ms ingest/retrieval; lightweight alternative to cloud Qdrant. ([Read more](/details/sonic.md)) `In Memory Fast` `Lightweight Search` `Open Source` `Edge AI`
- ruvector-core - Rust core for high-performance on-device HNSW vector search with SIMD and compression, achieving low-latency multi-threaded queries for edge AI RAG. Up to 3,597 QPS; optimized for real-time vs cloud alternatives. ([Read more](/details/ruvector-core.md)) `Open Source` `Rust` `Hnsw` `Simd` `Rust Lang` `Performance Critical` `Wasm Support` `Edge AI`
- rvf-launch - QEMU microVM launcher for low-latency RVF cognitive containers in RuVector stack, enabling secure on-device vector processing for edge AI environments. ([Read more](/details/rvf-launch.md)) `Rust` `Microvm` `Qemu` `Virtualization` `Open Source` `Edge AI`
-
Embedded Vector Databases
- embedded-vector-db - Lightweight npm package providing an embedded vector database for Node.js applications. Offers vector similarity search with HNSW, BM25 full-text search, hybrid search using weighted fusion or Reciprocal Rank Fusion (RRF), multi-namespace support, CRUD operations, metadata filtering, concurrency safety, and persistent storage to disk. Designed for RAG pipelines and semantic search use cases. ([Read more](/details/embedded-vector-db.md)) `Open Source` `Embedded` `Lightweight` `No Server` `Hybrid Search` `Nodejs` `Bm25`
- rvLite - Standalone 2MB edge vector database for IoT, mobile, and embedded applications, providing full vector search capabilities without server dependency. ([Read more](/details/rvlite.md)) `Edge` `Embedded` `Standalone` `Lightweight` `No Server`
- tinyvector - A tiny embedding database in pure Rust, implemented as a lightweight Axum server for fast vector search on small to medium datasets. It stores all indexes in memory, enabling vertical scaling to over 100 million vectors with comparable speed and slightly better accuracy than advanced vector databases. Open-source under the MIT license, ideal for simple setups like document chat or website search. ([Read more](/details/tinyvector-m1guelpf.md)) `Open Source` `Rust` `Lightweight` `Embedded` `No Server` `In Memory`
- Victor - Web-optimized vector database written in Rust. It offers Rust and JavaScript APIs with efficient vector storage formats that consume significantly less space than JSON, and supports PCA compression for low-storage scenarios. Designed for native filesystem, in-memory, and web environments via WebAssembly. ([Read more](/details/victor.md)) `Open Source` `Rust` `Embedded` `Lightweight` `No Server` `Wasm`
- VortexDB - Vector database built from scratch in Rust for efficient similarity search in AI applications. Supports pluggable indexers including Flat, KD-Tree, and HNSW, with distance metrics like Euclidean, Manhattan, Hamming, and Cosine. Features HTTP/gRPC servers, TUI client, and RocksDB storage backend. ([Read more](/details/vortexdb.md)) `Open Source` `Rust` `Embedded` `Lightweight` `No Server` `Hnsw`
-
Evaluation & Observability
- Prime Radiant - Coherence Gate engine using sheaf Laplacian for mathematical consistency checks in AI responses. Implements compute ladder routing (Reflex to Human), LLM hallucination blocking, GPU/SIMD acceleration, and cryptographic audit trails. ([Read more](/details/prime-radiant.md)) `Coherence` `hallucination-detection` `graph-neural-networks` `Simd`
-
Experimental & Learning Vector DBs
- vectordb-from-scratch - vectordb-from-scratch is a Rust-based learning project implementing a vector database from basics, focusing on HNSW indexing internals and database fundamentals. Demonstrates core concepts like vector storage, ANN search, and persistence. Educational for understanding VDB architecture; not production-ready, contrasts full DBs like Qdrant. Use cases: tutorials, prototyping indexes. ([Read more](/details/vectordb-from-scratch.md)) `Open Source` `Rust` `Hnsw` `rust-learning` `hnsw-from-scratch` `educational`
-
GPU-Accelerated Vector DBs
- GPU-Accelerated Vector Indexing - Open-source project demonstrating GPU-accelerated approximate nearest neighbor search using Inverted File (IVF) indexing on embeddings from a large Wikipedia dataset. It employs K-means clustering into 128 clusters and supports configurable CUDA kernels for coarse and fine search stages. Applicable for efficient vector querying in AI applications. ([Read more](/details/gpu-accelerated-vector-indexing.md)) `Open Source` `GPU Accelerated` `GPU Support`
-
Graph Database
- HugeGraph - Apache's distributed graph database from Baidu with vector search capabilities via HNSW and DiskANN indexes, supporting billion-scale graph + vector workloads for fraud detection and knowledge graphs. ([Read more](/details/hugegraph.md)) `Apache` `Baidu` `Distributed` `Diskann` `Billion Scale`
- Kuzu - Embedded property graph database with native vector indexing support, combining graph query capabilities with HNSW-based vector search for graph-native AI applications and GraphRAG patterns. ([Read more](/details/kuzu.md)) `Embedded` `Graph Database` `Property Graph` `Graphrag` `Vector Index`
- Neo4j - The leading native graph database platform with vector search support via vector indexes, enabling hybrid graph+vector queries (GraphRAG) for knowledge-intensive AI applications with Cypher query language. ([Read more](/details/neo4j.md)) `Graph Database` `Cypher` `Graphrag` `Vector Index` `Knowledge Graph`
-
Graph-Enhanced Vector DBs
- HugeGraph - Graph-enhanced vector database enabling hybrid graph+vector search for KG RAG applications. Supports graph queries with HNSW/DiskANN vector indexes for efficient multi-hop retrieval over knowledge graphs. Unlike pure vector databases like Pinecone, it natively models relationships for superior connected data reasoning and traversal. ([Read more](/details/hugegraph.md)) `Graph Database` `Kg Rag`
- ruvector-graph - Graph-enhanced vector database enabling hybrid graph+vector search for KG RAG applications. Supports Cypher queries with HNSW vector indexes for efficient multi-hop retrieval over knowledge graphs. Unlike pure vector databases like Pinecone, it natively models relationships for superior connected data reasoning and traversal. ([Read more](/details/ruvector-graph.md)) `Graph Database` `Kg Rag` `Cypher`
-
Hybrid Vector Stores
- Redis Vector Search - Redis Vector Search (part of Redis Stack) enables vector similarity search on Redis with HNSW indexing, hybrid BM25+vector, and metadata filtering. It leverages Redis caching for low-latency real-time apps like semantic search. Vs dedicated DBs like Pinecone, Redis offers multi-model (JSON/KV + vectors) but requires more config for scale. ([Read more](/details/redis-vector-search.md)) `In-Memory Vector Search` `Hybrid BM25` `High Throughput` `Redis Stack` `hybrid bm25` `real time cache` `Redisearch`
-
In-Memory Hybrid Vector Stores
- RedisVL - RedisVL extends Redis with vector search via RediSearch module, HNSW indexes, hybrid BM25+vector. Great for caching/real-time RAG; vs dedicated VDBs leverages Redis speed/multi-model. Features: JSON payloads, streaming. ([Read more](/details/redisvl.md)) `Hybrid Bm25 Vector` `Real Time Cache`
-
Integrations & Extensions
- Neo4j Vector Search - Vector similarity search in Neo4j enabling GraphRAG by combining knowledge graphs with vector embeddings. ([Read more](/details/neo4j-graph-vector-search.md)) `Graph Database` `Knowledge Graph` `Rag`
- SQL Server Vector Search - Native vector search capabilities in SQL Server 2022 and Azure SQL, enabling vector similarity search alongside traditional relational data. Supports storing vectors as varbinary and performing approximate nearest neighbor queries. ([Read more](/details/sql-server-vector-search.md)) `Sql` `Microsoft` `database` `Hybrid`
-
Libraries
- LangChain4j - LangChain4j is a Java library providing vector search and embedding capabilities for LLM applications via integrations with ANN indexes like HNSW and FAISS, without needing full vector databases. Features include support for quantization, tool calling, and seamless embedding in JVM environments like Spring Boot and Quarkus. Suited for prototyping RAG agents and embedded apps; lighter and more JVM-native than Milvus, easier integration vs hnswlib. ([Read more](/details/langchain4j.md)) `Open Source` `Java` `Rag` `ANN Library` `Embeddable`
-
llm-frameworks
- ruvector-sona - Rust crate for Self-Optimizing Neural Architecture (SONA) with LoRA adaptation, EWC++ plasticity, and ReasoningBank learning. Enables continuous improvement in LLM routers and agents without forgetting. ([Read more](/details/ruvector-sona.md)) `Open Source` `Rust` `sona` `lora`
-
Llm Frameworks
- Haystack - An open-source NLP framework for building end-to-end search systems, which can leverage vector search capabilities. ([Read more](/details/haystack.md)) `NLP` `framework` `open-source`
- LlamaIndex - LlamaIndex is a data framework for large language model (LLM) applications, providing tools to ingest, structure, and access private or domain-specific data, often integrating with vector databases for retrieval augmented generation (RAG). ([Read more](/details/llamaindex.md)) `LLM` `RAG` `framework`
- DSPy - Programming framework for RAG and AI applications with cutting-edge optimization capabilities, featuring the lowest framework overhead and automatic improvement based on example data. ([Read more](/details/dspy.md)) `Rag` `Python` `Optimization`
-
LLM Frameworks
- CrewAI - Leading open-source framework for orchestrating autonomous AI agent crews with production-ready workflows combining collaborative intelligence through Crews and precise control through Flows. ([Read more](/details/crewai.md)) `multi-agent` `Python` `autonomous`
- AutoGen - Open-source framework from Microsoft Research for building AI agents and facilitating multi-agent cooperation with support for two-agent chat, sequential chat, and group chat patterns. ([Read more](/details/autogen.md)) `Microsoft` `multi-agent` `conversational-ai`
- AutoRAG - Automated framework for optimizing Retrieval Augmented Generation pipelines using AutoML-style techniques to find the best RAG module combinations and parameters for specific datasets. ([Read more](/details/autorag-optimization-framework.md)) `Rag` `Optimization` `Automl`
- Canopy - Open-source Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone, providing automatic chunking, embedding, chat history management, and query optimization. ([Read more](/details/canopy-rag-framework.md)) `Rag` `Open Source` `Context Engine`
- Embedchain - Open Source RAG Framework designed to be 'Conventional but Configurable', streamlining the creation of RAG applications with efficient data management, embeddings generation, and vector storage. ([Read more](/details/embedchain.md)) `Rag` `Open Source` `Python`
- FlashRAG - Python toolkit for efficient RAG research providing 36 pre-processed benchmark datasets and 23 state-of-the-art RAG algorithms in a unified, modular framework for reproduction and development. ([Read more](/details/flashrag.md)) `Rag` `Open Source` `Python`
- h2oGPT - Apache 2.0 open-source project for querying and summarizing documents or chatting with local private GPT LLMs. Supports Ollama, Mixtral, llama.cpp with persistent databases (Chroma, Weaviate, FAISS) and accurate embeddings. ([Read more](/details/h2ogpt.md)) `Open Source` `Privacy` `Local Llm`
- LightRAG - Simple and efficient retrieval-augmented generation framework that combines document retrieval with generation, focusing on speed and ease of use. Designed to run on standard CPUs and laptops with minimal resource requirements. ([Read more](/details/lightrag.md)) `Rag` `Lightweight` `Open Source`
- LLMWare - Retrieval-augmented generation framework that utilizes small, specialized models instead of large language models, significantly reducing computational and financial costs while offering cost-effective RAG solutions that can run on standard hardware. ([Read more](/details/llmware.md)) `Rag` `Cost Effective` `Open Source`
- Mirascope - Lightweight Python toolkit for LLM application development that provides modular building blocks with a unified interface across providers, emphasizing Python-first design without unnecessary abstractions. ([Read more](/details/mirascope.md)) `Python` `modular` `Multi Provider`
- Neo4j GraphRAG Python - Official Neo4j package for building graph retrieval augmented generation (GraphRAG) applications in Python. Enables developers to create knowledge graphs and implement advanced retrieval methods including graph traversals, text-to-Cypher, and vector searches. ([Read more](/details/neo4j-graphrag-python.md)) `Graphrag` `Knowledge Graph` `Rag`
- NVIDIA NeMo Retriever - Collection of industry-leading Nemotron RAG models delivering 50% better accuracy, 15x faster multimodal PDF extraction, and 35x better storage efficiency for building enterprise-grade retrieval-augmented generation pipelines. ([Read more](/details/nvidia-nemo-retriever.md)) `Rag` `Multimodal` `Microservices`
- RAGatouille - Python library designed to simplify the integration and training of state-of-the-art late-interaction retrieval methods, particularly ColBERT, within RAG pipelines with a modular and user-friendly interface. ([Read more](/details/ragatouille.md)) `Rag` `Colbert` `Retrieval`
- RAGFlow - Open-source RAG engine based on deep document understanding with citation-backed responses. RAGFlow extracts tables, images, and structured data from complex documents, providing truthful question-answering with well-founded citations. ([Read more](/details/ragflow.md)) `Open Source` `Rag` `Document Understanding`
- Semantic Kernel - Open-source SDK from Microsoft that enables developers to build AI agents and integrate LLMs into applications with support for multi-agent orchestration, function calling, and memory management across C#, Python, and Java. ([Read more](/details/semantic-kernel.md)) `Microsoft` `multi-agent` `orchestration`
- Mastra - AI agent framework featuring Observational Memory that achieves 95% on LongMemEval with 5-40x compression and stable, reproducible context windows. ([Read more](/details/mastra.md)) `Agent Framework` `Observational Memory` `Compression`
- Emergence AI - Enterprise agentic platform for automating workflows with self-improving agents using plan-execute-verify framework. Achieved 86% accuracy on LongMemEval benchmark. ([Read more](/details/emergence-ai.md)) `Enterprise` `Workflow Automation` `Multi Agent`
- Letta - Platform for building stateful AI agents with advanced memory that can learn and self-improve over time. Uses OS-inspired approach with main context as RAM and external storage as disk. ([Read more](/details/letta.md)) `Ai Agents` `Memory` `Stateful`
- OpenJarvis - Local-first framework for building on-device personal AI agents with tools, memory, and learning capabilities. Runs entirely on-device with five composable primitives: intelligence, engine, agents, tools & memory, and learning. ([Read more](/details/openjarvis.md)) `On Device` `Local First` `Ai Agents`
- Prem AI - Swiss-based sovereign AI platform for enterprises needing full data control. Features cryptographic verification, zero-data-retention architecture, and complete model lifecycle management. ([Read more](/details/prem-ai.md)) `Sovereign Ai` `Privacy` `Enterprise`
- smolagents - Minimalist AI agent framework from Hugging Face that enables powerful agents in just a few lines of code with a code-first approach and support for any LLM. ([Read more](/details/smolagents.md)) `Ai Agents` `Minimalist` `Code First`
- ACE Framework - Agentic Context Engineering (ACE) framework for generating, managing, and leveraging instructive contexts in a structured way for self-improving language models. Studies how to systematically create and utilize context that guides Agent behavior, including tool usage guides and best practices. ([Read more](/details/ace-framework.md)) `Context Engineering` `Agents` `Self Improvement`
- MemVerse - Multimodal memory system for lifelong learning agents capable of simultaneously understanding and remembering text, images, and video. Represents a step beyond traditional text-only memory systems toward multimodal context management for AI agents operating in diverse data environments. ([Read more](/details/memverse.md)) `Multimodal Memory` `Lifelong Learning` `Agents`
- PrivateGPT - Production-ready AI project for private, local document Q&A using RAG. 100% private with no data leaving your environment, supporting offline operation with local LLMs and vector databases. ([Read more](/details/privategpt.md)) `privacy` `Local` `Rag`
-
Llm Tools
- HuggingFace Text Embedding Server - A server that provides text embeddings, serving as a backend for embedding functions used with vector databases. ([Read more](/details/huggingface-text-embedding-server.md)) `embeddings` `Hugging Face` `API`
- Ollama - A tool that allows users to run large language models locally, providing an easy way to set up and interact with various models, including integrations for generating and managing embeddings with vector databases. ([Read more](/details/ollama.md)) `LLM` `local` `tool`
- RAGAS - A framework for performing Retrieval-Augmented Generation (RAG) evaluation, supporting multiple ways of validating results. ([Read more](/details/ragas.md)) `RAG` `evaluation` `LLM`
- VectorAdmin - VectorAdmin is a universal vector database management system that serves as a frontend for vector databases, helping users manage, inspect, and work with vector data across different backends while reducing time spent wrangling vectors and associated embedding costs. ([Read more](/details/vectoradmin.md)) `management` `GUI` `vector stores`
- Elysia - Elysia is an open-source, decision-tree-based agentic system built on top of Weaviate that orchestrates tools and vector-search workflows, demonstrating how to build complex AI agents that leverage a vector database as a core component. ([Read more](/details/elysia.md)) `RAG` `tools` `vector search`
- Verba - Verba is a community-driven, open-source Retrieval-Augmented Generation (RAG) application that provides an end-to-end, user-friendly interface for building RAG workflows on top of a vector database, showcasing practical semantic search and retrieval patterns with Weaviate. ([Read more](/details/verba.md)) `RAG` `semantic search` `open-source`
- Langfuse - Open-source LLM engineering platform providing observability, metrics, evaluations, and prompt management. Integrates with OpenTelemetry, LangChain, OpenAI SDK, and vector databases for RAG pipeline monitoring. ([Read more](/details/langfuse.md)) `Observability` `Open Source` `Prompt Management`
- Langtrace - Open-source LLM observability tool built on OpenTelemetry standards. Automatically captures traces from LLM APIs, vector databases, and frameworks with support for over 30 popular providers. ([Read more](/details/langtrace.md)) `Observability` `Open Source` `Opentelemetry`
- Monte Carlo Vector Database Observability - Data observability platform specifically supporting vector databases including Pinecone, providing comprehensive monitoring across the five pillars of data observability. ([Read more](/details/monte-carlo-vector-database-observability.md)) `Observability` `Data Quality` `Monitoring`
-
LLM Tools
- Mem0 - A universal, self-improving AI memory layer for LLM applications that delivers 26% higher accuracy, 91% lower latency, and 90% token savings through intelligent memory extraction and retrieval, backed by $24M in funding and selected as AWS's exclusive memory provider. ([Read more](/details/mem0.md)) `Agent Memory` `Llm Optimization` `Managed Service`
- ARES - RAG evaluation framework that trains lightweight judges for retrieval and generation scoring, refining evaluation by training specialized LLM judges on synthetic datasets to provide more reliable, confidence-aware judgments. ([Read more](/details/ares.md)) `Evaluation` `Rag` `Open Source`
- AWQ - Activation-aware Weight Quantization method that preserves model accuracy at 4-bit quantization by identifying and skipping important weights. Maintains 99%+ of original performance with moderate inference speed improvements. ([Read more](/details/awq.md)) `Quantization` `Optimization` `Compression`
- Cohere Rerank - Proprietary neural network reranker accessed via API that processes query and document together as a cross-encoder to precisely judge relevance. Supports over 100 languages with Rerank 3 Nimble variant for faster production performance. ([Read more](/details/cohere-rerank.md)) `Reranking` `Api` `Multilingual`
- DeepEval - Comprehensive LLM evaluation framework offering 50+ ready-to-use metrics for RAG, agents, and chatbots, featuring G-Eval for custom criteria and multi-turn conversation evaluation with human-like accuracy. ([Read more](/details/deepeval.md)) `Evaluation` `Testing` `Metrics`
- Docling - Open-source document parsing framework from IBM with 97.9% accuracy in complex table extraction and excellent text fidelity. Self-hostable solution for converting PDFs, spreadsheets, and scanned images into structured data for RAG pipelines. ([Read more](/details/docling.md)) `Document Parsing` `Open Source` `Rag`
- Document Loaders - Components in LLM frameworks that fetch and parse data from various sources (PDFs, websites, databases) into a standardized format for processing. Essential first step in RAG pipelines for converting raw data into processable documents. ([Read more](/details/document-loaders.md)) `Document Processing` `Loaders` `Rag`
- Feder - Visualization tool for ANNS (Approximate Nearest Neighbor Search) algorithms enabling users to observe index structures, parameter configurations, and the complete vector similarity search process. ([Read more](/details/feder.md)) `Visualization` `Ann` `Hnsw`
- FiftyOne - Computer vision interface for vector search with native integrations for Qdrant, Pinecone, LanceDB, and Milvus. Enables natural language search, configurable vector database backends, and visualization of search matches across billions of images. ([Read more](/details/fiftyone.md)) `Computer Vision` `Visualization` `Vector Search`
- Flowise - Open-source no-code platform built on LangChain for visually building AI workflows, agents, and chatbots using drag-and-drop components with ready-to-use templates and seamless cloud deployment. ([Read more](/details/flowise.md)) `No Code` `Langchain` `visual-builder`
- GGUF - GPT-Generated Unified Format for storing quantized model weights, designed for CPU inference and consumer hardware. Enables running LLMs on laptops and edge devices with flexible layer offloading to GPU. ([Read more](/details/gguf.md)) `Quantization` `Cpu` `Format`
- GPTQ - Post-training quantization method for 4-bit weight compression that focuses on GPU inference performance. First quantization method to compress LLMs to 4-bit range while maintaining accuracy, minimizing mean squared error to weights. ([Read more](/details/gptq.md)) `Quantization` `Compression` `Optimization`
- Guardrails AI - Python framework for building reliable AI applications through input/output validation, with a hub of pre-built validators for detecting risks like PII, profanity, and logical fallacies in LLM outputs. ([Read more](/details/guardrails-ai.md)) `validation` `safety` `quality-control`
- KRAGEN - Knowledge Retrieval Augmented Generation ENgine that combines knowledge graphs with RAG using graph-of-thoughts prompting to solve complex biomedical problems with transparent, evidence-based reasoning. ([Read more](/details/kragen.md)) `Knowledge Graph` `biomedical` `graph-of-thoughts`
- llamafile - Single-file executable that bundles LLM weights and llama.cpp runtime. Distribute and run LLMs locally with no installation, including embedding generation via built-in server. ([Read more](/details/llamafile.md)) `Local Llm` `Single File` `Embeddings`
- LlamaParse - High-performance document parsing service by LlamaIndex that consistently processes documents in about 6 seconds regardless of size. Returns rich Markdown and optional HTML tables with wide format support through hosted API. ([Read more](/details/llamaparse.md)) `Document Parsing` `Api` `Rag`
- Milvus WebUI - Built-in GUI introduced in Milvus v2.5 for system observability, offering real-time monitoring of system health, collection management, and query optimization from a unified dashboard. ([Read more](/details/milvus-webui.md)) `Visualization` `Monitoring` `Milvus`
- Nomic Atlas - AI-ready data visualization platform for massive datasets of embeddings. Atlas enables interactive exploration of millions of vectors in your web browser, with automatic dimensionality reduction and semantic clustering. ([Read more](/details/nomic-atlas.md)) `Visualization` `Embeddings` `Analytics`
- Opik - An open-source LLM observability and evaluation platform that provides comprehensive tracking, monitoring, and evaluation capabilities for large language model applications. Designed for production AI systems with focus on debugging and performance optimization. ([Read more](/details/opik.md)) `Observability` `Monitoring` `Llm`
- Promptfoo - Open-source CLI and library for evaluating and red-teaming LLM applications with automated testing, security vulnerability scanning, and CI/CD integration. Recently acquired by OpenAI but remains open-source. ([Read more](/details/promptfoo.md)) `Testing` `red-teaming` `Evaluation`
- Ragas - RAG Assessment framework for Python providing reference-free evaluation of RAG pipelines using LLM-as-a-judge, measuring context relevancy, context recall, faithfulness, and answer relevancy with automatic test data generation. ([Read more](/details/ragas.md)) `Evaluation` `Rag` `Testing`
- Rivet - Open-source visual AI programming environment from Ironclad for building complex AI agents and prompt chains using node-based drag-and-drop interface with real-time debugging capabilities. ([Read more](/details/rivet.md)) `visual-programming` `No Code` `agents`
- TruLens - Open-source evaluation and tracing library for AI agents and RAG systems, combining OpenTelemetry-based tracing with trustworthy evaluations including ground truth metrics and LLM-as-a-Judge feedback for production monitoring. ([Read more](/details/trulens.md)) `Observability` `Evaluation` `Tracing`
- VectorDBZ - Enterprise-grade desktop application for managing and analyzing vector databases with interactive visualizations, supporting Qdrant, Weaviate, Milvus, ChromaDB, Pinecone, pgvector, and Elasticsearch. ([Read more](/details/vectordbz.md)) `Visualization` `Management` `Gui`
- Zep - Context engineering and agent memory platform for AI agents with sub-200ms latency. Zep uses a temporal knowledge graph architecture to deliver relationship-aware context from chat history, business data, documents, and app events. ([Read more](/details/zep.md)) `Ai Agents` `Knowledge Graph` `Memory`
- Cursor - AI-powered code editor and IDE built on VSCode with Composer 1.5 for multi-file editing, Background Agents for autonomous coding, and support for frontier models from OpenAI, Anthropic, Gemini, and xAI. ([Read more](/details/cursor.md)) `Ide` `Code Editor` `Ai Coding`
- Model Context Protocol - Open standard from Anthropic for connecting AI systems to external data sources and tools. Donated to the Linux Foundation's Agentic AI Foundation in December 2025. ([Read more](/details/model-context-protocol.md)) `Protocol` `Integration` `Open Standard`
- Supermemory - State-of-the-art AI agent memory system using ASMR technique that achieved ~99% accuracy on LongMemEval benchmark with multi-agent orchestrated pipeline. ([Read more](/details/supermemory.md)) `Agent Memory` `Asmr` `Benchmark`
- Agent Client Protocol - Protocol that enables AI coding assistants like Cursor to integrate with JetBrains IDEs, allowing developers to use frontier models across different development environments. ([Read more](/details/agent-client-protocol.md)) `Protocol` `Integration` `Ide`
- Blaxel - Perpetual sandbox platform for AI agents that achieves sub-25ms resume times from standby mode with infinite state persistence and zero compute charges during idle periods. ([Read more](/details/blaxel.md)) `Sandbox` `Perpetual` `Microvm`
- COPRO - A DSPy optimizer that generates and refines new instructions for each step in language model pipelines, optimizing them with coordinate ascent. Automates the prompt engineering process by systematically improving instruction quality through iterative refinement. ([Read more](/details/copro.md)) `Optimization` `Prompt Engineering` `automated`
- E2B - Open-source cloud infrastructure providing secure sandboxes for AI agents to run code in isolated environments. Sandboxes start in 80ms and support Python, JavaScript, Ruby, and C++ on Linux. ([Read more](/details/e2b.md)) `Sandbox` `Security` `Infrastructure`
- Firecracker microVM - Open-source virtualization technology from AWS that powers secure sandboxes for AI agents with hardware-level isolation. Used by E2B and other sandbox platforms. ([Read more](/details/firecracker-microvm.md)) `Virtualization` `Security` `Microvm`
- Modal - Serverless compute platform for AI with custom Rust-based infrastructure that spins up GPU-enabled containers in one second, supporting Python workloads with per-second billing. ([Read more](/details/modal.md)) `Serverless` `Gpu` `Infrastructure`
- NVIDIA NIM - Accelerated inference microservices that allow organizations to run AI models on NVIDIA GPUs anywhere with optimized inference engines, industry-standard APIs, and runtime dependencies in enterprise-grade containers. ([Read more](/details/nvidia-nim.md)) `Inference` `Microservices` `Gpu`
- OpenLLMetry - Open-source observability for GenAI and LLM applications based on OpenTelemetry, providing AI-aware instrumentation for vector databases, LLM frameworks, and model providers. ([Read more](/details/openllmetry.md)) `Observability` `Monitoring` `Tracing`
- USD Code NIM - NVIDIA NIM microservice that answers OpenUSD questions and automatically generates OpenUSD-Python code from text prompts for 3D workflow automation. ([Read more](/details/usd-code-nim.md)) `3d` `Code Generation` `Nvidia`
- Vanna AI - RAG-powered text-to-SQL framework that enables natural language querying of SQL databases using vector search for retrieval of relevant schema, documentation, and example queries. ([Read more](/details/vanna-ai.md)) `Text To Sql` `Rag` `Llm`
- W&B Weave - LLM observability platform from Weights & Biases that automatically tracks all LLM calls, evaluations, and experiments with support for prompt engineering and vector store integration. ([Read more](/details/wandb-weave.md)) `Observability` `Experiment Tracking` `Prompt Engineering`
- Wren AI - Open-source GenBI platform that queries databases in natural language, generates SQL (Text-to-SQL), charts (Text-to-Chart), and AI-powered business intelligence using RAG architecture. ([Read more](/details/wren-ai.md)) `Text To Sql` `Business Intelligence` `Rag`
- Xinference - Open-source platform for serving LLMs, embedding models, and multimodal models with OpenAI-compatible APIs, distributed deployment, and automatic batching for scalable AI model inference. ([Read more](/details/xinference.md)) `Model Serving` `Embeddings` `Inference`
- Amazon Bedrock Knowledge Bases - A fully managed service within Amazon Bedrock that automates the retrieval-augmented generation (RAG) workflow by ingesting unstructured and structured data, converting it into embeddings, and storing them in supported vector databases. It enables grounding generative AI responses with enterprise data without manual orchestration. ([Read more](/details/amazon-bedrock-knowledge-bases.md)) `Managed Service` `Rag` `Aws`
- Augment Code - AI-powered code search and coding assistant tool that uses fine-tuned specialized embedding models for code semantics rather than relying on simple string matching (Grep). Provides context for coding assistants through semantically similar code snippets beyond exact string matching. ([Read more](/details/augment-code.md)) `Code Search` `Embedding Models` `Developer Tools`
- GEPA - Genetic algorithm-based prompt optimizer within the DSPy framework. Uses evolutionary strategies to iteratively improve prompt text, including prompts containing tool usage logic. Part of DSPy's suite of optimization methods for automatically enhancing language model program performance. ([Read more](/details/gepa.md)) `Prompt Optimization` `Genetic Algorithms` `Dspy`
- Inference - A powerful RAG application platform delivering OpenAI-compatible serverless inference APIs for top open-source LLM models. Offers specialized batch processing for large-scale async AI workloads and document extraction capabilities designed for RAG applications, balancing cost-efficiency with high performance. ([Read more](/details/inference.md)) `Serverless` `Rag` `Inference Api`
- Helicone - Open-source observability layer designed to help developers monitor and understand how their applications interact with large language models. Acts as a lightweight proxy between applications and LLM providers. ([Read more](/details/helicone.md)) `observability` `Monitoring` `Open Source`
- LiteLLM - Open-source proxy and SDK that provides a single unified API to call and manage hundreds of different LLM providers and models with OpenAI-compatible endpoints. Simplifies multi-provider LLM integration. ([Read more](/details/litellm.md)) `Open Source` `api` `Llm`
- Recursive Character Text Splitter - Document chunking strategy that splits text at hierarchical boundaries like paragraphs, sentences, or headings. Industry-standard approach recommended as starting point with 400-512 tokens and 10-20% overlap for optimal RAG performance. ([Read more](/details/recursive-character-text-splitter.md)) `chunking` `text-processing` `Rag`
- Semantic Chunker - Document chunking strategy that dynamically chooses split points between sentences based on embedding similarity rather than fixed sizes. Maintains semantic coherence by grouping related content together for improved RAG retrieval. ([Read more](/details/semantic-chunker.md)) `chunking` `Semantic Search` `Embeddings`
- Xinference - Open-source platform for serving LLMs, embedding models, and multimodal models with OpenAI-compatible APIs, distributed deployment, and automatic batching for scalable AI model inference. ([Read more](/details/xinference.md)) `model-serving` `Embeddings` `inference`
-
Machine Learning Models
- all-MiniLM-L6-v2 - A compact and efficient pre-trained sentence embedding model, widely used for generating vector representations of text. It's a popular choice for applications requiring fast and accurate semantic search, often integrated with vector databases. ([Read more](/details/all-minilm-l6-v2.md)) `embeddings` `NLP` `AI`
Programming Languages
Categories
Research Papers & Surveys
135
Concepts & Definitions
131
Vector Database Engines
91
Sdks & Libraries
77
Machine Learning Models
67
Curated Resource Lists
56
LLM Tools
50
SDKs & Libraries
42
Open Sources
32
Sdks Libraries
28
Benchmarks & Evaluation
26
Vector Database Extensions
26
LLM Frameworks
24
Managed Vector Databases
21
Data Integration & Migration
20
Core Vector Databases
18
Multi Model & Hybrid Databases
15
Cloud Services
13
Llm Tools
9
Security & Governance
9
Commerce
8
vector-database-engines
5
Relational Databases
5
Embedded Vector Databases
5
Vector Database
4
Data Processing
4
ANN Indexing Libraries
3
Embedded & Edge Vector Databases
3
Cloud & Managed
3
Vector DB Research & Surveys
3
Graph Database
3
Llm Frameworks
3
Integrations & Extensions
2
⭐ Star History
2
curated-resource-lists
2
Open Source Vector Databases
2
Quantum-Safe Vector DBs
2
🔥 Acknowledgements
2
Rust-based Vector Databases
2
Graph-Enhanced Vector DBs
2
Multi-Model & Hybrid Databases
2
Multimodal Vector DBs
1
Rust-Based Vector DBs
1
Search & Retrieval
1
Edge Database
1
AI Agent Optimized VDBs
1
Evaluation & Observability
1
Tools
1
serverless-managed-vector-dbs
1
Multimodal Vector Databases
1
Cloud-managed Vector Databases
1
llm-frameworks
1
Vector Indexing Libraries
1
research-papers-surveys
1
GPU-Accelerated Vector DBs
1
Scalable Distributed Vector DBs
1
In-Memory Hybrid Vector Stores
1
2026 Trends & Startups
1
Managed and Serverless Vector DBs
1
Wasm/Edge Runtime VDBs
1
Managed & Serverless Vector DBs
1
Hybrid Vector Stores
1
Experimental & Learning Vector DBs
1
Developer Tools & Benchmarks
1
Libraries
1
Sub Categories
Keywords
vector-database
33
vector-search
31
llm
29
rag
22
ai
20
similarity-search
17
nearest-neighbor-search
17
embeddings
16
search-engine
15
approximate-nearest-neighbor-search
15
machine-learning
14
information-retrieval
13
database
13
rust
11
search
11
python
10
retrieval-augmented-generation
10
semantic-search
9
openai
9
vector
8
vector-search-engine
8
chatgpt
8
milvus
7
gpt
7
llms
6
hnsw
6
faiss
6
benchmark
6
qdrant
5
knn-search
5
sql
5
artificial-intelligence
5
clustering
5
nearest-neighbors
5
hybrid-search
5
full-text-search
5
embedding
5
image-search
5
retrieval
4
webassembly
4
postgresql
4
vectors
4
gpt-4
4
large-language-models
4
pinecone
4
ann
4
ml
4
agents
3
elasticsearch
3
genai
3