An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with vector-search

A curated list of projects in awesome lists tagged with vector-search.

https://github.com/onyx-dot-app/onyx

Open Source AI Platform - AI Chat with advanced features that works with every LLM

ai ai-chat chatgpt chatui enterprise-search gen-ai information-retrieval llm llm-ui nextjs python rag self-hosted vector-search

Last synced: 02 Jun 2026

https://github.com/qdrant/qdrant

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

ai-search ai-search-engine embeddings-similarity hnsw image-search knn-algorithm machine-learning mlops nearest-neighbor-search neural-network neural-search recommender-system search search-engine search-engines similarity-search vector-database vector-search vector-search-engine

Last synced: 12 May 2025

https://github.com/typesense/typesense

Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences

algolia datastore elasticsearch enterprise-search faceting full-text-search fuzzy-search geosearch in-memory instantsearch merchandising pinecone search search-engine semantic-search similarity-search site-search synonyms typo-tolerance vector-search

Last synced: 13 May 2025

https://github.com/weaviate/weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.

approximate-nearest-neighbor-search generative-search grpc hnsw hybrid-search image-search information-retrieval mlops nearest-neighbor-search neural-search recommender-system search-engine semantic-search semantic-search-engine similarity-search vector-database vector-search vector-search-engine vectors weaviate

Last synced: 02 Jun 2026

https://github.com/tencent/weknora

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

agent agentic ai chatbot chatbots embeddings evaluation generative-ai golang knowledge-base llm multi-tenant multimodel ollama openai question-answering rag reranking semantic-search vector-search

Last synced: 15 Apr 2026

https://github.com/StarTrail-org/LEANN

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

ai faiss gpt-oss langchain llama-index llm localstorage offline-first ollama privacy python rag retrieval-augmented-generation vector-database vector-search vectors

Last synced: 02 Jun 2026

https://github.com/yichuan-w/leann

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

ai faiss gpt-oss langchain llama-index llm localstorage offline-first ollama privacy python rag retrieval-augmented-generation vector-database vector-search vectors

Last synced: 08 Mar 2026

https://github.com/oramasearch/orama

🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.

algiorithm data-structures full-text javascript node search search-algorithm search-engine typescript typo-tolerance vector vector-database vector-database-embedding vector-search vector-search-engine

Last synced: 12 May 2025

https://github.com/databendlabs/databend

Data Agent Ready Warehouse: One for Analytics, Search, AI, Python Sandbox. Rebuilt from scratch. Unified architecture on your S3.

ai bigdata cloud-native database elasticsearch geospatial lakehouse olap rust serverless snowflake sql vector-database vector-search

Last synced: 18 May 2026

https://github.com/askorama/orama

🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.

algiorithm data-structures full-text javascript node search search-algorithm search-engine typescript typo-tolerance vector vector-database vector-database-embedding vector-search vector-search-engine

Last synced: 09 Apr 2025

https://github.com/activeloopai/deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

ai computer-vision cv data-science datalake datasets deep-learning image-processing langchain large-language-models llm machine-learning ml mlops multi-modal python pytorch tensorflow vector-database vector-search

Last synced: 11 Feb 2026

https://github.com/srbhr/resume-matcher

Resume Matcher is an open source, free tool to improve your resume. It works by using AI, Reader LLMs, to compare and rank resumes with job descriptions.

applicant-tracking-system ats hacktoberfest machine-learning natural-language-processing nextjs python resume resume-builder resume-parser text-similarity typescript vector-search word-embeddings

Last synced: 08 May 2025

https://github.com/srbhr/Resume-Matcher

Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.

applicant-tracking-system ats hacktoberfest machine-learning natural-language-processing nextjs python resume resume-builder resume-parser text-similarity typescript vector-search word-embeddings

Last synced: 26 Mar 2025

https://github.com/microsoft/sptag

A distributed approximate nearest neighbor (ANN) search library that provides high-quality vector index building, search, and distributed online serving toolkits for large-scale vector search scenarios.

approximate-nearest-neighbor-search distributed-serving fresh-update neighborhood-graph space-partition-tree vector-search

Last synced: 07 May 2025

https://github.com/microsoft/SPTAG

A distributed approximate nearest neighbor (ANN) search library that provides high-quality vector index building, search, and distributed online serving toolkits for large-scale vector search scenarios.

approximate-nearest-neighbor-search distributed-serving fresh-update neighborhood-graph space-partition-tree vector-search

Last synced: 15 Mar 2025

https://github.com/Tencent/TencentDB-Agent-Memory

TencentDB Agent Memory delivers fully local long-term memory for AI Agents via a 4-tier progressive pipeline, with zero external API dependencies.

agent ai-agent embedding llm local-first long-term-memory memory openclaw-plugin vector-search

Last synced: 28 May 2026

https://github.com/infiniflow/infinity

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

ai-native approximate-nearest-neighbor-search bm25 cpp20 cpp20-modules embedding full-text-search hnsw hybrid-search information-retrival nearest-neighbor-search rag search-engine tensor-database vector vector-database vector-search vectordatabase

Last synced: 02 Apr 2026

https://github.com/RyanCodrai/turbovec

A vector index built on TurboQuant, written in Rust with Python bindings

ann avx512 embedding embeddings faiss nearest-neighbor neon python quant quantization rag rust simd turboquant vector-search

Last synced: 02 Jun 2026

https://github.com/pashpashpash/vault-ai

OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, epub, etc) using a simple React frontend.

ai artificial-intelligence chatgpt generative go golang knowledge-base long-term-memory machine-learning openai openai-api pdf-support pinecone qdrant-vector-database question-answering react reactjs vector-search

Last synced: 14 May 2025

https://github.com/hegelai/prompttools

Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).

deep-learning developer-tools embeddings large-language-models llms machine-learning prompt-engineering python vector-search

Last synced: 14 May 2025

https://github.com/chonkie-ai/chonkie

🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library

ai chunking etl nlp python rag retrieval semantic-segmentation text-chunking text-processing text-splitting vector-search

Last synced: 14 May 2025

https://github.com/unum-cloud/usearch

Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

approximate-nearest-neighbor-search clustering database faiss full-text-search fuzzy-search image-search kann nearest-neighbor-search recommender-system search search-engine semantic-search simd similarity-search text-search vector-search webassembly

Last synced: 16 Apr 2026

https://github.com/asg017/sqlite-vss

A SQLite extension for efficient vector search, based on Faiss!

faiss sqlite sqlite-extension vector-search

Last synced: 14 May 2025

https://github.com/mintplex-labs/vector-admin

The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

ai ai-agents aitools chroma database-management document-retrieval embeddings flowise langchain langchain-js llms pinecone qdrant vector-data-management vector-database vector-database-embedding vector-search vectordatabase vectorspace weaviate

Last synced: 31 Oct 2025

https://github.com/Mintplex-Labs/vector-admin

The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

ai ai-agents aitools chroma database-management document-retrieval embeddings flowise langchain langchain-js llms pinecone qdrant vector-data-management vector-database vector-database-embedding vector-search vectordatabase vectorspace weaviate

Last synced: 24 Mar 2025

https://github.com/datastax/jvector

JVector: the most advanced embedded vector search engine

ann java knn machine-learning search-engine similarity-search vector-search

Last synced: 03 Apr 2026

https://github.com/ashvardanian/NumKong

SIMD-accelerated distances, dot products, matrix ops, geospatial & geometric kernels for 16 numeric types (from 6-bit floats to 64-bit complex) across x86, Arm, RISC-V, and WASM, with bindings for Python, Rust, C, C++, Swift, JS, and Go 📐

arm-neon assembly blas cpp golang information-retrieval javascript matrix-multiplication metrics numpy rust scipy simd swift tensor vector-search

Last synced: 22 Mar 2026

https://github.com/supabase-community/nextjs-openai-doc-search

Template for building your own custom ChatGPT style doc search powered by Next.js, OpenAI, and Supabase.

ai chatgpt nextjs openai postgres supabase template vector-search

Last synced: 15 May 2025

https://github.com/supabase-community/nextjs-openai-doc-search?og=v2

Template for building your own custom ChatGPT style doc search powered by Next.js, OpenAI, and Supabase.

ai chatgpt nextjs openai postgres supabase template vector-search

Last synced: 21 Apr 2025

https://github.com/jbellis/jvector

JVector: the most advanced embedded vector search engine

ann java knn machine-learning search-engine similarity-search vector-search

Last synced: 13 Mar 2025

https://github.com/qdrant/fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

embeddings openai rag retrieval retrieval-augmented-generation vector-search

Last synced: 26 Mar 2025

https://github.com/superlinked/superlinked

Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.

data-pipeline deep-learning embeddings etl information-retrieval llm ml mlops natural-language-processing nlp python retrieval retrieval-augmented-generation semantic-search vector-database vector-search vectorization

Last synced: 16 Jan 2026

https://github.com/ashvardanian/simsimd

Up to 200x Faster Dot Products & Similarity Metrics for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for AVX2, AVX-512, NEON, SVE, & SVE2 📐

arm-neon arm-sve assembly avx2 avx512 bfloat16 blas blas-libraries distance-calculation float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search

Last synced: 13 May 2025

https://github.com/giancarloerra/socraticode

Enterprise-grade (40m+ LOC) codebase intelligence, zero-setup, private & local Plugin/Skill or MCP: hybrid semantic search, polyglot dependency graphs, symbol-level impact analysis & call-flow, interactive HTML viewer, cross-project & branch-aware search, DB/API/infra knowledge. 61% less tokens, 84% fewer calls, 37x faster. Cloud in private beta.

ai ai-assistant ast claude claude-code code-graph codebase-intelligence context-engine docker embeddings gemini gemini-cli-extension mcp openai qdrant semantic semantic-search vector-database vector-embeddings vector-search

Last synced: 04 May 2026

https://github.com/ashvardanian/SimSIMD

Up to 200x Faster Dot Products & Similarity Metrics for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for AVX2, AVX-512, NEON, SVE, & SVE2 📐

arm-neon arm-sve assembly avx2 avx512 bfloat16 blas blas-libraries distance-calculation float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search

Last synced: 23 Mar 2025

https://github.com/unum-cloud/UForm

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

bert clip clustering contrastive-learning cross-attention huggingface-transformers image-search language-vision llava multi-lingual multimodal neural-network openai openclip pretrained-models pytorch representation-learning semantic-search transformer vector-search

Last synced: 19 Apr 2026

https://github.com/unum-cloud/uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

bert clip clustering contrastive-learning cross-attention huggingface-transformers image-search language-vision llava multi-lingual multimodal neural-network openai openclip pretrained-models pytorch representation-learning semantic-search transformer vector-search

Last synced: 14 May 2025

https://github.com/qdrant/qdrant-client

Python client for Qdrant vector search engine

qdrant vector-database vector-search vector-search-engine

Last synced: 08 Jul 2025

https://github.com/myscale/myscaledb

A @ClickHouse fork that supports high-performance vector search and full-text search.

ann big-data embedding image-search llm myscaledb rag search-engine similarity-search sql sql-vector unstructured-analytics vector-search vectordb

Last synced: 12 Jan 2026

https://github.com/tantaraio/voy

🕸️🦀 A WASM vector similarity search written in Rust

k-d-tree nearest-neighbor-search rust similarity-search vector-search wasm wasm-pack webassembly

Last synced: 01 Apr 2025

https://github.com/myscale/MyScaleDB

A @ClickHouse fork that supports high-performance vector search and full-text search.

ann big-data embedding image-search llm myscaledb rag search-engine similarity-search sql sql-vector unstructured-analytics vector-search vectordb

Last synced: 12 Mar 2025

https://github.com/epsilla-cloud/vectordb

Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/

ai chatgpt data data-science database embeddings embeddings-similarity infrastructure llms machine-learning neural-network neural-search rag retrieval search-engine vector-database vector-search

Last synced: 15 May 2025

https://github.com/rapidsai/raft

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.

anns building-blocks clustering cuda distance gpu information-retrieval linear-algebra llm machine-learning nearest-neighbors neighborhood-methods primitives random-sampling solvers sparse statistics vector-search vector-similarity vector-store

Last synced: 14 May 2025

https://github.com/tensorchord/vectorchord

Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.

artificial-intelligence llmops postgresql vector-database vector-search

Last synced: 21 Jun 2025

https://github.com/azure/azure-search-vector-samples

A repository of code samples for Vector search capabilities in Azure AI Search.

azure azurecognitivesearch embeddings vector vector-search

Last synced: 14 May 2025

https://github.com/arcadedata/arcadedb

ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.

arcadedb database dbms distributed docker document embedded graph k8s key-value kubernetes multi-model orientdb search-engine similarity-search time-series vector-database vector-search

Last synced: 24 Apr 2026

https://github.com/Azure/azure-search-vector-samples

A repository of code samples for Vector search capabilities in Azure AI Search.

azure azurecognitivesearch embeddings vector vector-search

Last synced: 25 Mar 2025

https://github.com/prithivirajdamodaran/flashrank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

cross-encoder full-text-search hybrid-search lexical-search rag ranking reranking retrieval-augmented-generation semantic-search vector-database vector-search

Last synced: 14 May 2025

https://github.com/weaviate/recipes

This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!

function-calling generative-ai llm-frameworks python retrieval-augmented-generation vector-database vector-search

Last synced: 15 May 2025

https://github.com/anush008/fastembed-rs

Rust library for vector embeddings and reranking. Inspired by qdrant/fastembed.

embeddings fastembed rag reranker reranking retrieval retrieval-augmented-generation vector-search

Last synced: 19 Feb 2026

https://github.com/PrithivirajDamodaran/FlashRank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

cross-encoder full-text-search hybrid-search lexical-search rag ranking reranking retrieval-augmented-generation semantic-search vector-database vector-search

Last synced: 08 May 2025

https://github.com/unum-cloud/ustore

Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang 🗄️

acid apache-arrow arrow big-data bigdata database dataloader document-database graph-database iouring json key-value-store knn-search networkx nosql pandas python search spdk vector-search

Last synced: 11 Apr 2025

https://github.com/christopherkarani/Wax

🍯 Memory layer for on-device AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer.

ai-agents cli coreml coreml-framework data-science machine-learning mcp mcp-server memory memory-cache memory-hacking metal on-device-ai rag rag-pipeline swift vector-database vector-embeddings vector-search vectordb

Last synced: 04 Mar 2026

https://github.com/redis-developer/arxivchatguru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search

Last synced: 15 May 2025

https://github.com/redis-developer/ArXivChatGuru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search

Last synced: 11 Apr 2025

https://github.com/redis-developer/ArxivChatGuru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search

Last synced: 18 Jul 2025

https://github.com/ArcadeData/arcadedb

ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.

arcadedb database dbms distributed docker document embedded graph k8s key-value kubernetes multi-model orientdb search-engine similarity-search time-series vector-database vector-search

Last synced: 23 Apr 2025

https://github.com/philippgille/chromem-go

Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence.

chroma chromadb cosine-similarity embedded embeddings go golang in-memory llm llms nearest-neighbor rag retrieval-augmented-generation vector-database vector-search

Last synced: 05 Apr 2025

https://github.com/Anush008/fastembed-rs

Rust library for generating vector embeddings, reranking locally

embeddings fastembed rag reranker reranking retrieval retrieval-augmented-generation vector-search

Last synced: 01 May 2025

https://github.com/kelindar/search

Go library for embedded vector search and semantic embeddings using llama.cpp

ai bert embeddings gguf gpu llamacpp search-engine semantic-search simd vector-search

Last synced: 16 May 2025

https://github.com/ukplab/gpl

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577

bert domain-adaptation information-retrieval nlp transformers vector-search

Last synced: 18 Jun 2025

https://github.com/UKPLab/gpl

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577

bert domain-adaptation information-retrieval nlp transformers vector-search

Last synced: 14 Jul 2025

https://github.com/qdrant/vector-db-benchmark

Framework for benchmarking vector search engines

benchmark vector-database vector-search vector-search-engine

Last synced: 15 May 2025

https://github.com/edwinkys/oasysdb

An embedded vector database designed to run on edge devices. Lightweight and fast with HNSW indexing algorithm.

approximate-nearest-neighbors edge-ai edge-computing hnsw key-value-database open-source rest-api similarity-search vector-database vector-search

Last synced: 01 Mar 2026

https://github.com/superlinked/VectorHub

VectorHub is a free, open-source learning website for people (software developers to senior ML architects) interested in adding vector retrieval to their ML stack.

ai llm llmops ml mlops vector vector-database vector-search vectorops

Last synced: 10 Apr 2025