Projects in Awesome Lists tagged with vector-search

https://github.com/meilisearch/meilisearch

A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.

ai api app-search database enterprise-search faceting full-text-search fuzzy-search geosearch hybrid-search instantsearch search search-as-you-type search-engine semantic-search site-search typo-tolerance vector-database vector-search vectors

Last synced: 11 Mar 2026

https://github.com/meilisearch/Meilisearch

A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.

ai api app-search database enterprise-search faceting full-text-search fuzzy-search geosearch hybrid-search instantsearch search search-as-you-type search-engine semantic-search site-search typo-tolerance vector-database vector-search vectors

Last synced: 07 May 2025

https://github.com/meilisearch/MeiliSearch

A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.

ai api app-search database enterprise-search faceting full-text-search fuzzy-search geosearch hybrid-search instantsearch search search-as-you-type search-engine semantic-search site-search typo-tolerance vector-database vector-search vectors

Last synced: 29 Mar 2025

https://github.com/milvus-io/milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

anns cloud-native diskann distributed embedding-database embedding-similarity embedding-store faiss golang hnsw image-search llm nearest-neighbor-search rag vector-database vector-search vector-similarity vector-store

Last synced: 22 May 2026

https://github.com/onyx-dot-app/onyx

Open Source AI Platform - AI Chat with advanced features that works with every LLM

ai ai-chat chatgpt chatui enterprise-search gen-ai information-retrieval llm llm-ui nextjs python rag self-hosted vector-search

Last synced: 02 Jun 2026

https://github.com/dragonflydb/dragonfly

A modern replacement for Redis and Memcached

cache cpp database fibers hacktoberfest in-memory in-memory-database key-value keydb memcached message-broker multi-threading nosql redis valkey vector-search

Last synced: 12 May 2025

https://github.com/qdrant/qdrant

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

ai-search ai-search-engine embeddings-similarity hnsw image-search knn-algorithm machine-learning mlops nearest-neighbor-search neural-network neural-search recommender-system search search-engine search-engines similarity-search vector-database vector-search vector-search-engine

Last synced: 12 May 2025

https://github.com/typesense/typesense

Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences

algolia datastore elasticsearch enterprise-search faceting full-text-search fuzzy-search geosearch in-memory instantsearch merchandising pinecone search search-engine semantic-search similarity-search site-search synonyms typo-tolerance vector-search

Last synced: 13 May 2025

https://github.com/weaviate/weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.

approximate-nearest-neighbor-search generative-search grpc hnsw hybrid-search image-search information-retrieval mlops nearest-neighbor-search neural-search recommender-system search-engine semantic-search semantic-search-engine similarity-search vector-database vector-search vector-search-engine vectors weaviate

Last synced: 02 Jun 2026

https://github.com/tencent/weknora

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

agent agentic ai chatbot chatbots embeddings evaluation generative-ai golang knowledge-base llm multi-tenant multimodel ollama openai question-answering rag reranking semantic-search vector-search

Last synced: 15 Apr 2026

https://github.com/StarTrail-org/LEANN

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

ai faiss gpt-oss langchain llama-index llm localstorage offline-first ollama privacy python rag retrieval-augmented-generation vector-database vector-search vectors

Last synced: 02 Jun 2026

https://github.com/neuml/txtai

💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows

ai artificial-intelligence embeddings information-retrieval language-model large-language-models llm machine-learning nlp python rag retrieval-augmented-generation search search-engine semantic-search sentence-embeddings transformers txtai vector-database vector-search

Last synced: 12 May 2025

https://github.com/yichuan-w/leann

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

ai faiss gpt-oss langchain llama-index llm localstorage offline-first ollama privacy python rag retrieval-augmented-generation vector-database vector-search vectors

Last synced: 08 Mar 2026

https://github.com/voxel51/fiftyone

Refine high-quality datasets and visual AI models

active-learning artificial-intelligence computer-vision data-centric-ai data-cleaning data-curation data-quality data-science deep-learning developer-tools image-classification machine-learning object-detection python unstructured-data vector-search visualization

Last synced: 19 Feb 2026

https://github.com/oramasearch/orama

🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.

algiorithm data-structures full-text javascript node search search-algorithm search-engine typescript typo-tolerance vector vector-database vector-database-embedding vector-search vector-search-engine

Last synced: 12 May 2025

https://github.com/databendlabs/databend

Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. — rebuilt from scratch. Unified architecture on your S3.

ai bigdata cloud-native database elasticsearch geospatial lakehouse olap rust serverless snowflake sql vector-database vector-search

Last synced: 18 May 2026

https://github.com/askorama/orama

🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.

algiorithm data-structures full-text javascript node search search-algorithm search-engine typescript typo-tolerance vector vector-database vector-database-embedding vector-search vector-search-engine

Last synced: 09 Apr 2025

https://neuml.github.io/txtai/

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

embeddings information-retrieval language-model large-language-models llm machine-learning neural-search nlp python rag retrieval-augmented-generation search search-engine semantic-search sentence-embeddings transformers txtai vector-database vector-search vector-search-engine

Last synced: 25 Sep 2025

https://github.com/srbhr/resume-matcher

Resume Matcher is an open source, free tool to improve your resume. It works by using AI, Reader LLMs, to compare and rank resumes with job descriptions.

applicant-tracking-system ats hacktoberfest machine-learning natural-language-processing nextjs python resume resume-builder resume-parser text-similarity typescript vector-search word-embeddings

Last synced: 08 May 2025

https://github.com/zilliztech/gptcache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

aigc autogpt babyagi chatbot chatgpt chatgpt-api dolly gpt langchain llama llama-index llm memcache milvus openai redis semantic-search similarity-search vector-search

Last synced: 12 May 2025

https://github.com/zilliztech/GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

aigc autogpt babyagi chatbot chatgpt chatgpt-api dolly gpt langchain llama llama-index llm memcache milvus openai redis semantic-search similarity-search vector-search

Last synced: 24 Mar 2025

https://github.com/vespa-engine/vespa

AI + Data, online. https://vespa.ai

ai big-data java machine-learning rag search search-engine server serving-recommendation tensor vector vector-database vector-search vespa

Last synced: 01 Apr 2026

https://github.com/superduper-io/superduper

Superduper: End-to-end framework for building custom AI applications and agents.

ai chatbot data database distributed-ml inference llm-inference llm-serving llmops ml mlops mongodb pretrained-models python pytorch rag semantic-search torch transformers vector-search

Last synced: 14 May 2025

https://github.com/srbhr/Resume-Matcher

Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.

applicant-tracking-system ats hacktoberfest machine-learning natural-language-processing nextjs python resume resume-builder resume-parser text-similarity typescript vector-search word-embeddings

Last synced: 26 Mar 2025

https://github.com/microsoft/sptag

A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search scenario.

approximate-nearest-neighbor-search distributed-serving fresh-update neighborhood-graph space-partition-tree vector-search

Last synced: 07 May 2025

https://github.com/microsoft/SPTAG

A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search scenario.

approximate-nearest-neighbor-search distributed-serving fresh-update neighborhood-graph space-partition-tree vector-search

Last synced: 15 Mar 2025

https://github.com/ravendb/ravendb

ACID Document Database

csharp database document-database dotnet full-text-search indexing iot nosql ravendb search-engine sharding spatial time-series vector-search

Last synced: 13 May 2025

https://github.com/Tencent/TencentDB-Agent-Memory

TencentDB Agent Memory delivers fully local long-term memory for AI Agents via a 4-tier progressive pipeline, with zero external API dependencies.

agent ai-agent embedding llm local-first long-term-memory memory openclaw-plugin vector-search

Last synced: 28 May 2026

https://github.com/infiniflow/infinity

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

ai-native approximate-nearest-neighbor-search bm25 cpp20 cpp20-modules embedding full-text-search hnsw hybrid-search information-retrival nearest-neighbor-search rag search-engine tensor-database vector vector-database vector-search vectordatabase

Last synced: 02 Apr 2026

https://github.com/RyanCodrai/turbovec

A vector index built on TurboQuant, written in Rust with Python bindings

ann avx512 embedding embeddings faiss nearest-neighbor neon python quant quantization rag rust simd turboquant vector-search

Last synced: 02 Jun 2026

https://github.com/pashpashpash/vault-ai

OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, epub, etc) using a simple React frontend.

ai artificial-intelligence chatgpt generative go golang knowledge-base long-term-memory machine-learning openai openai-api pdf-support pinecone qdrant-vector-database question-answering react reactjs vector-search

Last synced: 14 May 2025

https://github.com/hegelai/prompttools

Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).

deep-learning developer-tools embeddings large-language-models llms machine-learning prompt-engineering python vector-search

Last synced: 14 May 2025

https://github.com/chonkie-ai/chonkie

🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library

ai chunking etl nlp python rag retrieval semantic-segmentation text-chunking text-processing text-splitting vector-search

Last synced: 14 May 2025

https://github.com/gerevai/gerev

🧠 AI-powered enterprise search engine 🔎

ai chatgpt confluence enterprise-search helpdesk helpdesk-tools llama-index machine-learning search-engine semantic-search-engine similarity-search sysadmin tech-support technical-support vector-search workplace-search

Last synced: 15 May 2025

https://github.com/GerevAI/gerev

🧠 AI-powered enterprise search engine 🔎

ai chatgpt confluence enterprise-search helpdesk helpdesk-tools llama-index machine-learning search-engine semantic-search-engine similarity-search sysadmin tech-support technical-support vector-search workplace-search

Last synced: 24 Mar 2025

https://github.com/cheshire-cat-ai/core

AI agent microservice

agent ai assistant bot bot-framework chatbot conversational conversational-forms docker framework function-calling llm plugin python vector-search

Last synced: 13 May 2025

https://github.com/hora-search/hora

🚀 efficient approximate nearest neighbor search algorithm collections library written in Rust 🦀 .

algorithm approximate-nearest-neighbor-search artificial-intelligence data-structures high-performance hnsw image-search k-nearest-neighbors machine-learning neural-network numeric recommender-system rust rust-sci search-engine simd similarity-search vector-search

Last synced: 14 May 2025

https://github.com/unum-cloud/usearch

Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

approximate-nearest-neighbor-search clustering database faiss full-text-search fuzzy-search image-search kann nearest-neighbor-search recommender-system search search-engine semantic-search simd similarity-search text-search vector-search webassembly

Last synced: 16 Apr 2026

https://github.com/vearch/vearch

Distributed vector search for AI-native applications

ai-native ai-native-database cloud-native document-retrieval embeddings hybrid-search rag retrieval-augmented-generation vector-database vector-search vectors

Last synced: 13 May 2025

https://github.com/devflowinc/trieve

All-in-one infrastructure for search, recommendations, RAG, and analytics offered via API

actix actix-web ai artificial-intelligence diesel embedding hacktoberfest llm postgresql qdrant qdrant-vector-database rag retrieval-augmented-generation rust search search-engine solidjs tailwindcss vector-search

Last synced: 13 May 2025

https://github.com/seekstorm/seekstorm

SeekStorm: vector & lexical search - in-process library & multi-tenancy server, in Rust.

ai-search bm25 dense-retrieval enterprise-search faceting full-text-search geosearch hybrid-search lexical-search neural-search realtime search search-engine search-server search-service semantic-search sparse-retrieval vector-database vector-search vector-search-engine

Last synced: 19 Apr 2026

https://github.com/asg017/sqlite-vss

A SQLite extension for efficient vector search, based on Faiss!

faiss sqlite sqlite-extension vector-search

Last synced: 14 May 2025

https://github.com/mintplex-labs/vector-admin

The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

ai ai-agents aitools chroma database-management document-retrieval embeddings flowise langchain langchain-js llms pinecone qdrant vector-data-management vector-database vector-database-embedding vector-search vectordatabase vectorspace weaviate

Last synced: 31 Oct 2025

https://github.com/Mintplex-Labs/vector-admin

The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

ai ai-agents aitools chroma database-management document-retrieval embeddings flowise langchain langchain-js llms pinecone qdrant vector-data-management vector-database vector-database-embedding vector-search vectordatabase vectorspace weaviate

Last synced: 24 Mar 2025

https://github.com/patterns-ai-core/langchainrb

Build LLM-powered applications in Ruby

agents ai-agents artificial-intelligence machine-learning ml rubyml vector-search

Last synced: 12 May 2025

https://github.com/datastax/jvector

JVector: the most advanced embedded vector search engine

ann java knn machine-learning search-engine similarity-search vector-search

Last synced: 09 Jun 2026

https://github.com/ashvardanian/NumKong

SIMD-accelerated distances, dot products, matrix ops, geospatial & geometric kernels for 16 numeric types — from 6-bit floats to 64-bit complex — across x86, Arm, RISC-V, and WASM, with bindings for Python, Rust, C, C++, Swift, JS, and Go 📐

arm-neon assembly blas cpp golang information-retrieval javascript matrix-multiplication metrics numpy rust scipy simd swift tensor vector-search

Last synced: 22 Mar 2026

https://github.com/supabase-community/nextjs-openai-doc-search

Template for building your own custom ChatGPT style doc search powered by Next.js, OpenAI, and Supabase.

ai chatgpt nextjs openai postgres supabase template vector-search

Last synced: 15 May 2025

https://github.com/supabase-community/nextjs-openai-doc-search?og=v2

Template for building your own custom ChatGPT style doc search powered by Next.js, OpenAI, and Supabase.

ai chatgpt nextjs openai postgres supabase template vector-search

Last synced: 21 Apr 2025

https://github.com/supabase-community/nextjs-openai-doc-search?og=v2+%22Template+for+building+your+own+custom+ChatGPT+style+doc+search+powered+by+Next.js%2C+OpenAI%2C+and+Supabase.%22

Template for building your own custom ChatGPT style doc search powered by Next.js, OpenAI, and Supabase.

ai chatgpt nextjs openai postgres supabase template vector-search

Last synced: 09 Apr 2025

https://github.com/jbellis/jvector

JVector: the most advanced embedded vector search engine

ann java knn machine-learning search-engine similarity-search vector-search

Last synced: 13 Mar 2025

https://github.com/qdrant/fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

embeddings openai rag retrieval retrieval-augmented-generation vector-search

Last synced: 26 Mar 2025

https://github.com/superlinked/superlinked

Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.

data-pipeline deep-learning embeddings etl information-retrieval llm ml mlops natural-language-processing nlp python retrieval retrieval-augmented-generation semantic-search vector-database vector-search vectorization

Last synced: 16 Jan 2026

https://github.com/andreibondarev/langchainrb

Build LLM-powered applications in Ruby

agents ai-agents artificial-intelligence machine-learning ml rubyml vector-search

Last synced: 16 Mar 2025

https://github.com/ashvardanian/simsimd

Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐

arm-neon arm-sve assembly avx2 avx512 bfloat16 blas blas-libraries distance-calculation float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search

Last synced: 13 May 2025

https://github.com/memfreeme/memfree

MemFree - Hybrid AI Search Engine & AI Page Generator

ai ai-search ai-search-engine devfast generate-ui hacktoberfest hacktoberfest-accepted hybrid-ai-search page-generator react search-engine serverless-vector shadcn-ui vector-search

Last synced: 14 May 2025

https://github.com/giancarloerra/socraticode

Enterprise-grade (40m+ LOC) codebase intelligence, zero-setup, private & local Plugin/Skill or MCP: hybrid semantic search, polyglot dependency graphs, symbol-level impact analysis & call-flow, interactive HTML viewer, cross-project & branch-aware search, DB/API/infra knowledge. 61% less tokens, 84% fewer calls, 37x faster. Cloud in private beta.

ai ai-assistant ast claude claude-code code-graph codebase-intelligence context-engine docker embeddings gemini gemini-cli-extension mcp openai qdrant semantic semantic-search vector-database vector-embeddings vector-search

Last synced: 04 May 2026

https://github.com/ashvardanian/SimSIMD

Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐

arm-neon arm-sve assembly avx2 avx512 bfloat16 blas blas-libraries distance-calculation float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search

Last synced: 23 Mar 2025

https://github.com/unum-cloud/UForm

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

bert clip clustering contrastive-learning cross-attention huggingface-transformers image-search language-vision llava multi-lingual multimodal neural-network openai openclip pretrained-models pytorch representation-learning semantic-search transformer vector-search

Last synced: 19 Apr 2026

https://github.com/unum-cloud/uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

bert clip clustering contrastive-learning cross-attention huggingface-transformers image-search language-vision llava multi-lingual multimodal neural-network openai openclip pretrained-models pytorch representation-learning semantic-search transformer vector-search

Last synced: 14 May 2025

https://github.com/qdrant/qdrant-client

Python client for Qdrant vector search engine

qdrant vector-database vector-search vector-search-engine

Last synced: 08 Jul 2025

https://github.com/zilliztech/vectordbbench

Benchmark for vector databases.

benchmark cost-effectiveness performance vector-database vector-search vectordb

Last synced: 12 Feb 2026

https://github.com/myscale/myscaledb

A @ClickHouse fork that supports high-performance vector search and full-text search.

ann big-data embedding image-search llm myscaledb rag search-engine similarity-search sql sql-vector unstructured-analytics vector-search vectordb

Last synced: 12 Jan 2026

https://github.com/tantaraio/voy

🕸️🦀 A WASM vector similarity search written in Rust

k-d-tree nearest-neighbor-search rust similarity-search vector-search wasm wasm-pack webassembly

Last synced: 01 Apr 2025

https://github.com/superlinear-ai/raglite

🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite

chainlit colbert evals hybrid-search late-chunking late-interaction llm markdown pdf pgvector postgres postgresql query-adapter rag reranker reranking retrieval-augmented-generation sqlite tsvector vector-search

Last synced: 14 May 2025

https://github.com/myscale/MyScaleDB

A @ClickHouse fork that supports high-performance vector search and full-text search.

ann big-data embedding image-search llm myscaledb rag search-engine similarity-search sql sql-vector unstructured-analytics vector-search vectordb

Last synced: 12 Mar 2025

https://github.com/epsilla-cloud/vectordb

Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/

ai chatgpt data data-science database embeddings embeddings-similarity infrastructure llms machine-learning neural-network neural-search rag retrieval search-engine vector-database vector-search

Last synced: 15 May 2025

https://github.com/rapidsai/raft

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.

anns building-blocks clustering cuda distance gpu information-retrieval linear-algebra llm machine-learning nearest-neighbors neighborhood-methods primitives random-sampling solvers sparse statistics vector-search vector-similarity vector-store

Last synced: 14 May 2025

https://github.com/tensorchord/vectorchord

Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.

artificial-intelligence llmops postgresql vector-database vector-search

Last synced: 21 Jun 2025

https://github.com/azure/azure-search-vector-samples

A repository of code samples for Vector search capabilities in Azure AI Search.

azure azurecognitivesearch embeddings vector vector-search

Last synced: 14 May 2025

https://github.com/arcadedata/arcadedb

ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.

arcadedb database dbms distributed docker document embedded graph k8s key-value kubernetes multi-model orientdb search-engine similarity-search time-series vector-database vector-search

Last synced: 24 Apr 2026

https://github.com/Azure/azure-search-vector-samples

A repository of code samples for Vector search capabilities in Azure AI Search.

azure azurecognitivesearch embeddings vector vector-search

Last synced: 25 Mar 2025

https://github.com/prithivirajdamodaran/flashrank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

cross-encoder full-text-search hybrid-search lexical-search rag ranking reranking retrieval-augmented-generation semantic-search vector-database vector-search

Last synced: 14 May 2025

https://github.com/weaviate/recipes

This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!

function-calling generative-ai llm-frameworks python retrieval-augmented-generation vector-database vector-search

Last synced: 15 May 2025

https://github.com/anush008/fastembed-rs

Rust library for vector embeddings and reranking. Inspired by qdrant/fastembed.

embeddings fastembed rag reranker reranking retrieval retrieval-augmented-generation vector-search

Last synced: 19 Feb 2026

https://github.com/nuclia/nucliadb

NucliaDB, The AI Search database for RAG

ai-powered-search database language-model machine-learning mlops nuclia python rust search search-engine search-engines semantic semantic-search-engine text-classification unstructured-data vector-search vector-search-engine vectors

Last synced: 14 May 2025

https://github.com/PrithivirajDamodaran/FlashRank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

cross-encoder full-text-search hybrid-search lexical-search rag ranking reranking retrieval-augmented-generation semantic-search vector-database vector-search

Last synced: 08 May 2025

https://github.com/unum-cloud/UStore

Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang 🗄️

acid apache-arrow arrow big-data bigdata database dataloader document-database graph-database iouring json key-value-store knn-search networkx nosql pandas python search spdk vector-search

Last synced: 09 Jun 2026

https://github.com/jina-ai/vectordb

A Python vector database you just need - no more, no less.

Last synced: 16 May 2025

https://github.com/unum-cloud/ustore

Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang 🗄️

acid apache-arrow arrow big-data bigdata database dataloader document-database graph-database iouring json key-value-store knn-search networkx nosql pandas python search spdk vector-search

Last synced: 11 Apr 2025

https://github.com/christopherkarani/Wax

🍯 Memory layer for on-device AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer.

ai-agents cli coreml coreml-framework data-science machine-learning mcp mcp-server memory memory-cache memory-hacking metal on-device-ai rag rag-pipeline swift vector-database vector-embeddings vector-search vectordb

Last synced: 04 Mar 2026

https://github.com/redis-developer/arxivchatguru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search

Last synced: 15 May 2025

https://github.com/redis-developer/ArXivChatGuru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search

Last synced: 11 Apr 2025

https://github.com/redis-developer/ArxivChatGuru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search

Last synced: 18 Jul 2025

https://github.com/ArcadeData/arcadedb

ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.

arcadedb database dbms distributed docker document embedded graph k8s key-value kubernetes multi-model orientdb search-engine similarity-search time-series vector-database vector-search

Last synced: 23 Apr 2025

https://github.com/philippgille/chromem-go

Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence.

chroma chromadb cosine-similarity embedded embeddings go golang in-memory llm llms nearest-neighbor rag retrieval-augmented-generation vector-database vector-search

Last synced: 05 Apr 2025

https://github.com/Anush008/fastembed-rs

Rust library for generating vector embeddings, reranking locally

embeddings fastembed rag reranker reranking retrieval retrieval-augmented-generation vector-search

Last synced: 01 May 2025

https://github.com/kelindar/search

Go library for embedded vector search and semantic embeddings using llama.cpp

ai bert embeddings gguf gpu llamacpp search-engine semantic-search simd vector-search

Last synced: 16 May 2025

https://github.com/microsoft/rag-time

RAG Time: A 5-week Learning Journey to Mastering RAG

ai azure binary-quantization generative-ai gpt hnsw hybrid-search indexing keyword-search language-model llm matryoshka-representation-learning multimodal openai rag responsible-ai retrieval-augmented-generation scalar-quantization vector-search visual-studio-code

Last synced: 15 May 2025

https://github.com/rapidsai/cuvs

cuVS - a library for vector search and clustering on the GPU

anns clustering cuda distance gpu information-retrieval llm machine-learning nearest-neighbors neighborhood-methods similarity-search sparse statistics vector-search vector-similarity vector-store

Last synced: 08 Apr 2026

https://github.com/m1guelpf/tinyvector

A tiny embedding database in pure Rust.

embeddings embeddings-similarity machine-learning rust search-engines similarity-search vector-database vector-search

Last synced: 23 Oct 2025

https://github.com/bbc-esq/vectordb-plugin

Plugin that lets you ask questions about your documents including audio and video files.

bark database-management embedding-models embedding-vectors embeddings gtts koboldai koboldcpp python rag retrieval-augmented-generation retrieval-chatbot tiledb vector-data-management vector-database vector-search vision whisper whispers2t whisperspeech

Last synced: 16 May 2025

https://github.com/ukplab/gpl

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577

bert domain-adaptation information-retrieval nlp transformers vector-search

Last synced: 18 Jun 2025

https://github.com/raphaelsty/cherche

Neural Search

bm25 flashtext information-retrieval machine-learning natural-language-processing neural-networks neural-search nlp question-answering reader retrieval search searching semantic-search vector-search

Last synced: 11 Oct 2025

https://github.com/weaviate/weaviate-examples

Weaviate vector database – examples

deep-learning examples vector-database vector-search vector-search-engine weaviate

Last synced: 28 Jan 2026

https://github.com/UKPLab/gpl

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577

bert domain-adaptation information-retrieval nlp transformers vector-search

Last synced: 14 Jul 2025

https://github.com/qdrant/vector-db-benchmark

Framework for benchmarking vector search engines

benchmark vector-database vector-search vector-search-engine

Last synced: 15 May 2025

https://github.com/vector-ai/vectorai

Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.

artificial-intelligence clustering compare-vectors deep-learning embeddings encodings machine-learning neural-networks python pytorch search search-engine semantic-search tensorflow transformers vector vector-analytics vector-search vector-similarity vector-similarity-database

Last synced: 04 Apr 2025

https://github.com/edwinkys/oasysdb

An embedded vector database designed to run on edge devices. Lightweight and fast with HNSW indexing algorithm.

approximate-nearest-neighbors edge-ai edge-computing hnsw key-value-database open-source rest-api similarity-search vector-database vector-search

Last synced: 01 Mar 2026

https://github.com/superlinked/VectorHub

VectorHub is a free, open-source learning website for people (software developers to senior ML architects) interested in adding vector retrieval to their ML stack.

ai llm llmops ml mlops vector vector-database vector-search vectorops

Last synced: 10 Apr 2025