Projects in Awesome Lists tagged with vector-search
A curated list of projects in awesome lists tagged with vector-search .
https://github.com/meilisearch/meilisearch
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
ai api app-search database enterprise-search faceting full-text-search fuzzy-search geosearch hybrid-search instantsearch search search-as-you-type search-engine semantic-search site-search typo-tolerance vector-database vector-search vectors
Last synced: 11 Mar 2026
https://github.com/meilisearch/Meilisearch
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
ai api app-search database enterprise-search faceting full-text-search fuzzy-search geosearch hybrid-search instantsearch search search-as-you-type search-engine semantic-search site-search typo-tolerance vector-database vector-search vectors
Last synced: 07 May 2025
https://github.com/meilisearch/MeiliSearch
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
ai api app-search database enterprise-search faceting full-text-search fuzzy-search geosearch hybrid-search instantsearch search search-as-you-type search-engine semantic-search site-search typo-tolerance vector-database vector-search vectors
Last synced: 29 Mar 2025
https://github.com/milvus-io/milvus
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
anns cloud-native diskann distributed embedding-database embedding-similarity embedding-store faiss golang hnsw image-search llm nearest-neighbor-search rag vector-database vector-search vector-similarity vector-store
Last synced: 22 May 2026
https://github.com/onyx-dot-app/onyx
Open Source AI Platform - AI Chat with advanced features that works with every LLM
ai ai-chat chatgpt chatui enterprise-search gen-ai information-retrieval llm llm-ui nextjs python rag self-hosted vector-search
Last synced: 02 Jun 2026
https://github.com/dragonflydb/dragonfly
A modern replacement for Redis and Memcached
cache cpp database fibers hacktoberfest in-memory in-memory-database key-value keydb memcached message-broker multi-threading nosql redis valkey vector-search
Last synced: 12 May 2025
https://github.com/qdrant/qdrant
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
ai-search ai-search-engine embeddings-similarity hnsw image-search knn-algorithm machine-learning mlops nearest-neighbor-search neural-network neural-search recommender-system search search-engine search-engines similarity-search vector-database vector-search vector-search-engine
Last synced: 12 May 2025
https://github.com/typesense/typesense
Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch β‘ π β¨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
algolia datastore elasticsearch enterprise-search faceting full-text-search fuzzy-search geosearch in-memory instantsearch merchandising pinecone search search-engine semantic-search similarity-search site-search synonyms typo-tolerance vector-search
Last synced: 13 May 2025
https://github.com/weaviate/weaviate
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native databaseβ.
approximate-nearest-neighbor-search generative-search grpc hnsw hybrid-search image-search information-retrieval mlops nearest-neighbor-search neural-search recommender-system search-engine semantic-search semantic-search-engine similarity-search vector-database vector-search vector-search-engine vectors weaviate
Last synced: 02 Jun 2026
https://github.com/tencent/weknora
LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
agent agentic ai chatbot chatbots embeddings evaluation generative-ai golang knowledge-base llm multi-tenant multimodel ollama openai question-answering rag reranking semantic-search vector-search
Last synced: 15 Apr 2026
https://github.com/StarTrail-org/LEANN
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
ai faiss gpt-oss langchain llama-index llm localstorage offline-first ollama privacy python rag retrieval-augmented-generation vector-database vector-search vectors
Last synced: 02 Jun 2026
https://github.com/neuml/txtai
π‘ All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
ai artificial-intelligence embeddings information-retrieval language-model large-language-models llm machine-learning nlp python rag retrieval-augmented-generation search search-engine semantic-search sentence-embeddings transformers txtai vector-database vector-search
Last synced: 12 May 2025
https://github.com/yichuan-w/leann
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
ai faiss gpt-oss langchain llama-index llm localstorage offline-first ollama privacy python rag retrieval-augmented-generation vector-database vector-search vectors
Last synced: 08 Mar 2026
https://github.com/voxel51/fiftyone
Refine high-quality datasets and visual AI models
active-learning artificial-intelligence computer-vision data-centric-ai data-cleaning data-curation data-quality data-science deep-learning developer-tools image-classification machine-learning object-detection python unstructured-data vector-search visualization
Last synced: 19 Feb 2026
https://github.com/oramasearch/orama
π A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.
algiorithm data-structures full-text javascript node search search-algorithm search-engine typescript typo-tolerance vector vector-database vector-database-embedding vector-search vector-search-engine
Last synced: 12 May 2025
https://github.com/databendlabs/databend
Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. β rebuilt from scratch. Unified architecture on your S3.
ai bigdata cloud-native database elasticsearch geospatial lakehouse olap rust serverless snowflake sql vector-database vector-search
Last synced: 18 May 2026
https://github.com/askorama/orama
π A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.
algiorithm data-structures full-text javascript node search search-algorithm search-engine typescript typo-tolerance vector vector-database vector-database-embedding vector-search vector-search-engine
Last synced: 09 Apr 2025
https://neuml.github.io/txtai/
π‘ All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
embeddings information-retrieval language-model large-language-models llm machine-learning neural-search nlp python rag retrieval-augmented-generation search search-engine semantic-search sentence-embeddings transformers txtai vector-database vector-search vector-search-engine
Last synced: 25 Sep 2025
https://github.com/activeloopai/deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
ai computer-vision cv data-science datalake datasets deep-learning image-processing langchain large-language-models llm machine-learning ml mlops multi-modal python pytorch tensorflow vector-database vector-search
Last synced: 11 Feb 2026
https://github.com/srbhr/resume-matcher
Resume Matcher is an open source, free tool to improve your resume. It works by using AI, Reader LLMs, to compare and rank resumes with job descriptions.
applicant-tracking-system ats hacktoberfest machine-learning natural-language-processing nextjs python resume resume-builder resume-parser text-similarity typescript vector-search word-embeddings
Last synced: 08 May 2025
https://github.com/zilliztech/gptcache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
aigc autogpt babyagi chatbot chatgpt chatgpt-api dolly gpt langchain llama llama-index llm memcache milvus openai redis semantic-search similarity-search vector-search
Last synced: 12 May 2025
https://github.com/zilliztech/GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
aigc autogpt babyagi chatbot chatgpt chatgpt-api dolly gpt langchain llama llama-index llm memcache milvus openai redis semantic-search similarity-search vector-search
Last synced: 24 Mar 2025
https://github.com/vespa-engine/vespa
AI + Data, online. https://vespa.ai
ai big-data java machine-learning rag search search-engine server serving-recommendation tensor vector vector-database vector-search vespa
Last synced: 01 Apr 2026
https://github.com/superduper-io/superduper
Superduper: End-to-end framework for building custom AI applications and agents.
ai chatbot data database distributed-ml inference llm-inference llm-serving llmops ml mlops mongodb pretrained-models python pytorch rag semantic-search torch transformers vector-search
Last synced: 14 May 2025
https://github.com/srbhr/Resume-Matcher
Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.
applicant-tracking-system ats hacktoberfest machine-learning natural-language-processing nextjs python resume resume-builder resume-parser text-similarity typescript vector-search word-embeddings
Last synced: 26 Mar 2025
https://github.com/microsoft/sptag
A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search scenario.
approximate-nearest-neighbor-search distributed-serving fresh-update neighborhood-graph space-partition-tree vector-search
Last synced: 07 May 2025
https://github.com/microsoft/SPTAG
A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search scenario.
approximate-nearest-neighbor-search distributed-serving fresh-update neighborhood-graph space-partition-tree vector-search
Last synced: 15 Mar 2025
https://github.com/ravendb/ravendb
ACID Document Database
csharp database document-database dotnet full-text-search indexing iot nosql ravendb search-engine sharding spatial time-series vector-search
Last synced: 13 May 2025
https://github.com/Tencent/TencentDB-Agent-Memory
TencentDB Agent Memory delivers fully local long-term memory for AI Agents via a 4-tier progressive pipeline, with zero external API dependencies.
agent ai-agent embedding llm local-first long-term-memory memory openclaw-plugin vector-search
Last synced: 28 May 2026
https://github.com/infiniflow/infinity
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text
ai-native approximate-nearest-neighbor-search bm25 cpp20 cpp20-modules embedding full-text-search hnsw hybrid-search information-retrival nearest-neighbor-search rag search-engine tensor-database vector vector-database vector-search vectordatabase
Last synced: 02 Apr 2026
https://github.com/RyanCodrai/turbovec
A vector index built on TurboQuant, written in Rust with Python bindings
ann avx512 embedding embeddings faiss nearest-neighbor neon python quant quantization rag rust simd turboquant vector-search
Last synced: 02 Jun 2026
https://github.com/pashpashpash/vault-ai
OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, epub, etc) using a simple React frontend.
ai artificial-intelligence chatgpt generative go golang knowledge-base long-term-memory machine-learning openai openai-api pdf-support pinecone qdrant-vector-database question-answering react reactjs vector-search
Last synced: 14 May 2025
https://github.com/hegelai/prompttools
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
deep-learning developer-tools embeddings large-language-models llms machine-learning prompt-engineering python vector-search
Last synced: 14 May 2025
https://github.com/chonkie-ai/chonkie
π¦ CHONK your texts with Chonkie β¨ - The no-nonsense RAG chunking library
ai chunking etl nlp python rag retrieval semantic-segmentation text-chunking text-processing text-splitting vector-search
Last synced: 14 May 2025
https://github.com/gerevai/gerev
π§ AI-powered enterprise search engine π
ai chatgpt confluence enterprise-search helpdesk helpdesk-tools llama-index machine-learning search-engine semantic-search-engine similarity-search sysadmin tech-support technical-support vector-search workplace-search
Last synced: 15 May 2025
https://github.com/GerevAI/gerev
π§ AI-powered enterprise search engine π
ai chatgpt confluence enterprise-search helpdesk helpdesk-tools llama-index machine-learning search-engine semantic-search-engine similarity-search sysadmin tech-support technical-support vector-search workplace-search
Last synced: 24 Mar 2025
https://github.com/cheshire-cat-ai/core
AI agent microservice
agent ai assistant bot bot-framework chatbot conversational conversational-forms docker framework function-calling llm plugin python vector-search
Last synced: 13 May 2025
https://github.com/hora-search/hora
π efficient approximate nearest neighbor search algorithm collections library written in Rust π¦ .
algorithm approximate-nearest-neighbor-search artificial-intelligence data-structures high-performance hnsw image-search k-nearest-neighbors machine-learning neural-network numeric recommender-system rust rust-sci search-engine simd similarity-search vector-search
Last synced: 14 May 2025
https://github.com/unum-cloud/usearch
Fast Open-Source Search & Clustering engine Γ for Vectors & π Strings Γ in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram π
approximate-nearest-neighbor-search clustering database faiss full-text-search fuzzy-search image-search kann nearest-neighbor-search recommender-system search search-engine semantic-search simd similarity-search text-search vector-search webassembly
Last synced: 16 Apr 2026
https://github.com/vearch/vearch
Distributed vector search for AI-native applications
ai-native ai-native-database cloud-native document-retrieval embeddings hybrid-search rag retrieval-augmented-generation vector-database vector-search vectors
Last synced: 13 May 2025
https://github.com/devflowinc/trieve
All-in-one infrastructure for search, recommendations, RAG, and analytics offered via API
actix actix-web ai artificial-intelligence diesel embedding hacktoberfest llm postgresql qdrant qdrant-vector-database rag retrieval-augmented-generation rust search search-engine solidjs tailwindcss vector-search
Last synced: 13 May 2025
https://github.com/seekstorm/seekstorm
SeekStorm: vector & lexical search - in-process library & multi-tenancy server, in Rust.
ai-search bm25 dense-retrieval enterprise-search faceting full-text-search geosearch hybrid-search lexical-search neural-search realtime search search-engine search-server search-service semantic-search sparse-retrieval vector-database vector-search vector-search-engine
Last synced: 19 Apr 2026
https://github.com/asg017/sqlite-vss
A SQLite extension for efficient vector search, based on Faiss!
faiss sqlite sqlite-extension vector-search
Last synced: 14 May 2025
https://github.com/mintplex-labs/vector-admin
The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.
ai ai-agents aitools chroma database-management document-retrieval embeddings flowise langchain langchain-js llms pinecone qdrant vector-data-management vector-database vector-database-embedding vector-search vectordatabase vectorspace weaviate
Last synced: 31 Oct 2025
https://github.com/Mintplex-Labs/vector-admin
The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.
ai ai-agents aitools chroma database-management document-retrieval embeddings flowise langchain langchain-js llms pinecone qdrant vector-data-management vector-database vector-database-embedding vector-search vectordatabase vectorspace weaviate
Last synced: 24 Mar 2025
https://github.com/patterns-ai-core/langchainrb
Build LLM-powered applications in Ruby
agents ai-agents artificial-intelligence machine-learning ml rubyml vector-search
Last synced: 12 May 2025
https://github.com/datastax/jvector
JVector: the most advanced embedded vector search engine
ann java knn machine-learning search-engine similarity-search vector-search
Last synced: 03 Apr 2026
https://github.com/ashvardanian/NumKong
SIMD-accelerated distances, dot products, matrix ops, geospatial & geometric kernels for 16 numeric types β from 6-bit floats to 64-bit complex β across x86, Arm, RISC-V, and WASM, with bindings for Python, Rust, C, C++, Swift, JS, and Go π
arm-neon assembly blas cpp golang information-retrieval javascript matrix-multiplication metrics numpy rust scipy simd swift tensor vector-search
Last synced: 22 Mar 2026
https://github.com/supabase-community/nextjs-openai-doc-search
Template for building your own custom ChatGPT style doc search powered by Next.js, OpenAI, and Supabase.
ai chatgpt nextjs openai postgres supabase template vector-search
Last synced: 15 May 2025
https://github.com/supabase-community/nextjs-openai-doc-search?og=v2
Template for building your own custom ChatGPT style doc search powered by Next.js, OpenAI, and Supabase.
ai chatgpt nextjs openai postgres supabase template vector-search
Last synced: 21 Apr 2025
https://github.com/supabase-community/nextjs-openai-doc-search?og=v2+%22Template+for+building+your+own+custom+ChatGPT+style+doc+search+powered+by+Next.js%2C+OpenAI%2C+and+Supabase.%22
Template for building your own custom ChatGPT style doc search powered by Next.js, OpenAI, and Supabase.
ai chatgpt nextjs openai postgres supabase template vector-search
Last synced: 09 Apr 2025
https://github.com/jbellis/jvector
JVector: the most advanced embedded vector search engine
ann java knn machine-learning search-engine similarity-search vector-search
Last synced: 13 Mar 2025
https://github.com/qdrant/fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
embeddings openai rag retrieval retrieval-augmented-generation vector-search
Last synced: 26 Mar 2025
https://github.com/superlinked/superlinked
Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.
data-pipeline deep-learning embeddings etl information-retrieval llm ml mlops natural-language-processing nlp python retrieval retrieval-augmented-generation semantic-search vector-database vector-search vectorization
Last synced: 16 Jan 2026
https://github.com/andreibondarev/langchainrb
Build LLM-powered applications in Ruby
agents ai-agents artificial-intelligence machine-learning ml rubyml vector-search
Last synced: 16 Mar 2025
https://github.com/ashvardanian/simsimd
Up to 200x Faster Dot Products & Similarity Metrics β for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 π
arm-neon arm-sve assembly avx2 avx512 bfloat16 blas blas-libraries distance-calculation float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search
Last synced: 13 May 2025
https://github.com/memfreeme/memfree
MemFree - Hybrid AI Search Engine & AI Page Generator
ai ai-search ai-search-engine devfast generate-ui hacktoberfest hacktoberfest-accepted hybrid-ai-search page-generator react search-engine serverless-vector shadcn-ui vector-search
Last synced: 14 May 2025
https://github.com/giancarloerra/socraticode
Enterprise-grade (40m+ LOC) codebase intelligence, zero-setup, private & local Plugin/Skill or MCP: hybrid semantic search, polyglot dependency graphs, symbol-level impact analysis & call-flow, interactive HTML viewer, cross-project & branch-aware search, DB/API/infra knowledge. 61% less tokens, 84% fewer calls, 37x faster. Cloud in private beta.
ai ai-assistant ast claude claude-code code-graph codebase-intelligence context-engine docker embeddings gemini gemini-cli-extension mcp openai qdrant semantic semantic-search vector-database vector-embeddings vector-search
Last synced: 04 May 2026
https://github.com/ashvardanian/SimSIMD
Up to 200x Faster Dot Products & Similarity Metrics β for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 π
arm-neon arm-sve assembly avx2 avx512 bfloat16 blas blas-libraries distance-calculation float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search
Last synced: 23 Mar 2025
https://github.com/unum-cloud/UForm
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and π video, up to 5x faster than OpenAI CLIP and LLaVA πΌοΈ & ποΈ
bert clip clustering contrastive-learning cross-attention huggingface-transformers image-search language-vision llava multi-lingual multimodal neural-network openai openclip pretrained-models pytorch representation-learning semantic-search transformer vector-search
Last synced: 19 Apr 2026
https://github.com/unum-cloud/uform
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and π video, up to 5x faster than OpenAI CLIP and LLaVA πΌοΈ & ποΈ
bert clip clustering contrastive-learning cross-attention huggingface-transformers image-search language-vision llava multi-lingual multimodal neural-network openai openclip pretrained-models pytorch representation-learning semantic-search transformer vector-search
Last synced: 14 May 2025
https://github.com/qdrant/qdrant-client
Python client for Qdrant vector search engine
qdrant vector-database vector-search vector-search-engine
Last synced: 08 Jul 2025
https://github.com/zilliztech/vectordbbench
Benchmark for vector databases.
benchmark cost-effectiveness performance vector-database vector-search vectordb
Last synced: 12 Feb 2026
https://github.com/myscale/myscaledb
A @ClickHouse fork that supports high-performance vector search and full-text search.
ann big-data embedding image-search llm myscaledb rag search-engine similarity-search sql sql-vector unstructured-analytics vector-search vectordb
Last synced: 12 Jan 2026
https://github.com/tantaraio/voy
πΈοΈπ¦ A WASM vector similarity search written in Rust
k-d-tree nearest-neighbor-search rust similarity-search vector-search wasm wasm-pack webassembly
Last synced: 01 Apr 2025
https://github.com/superlinear-ai/raglite
π₯€ RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite
chainlit colbert evals hybrid-search late-chunking late-interaction llm markdown pdf pgvector postgres postgresql query-adapter rag reranker reranking retrieval-augmented-generation sqlite tsvector vector-search
Last synced: 14 May 2025
https://github.com/myscale/MyScaleDB
A @ClickHouse fork that supports high-performance vector search and full-text search.
ann big-data embedding image-search llm myscaledb rag search-engine similarity-search sql sql-vector unstructured-analytics vector-search vectordb
Last synced: 12 Mar 2025
https://github.com/epsilla-cloud/vectordb
Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
ai chatgpt data data-science database embeddings embeddings-similarity infrastructure llms machine-learning neural-network neural-search rag retrieval search-engine vector-database vector-search
Last synced: 15 May 2025
https://github.com/rapidsai/raft
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
anns building-blocks clustering cuda distance gpu information-retrieval linear-algebra llm machine-learning nearest-neighbors neighborhood-methods primitives random-sampling solvers sparse statistics vector-search vector-similarity vector-store
Last synced: 14 May 2025
https://github.com/tensorchord/vectorchord
Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.
artificial-intelligence llmops postgresql vector-database vector-search
Last synced: 21 Jun 2025
https://github.com/azure/azure-search-vector-samples
A repository of code samples for Vector search capabilities in Azure AI Search.
azure azurecognitivesearch embeddings vector vector-search
Last synced: 14 May 2025
https://github.com/arcadedata/arcadedb
ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.
arcadedb database dbms distributed docker document embedded graph k8s key-value kubernetes multi-model orientdb search-engine similarity-search time-series vector-database vector-search
Last synced: 24 Apr 2026
https://github.com/Azure/azure-search-vector-samples
A repository of code samples for Vector search capabilities in Azure AI Search.
azure azurecognitivesearch embeddings vector vector-search
Last synced: 25 Mar 2025
https://github.com/prithivirajdamodaran/flashrank
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.
cross-encoder full-text-search hybrid-search lexical-search rag ranking reranking retrieval-augmented-generation semantic-search vector-database vector-search
Last synced: 14 May 2025
https://github.com/weaviate/recipes
This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!
function-calling generative-ai llm-frameworks python retrieval-augmented-generation vector-database vector-search
Last synced: 15 May 2025
https://github.com/anush008/fastembed-rs
Rust library for vector embeddings and reranking. Inspired by qdrant/fastembed.
embeddings fastembed rag reranker reranking retrieval retrieval-augmented-generation vector-search
Last synced: 19 Feb 2026
https://github.com/nuclia/nucliadb
NucliaDB, The AI Search database for RAG
ai-powered-search database language-model machine-learning mlops nuclia python rust search search-engine search-engines semantic semantic-search-engine text-classification unstructured-data vector-search vector-search-engine vectors
Last synced: 14 May 2025
https://github.com/PrithivirajDamodaran/FlashRank
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.
cross-encoder full-text-search hybrid-search lexical-search rag ranking reranking retrieval-augmented-generation semantic-search vector-database vector-search
Last synced: 08 May 2025
https://github.com/jina-ai/vectordb
A Python vector database you just need - no more, no less.
embedding-similarity neural-search sentence-embeddings vector-database vector-database-embedding vector-search
Last synced: 16 May 2025
https://github.com/unum-cloud/ustore
Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang ποΈ
acid apache-arrow arrow big-data bigdata database dataloader document-database graph-database iouring json key-value-store knn-search networkx nosql pandas python search spdk vector-search
Last synced: 11 Apr 2025
https://github.com/christopherkarani/Wax
π― Memory layer for on-device AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer.
ai-agents cli coreml coreml-framework data-science machine-learning mcp mcp-server memory memory-cache memory-hacking metal on-device-ai rag rag-pipeline swift vector-database vector-embeddings vector-search vectordb
Last synced: 04 Mar 2026
https://github.com/redis-developer/arxivchatguru
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search
Last synced: 15 May 2025
https://github.com/redis-developer/ArXivChatGuru
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search
Last synced: 11 Apr 2025
https://github.com/redis-developer/ArxivChatGuru
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search
Last synced: 18 Jul 2025
https://github.com/ArcadeData/arcadedb
ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.
arcadedb database dbms distributed docker document embedded graph k8s key-value kubernetes multi-model orientdb search-engine similarity-search time-series vector-database vector-search
Last synced: 23 Apr 2025
https://github.com/philippgille/chromem-go
Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence.
chroma chromadb cosine-similarity embedded embeddings go golang in-memory llm llms nearest-neighbor rag retrieval-augmented-generation vector-database vector-search
Last synced: 05 Apr 2025
https://github.com/Anush008/fastembed-rs
Rust library for generating vector embeddings, reranking locally
embeddings fastembed rag reranker reranking retrieval retrieval-augmented-generation vector-search
Last synced: 01 May 2025
https://github.com/kelindar/search
Go library for embedded vector search and semantic embeddings using llama.cpp
ai bert embeddings gguf gpu llamacpp search-engine semantic-search simd vector-search
Last synced: 16 May 2025
https://github.com/microsoft/rag-time
RAG Time: A 5-week Learning Journey to Mastering RAG
ai azure binary-quantization generative-ai gpt hnsw hybrid-search indexing keyword-search language-model llm matryoshka-representation-learning multimodal openai rag responsible-ai retrieval-augmented-generation scalar-quantization vector-search visual-studio-code
Last synced: 15 May 2025
https://github.com/rapidsai/cuvs
cuVS - a library for vector search and clustering on the GPU
anns clustering cuda distance gpu information-retrieval llm machine-learning nearest-neighbors neighborhood-methods similarity-search sparse statistics vector-search vector-similarity vector-store
Last synced: 08 Apr 2026
https://github.com/m1guelpf/tinyvector
A tiny embedding database in pure Rust.
embeddings embeddings-similarity machine-learning rust search-engines similarity-search vector-database vector-search
Last synced: 23 Oct 2025
https://github.com/bbc-esq/vectordb-plugin
Plugin that lets you ask questions about your documents including audio and video files.
bark database-management embedding-models embedding-vectors embeddings gtts koboldai koboldcpp python rag retrieval-augmented-generation retrieval-chatbot tiledb vector-data-management vector-database vector-search vision whisper whispers2t whisperspeech
Last synced: 16 May 2025
https://github.com/ukplab/gpl
Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
bert domain-adaptation information-retrieval nlp transformers vector-search
Last synced: 18 Jun 2025
https://github.com/weaviate/weaviate-examples
Weaviate vector database β examples
deep-learning examples vector-database vector-search vector-search-engine weaviate
Last synced: 28 Jan 2026
https://github.com/UKPLab/gpl
Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
bert domain-adaptation information-retrieval nlp transformers vector-search
Last synced: 14 Jul 2025
https://github.com/qdrant/vector-db-benchmark
Framework for benchmarking vector search engines
benchmark vector-database vector-search vector-search-engine
Last synced: 15 May 2025
https://github.com/vector-ai/vectorai
Vector AI β A platform for building vector based applications. Encode, query and analyse data using vectors.
artificial-intelligence clustering compare-vectors deep-learning embeddings encodings machine-learning neural-networks python pytorch search search-engine semantic-search tensorflow transformers vector vector-analytics vector-search vector-similarity vector-similarity-database
Last synced: 04 Apr 2025
https://github.com/edwinkys/oasysdb
An embedded vector database designed to run on edge devices. Lightweight and fast with HNSW indexing algorithm.
approximate-nearest-neighbors edge-ai edge-computing hnsw key-value-database open-source rest-api similarity-search vector-database vector-search
Last synced: 01 Mar 2026
https://github.com/superlinked/VectorHub
VectorHub is a free, open-source learning website for people (software developers to senior ML architects) interested in adding vector retrieval to their ML stack.
ai llm llmops ml mlops vector vector-database vector-search vectorops
Last synced: 10 Apr 2025