Projects in Awesome Lists tagged with vector-search
A curated list of projects in awesome lists tagged with vector-search .
https://github.com/meilisearch/meilisearch
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
ai api app-search database enterprise-search faceting full-text-search fuzzy-search geosearch hybrid-search instantsearch search search-as-you-type search-engine semantic-search site-search typo-tolerance vector-database vector-search vectors
Last synced: 12 May 2025
https://github.com/meilisearch/Meilisearch
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
ai api app-search database enterprise-search faceting full-text-search fuzzy-search geosearch hybrid-search instantsearch search search-as-you-type search-engine semantic-search site-search typo-tolerance vector-database vector-search vectors
Last synced: 07 May 2025
https://github.com/meilisearch/MeiliSearch
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
ai api app-search database enterprise-search faceting full-text-search fuzzy-search geosearch hybrid-search instantsearch search search-as-you-type search-engine semantic-search site-search typo-tolerance vector-database vector-search vectors
Last synced: 29 Mar 2025
https://github.com/milvus-io/milvus
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
anns cloud-native diskann distributed embedding-database embedding-similarity embedding-store faiss golang hnsw image-search llm nearest-neighbor-search rag vector-database vector-search vector-similarity vector-store
Last synced: 04 Jan 2026
https://github.com/dragonflydb/dragonfly
A modern replacement for Redis and Memcached
cache cpp database fibers hacktoberfest in-memory in-memory-database key-value keydb memcached message-broker multi-threading nosql redis valkey vector-search
Last synced: 12 May 2025
https://github.com/qdrant/qdrant
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
ai-search ai-search-engine embeddings-similarity hnsw image-search knn-algorithm machine-learning mlops nearest-neighbor-search neural-network neural-search recommender-system search search-engine search-engines similarity-search vector-database vector-search vector-search-engine
Last synced: 12 May 2025
https://github.com/typesense/typesense
Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch β‘ π β¨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
algolia datastore elasticsearch enterprise-search faceting full-text-search fuzzy-search geosearch in-memory instantsearch merchandising pinecone search search-engine semantic-search similarity-search site-search synonyms typo-tolerance vector-search
Last synced: 13 May 2025
https://github.com/weaviate/weaviate
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native databaseβ.
approximate-nearest-neighbor-search generative-search grpc hnsw hybrid-search image-search information-retrieval mlops nearest-neighbor-search neural-search recommender-system search-engine semantic-search semantic-search-engine similarity-search vector-database vector-search vector-search-engine vectors weaviate
Last synced: 05 Jan 2026
https://github.com/neuml/txtai
π‘ All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
ai artificial-intelligence embeddings information-retrieval language-model large-language-models llm machine-learning nlp python rag retrieval-augmented-generation search search-engine semantic-search sentence-embeddings transformers txtai vector-database vector-search
Last synced: 12 May 2025
https://github.com/voxel51/fiftyone
Refine high-quality datasets and visual AI models
active-learning artificial-intelligence computer-vision data-centric-ai data-cleaning data-curation data-quality data-science deep-learning developer-tools image-classification machine-learning object-detection python unstructured-data vector-search visualization
Last synced: 12 May 2025
https://github.com/oramasearch/orama
π A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.
algiorithm data-structures full-text javascript node search search-algorithm search-engine typescript typo-tolerance vector vector-database vector-database-embedding vector-search vector-search-engine
Last synced: 12 May 2025
https://github.com/askorama/orama
π A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.
algiorithm data-structures full-text javascript node search search-algorithm search-engine typescript typo-tolerance vector vector-database vector-database-embedding vector-search vector-search-engine
Last synced: 09 Apr 2025
https://neuml.github.io/txtai/
π‘ All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
embeddings information-retrieval language-model large-language-models llm machine-learning neural-search nlp python rag retrieval-augmented-generation search search-engine semantic-search sentence-embeddings transformers txtai vector-database vector-search vector-search-engine
Last synced: 25 Sep 2025
https://github.com/srbhr/resume-matcher
Resume Matcher is an open source, free tool to improve your resume. It works by using AI, Reader LLMs, to compare and rank resumes with job descriptions.
applicant-tracking-system ats hacktoberfest machine-learning natural-language-processing nextjs python resume resume-builder resume-parser text-similarity typescript vector-search word-embeddings
Last synced: 08 May 2025
https://github.com/activeloopai/deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
ai computer-vision cv data-science datalake datasets deep-learning image-processing langchain large-language-models llm machine-learning ml mlops multi-modal python pytorch tensorflow vector-database vector-search
Last synced: 13 May 2025
https://github.com/zilliztech/gptcache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
aigc autogpt babyagi chatbot chatgpt chatgpt-api dolly gpt langchain llama llama-index llm memcache milvus openai redis semantic-search similarity-search vector-search
Last synced: 12 May 2025
https://github.com/zilliztech/GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
aigc autogpt babyagi chatbot chatgpt chatgpt-api dolly gpt langchain llama llama-index llm memcache milvus openai redis semantic-search similarity-search vector-search
Last synced: 24 Mar 2025
https://github.com/vespa-engine/vespa
AI + Data, online. https://vespa.ai
ai big-data cpp java machine-learning search-engine server serving serving-recommendation tensorflow vector-search vespa
Last synced: 06 Jan 2026
https://github.com/superduper-io/superduper
Superduper: End-to-end framework for building custom AI applications and agents.
ai chatbot data database distributed-ml inference llm-inference llm-serving llmops ml mlops mongodb pretrained-models python pytorch rag semantic-search torch transformers vector-search
Last synced: 14 May 2025
https://github.com/srbhr/Resume-Matcher
Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.
applicant-tracking-system ats hacktoberfest machine-learning natural-language-processing nextjs python resume resume-builder resume-parser text-similarity typescript vector-search word-embeddings
Last synced: 26 Mar 2025
https://github.com/microsoft/sptag
A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search scenario.
approximate-nearest-neighbor-search distributed-serving fresh-update neighborhood-graph space-partition-tree vector-search
Last synced: 07 May 2025
https://github.com/microsoft/SPTAG
A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search scenario.
approximate-nearest-neighbor-search distributed-serving fresh-update neighborhood-graph space-partition-tree vector-search
Last synced: 15 Mar 2025
https://github.com/marqo-ai/marqo
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
chatgpt clip deep-learning gpt hacktoberfest hnsw information-retrieval knn large-language-models machine-learning machinelearning multi-modal natural-language-processing search-engine semantic-search tensor-search transformers vector-search vision-language visual-search
Last synced: 13 May 2025
https://github.com/ravendb/ravendb
ACID Document Database
csharp database document-database dotnet full-text-search indexing iot nosql ravendb search-engine sharding spatial time-series vector-search
Last synced: 13 May 2025
https://github.com/infiniflow/infinity
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text
ai-native approximate-nearest-neighbor-search bm25 cpp20 cpp20-modules embedding full-text-search hnsw hybrid-search information-retrival nearest-neighbor-search rag search-engine tensor-database vector vector-database vector-search vectordatabase
Last synced: 12 May 2025
https://github.com/pashpashpash/vault-ai
OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, epub, etc) using a simple React frontend.
ai artificial-intelligence chatgpt generative go golang knowledge-base long-term-memory machine-learning openai openai-api pdf-support pinecone qdrant-vector-database question-answering react reactjs vector-search
Last synced: 14 May 2025
https://github.com/hegelai/prompttools
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
deep-learning developer-tools embeddings large-language-models llms machine-learning prompt-engineering python vector-search
Last synced: 14 May 2025
https://github.com/chonkie-ai/chonkie
π¦ CHONK your texts with Chonkie β¨ - The no-nonsense RAG chunking library
ai chunking etl nlp python rag retrieval semantic-segmentation text-chunking text-processing text-splitting vector-search
Last synced: 14 May 2025
https://github.com/gerevai/gerev
π§ AI-powered enterprise search engine π
ai chatgpt confluence enterprise-search helpdesk helpdesk-tools llama-index machine-learning search-engine semantic-search-engine similarity-search sysadmin tech-support technical-support vector-search workplace-search
Last synced: 15 May 2025
https://github.com/GerevAI/gerev
π§ AI-powered enterprise search engine π
ai chatgpt confluence enterprise-search helpdesk helpdesk-tools llama-index machine-learning search-engine semantic-search-engine similarity-search sysadmin tech-support technical-support vector-search workplace-search
Last synced: 24 Mar 2025
https://github.com/cheshire-cat-ai/core
AI agent microservice
agent ai assistant bot bot-framework chatbot conversational conversational-forms docker framework function-calling llm plugin python vector-search
Last synced: 13 May 2025
https://github.com/hora-search/hora
π efficient approximate nearest neighbor search algorithm collections library written in Rust π¦ .
algorithm approximate-nearest-neighbor-search artificial-intelligence data-structures high-performance hnsw image-search k-nearest-neighbors machine-learning neural-network numeric recommender-system rust rust-sci search-engine simd similarity-search vector-search
Last synced: 14 May 2025
https://github.com/unum-cloud/usearch
Fast Open-Source Search & Clustering engine Γ for Vectors & π Strings Γ in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram π
approximate-nearest-neighbor-search clustering database faiss full-text-search fuzzy-search image-search kann nearest-neighbor-search recommender-system search search-engine semantic-search simd similarity-search text-search vector-search webassembly
Last synced: 29 Mar 2025
https://github.com/vearch/vearch
Distributed vector search for AI-native applications
ai-native ai-native-database cloud-native document-retrieval embeddings hybrid-search rag retrieval-augmented-generation vector-database vector-search vectors
Last synced: 13 May 2025
https://github.com/devflowinc/trieve
All-in-one infrastructure for search, recommendations, RAG, and analytics offered via API
actix actix-web ai artificial-intelligence diesel embedding hacktoberfest llm postgresql qdrant qdrant-vector-database rag retrieval-augmented-generation rust search search-engine solidjs tailwindcss vector-search
Last synced: 13 May 2025
https://github.com/asg017/sqlite-vss
A SQLite extension for efficient vector search, based on Faiss!
faiss sqlite sqlite-extension vector-search
Last synced: 14 May 2025
https://github.com/mintplex-labs/vector-admin
The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.
ai ai-agents aitools chroma database-management document-retrieval embeddings flowise langchain langchain-js llms pinecone qdrant vector-data-management vector-database vector-database-embedding vector-search vectordatabase vectorspace weaviate
Last synced: 31 Oct 2025
https://github.com/Mintplex-Labs/vector-admin
The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.
ai ai-agents aitools chroma database-management document-retrieval embeddings flowise langchain langchain-js llms pinecone qdrant vector-data-management vector-database vector-database-embedding vector-search vectordatabase vectorspace weaviate
Last synced: 24 Mar 2025
https://github.com/patterns-ai-core/langchainrb
Build LLM-powered applications in Ruby
agents ai-agents artificial-intelligence machine-learning ml rubyml vector-search
Last synced: 12 May 2025
https://github.com/supabase-community/nextjs-openai-doc-search
Template for building your own custom ChatGPT style doc search powered by Next.js, OpenAI, and Supabase.
ai chatgpt nextjs openai postgres supabase template vector-search
Last synced: 15 May 2025
https://github.com/supabase-community/nextjs-openai-doc-search?og=v2
Template for building your own custom ChatGPT style doc search powered by Next.js, OpenAI, and Supabase.
ai chatgpt nextjs openai postgres supabase template vector-search
Last synced: 21 Apr 2025
https://github.com/supabase-community/nextjs-openai-doc-search?og=v2+%22Template+for+building+your+own+custom+ChatGPT+style+doc+search+powered+by+Next.js%2C+OpenAI%2C+and+Supabase.%22
Template for building your own custom ChatGPT style doc search powered by Next.js, OpenAI, and Supabase.
ai chatgpt nextjs openai postgres supabase template vector-search
Last synced: 09 Apr 2025
https://github.com/datastax/jvector
JVector: the most advanced embedded vector search engine
ann java knn machine-learning search-engine similarity-search vector-search
Last synced: 14 May 2025
https://github.com/jbellis/jvector
JVector: the most advanced embedded vector search engine
ann java knn machine-learning search-engine similarity-search vector-search
Last synced: 13 Mar 2025
https://github.com/qdrant/fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
embeddings openai rag retrieval retrieval-augmented-generation vector-search
Last synced: 26 Mar 2025
https://github.com/andreibondarev/langchainrb
Build LLM-powered applications in Ruby
agents ai-agents artificial-intelligence machine-learning ml rubyml vector-search
Last synced: 16 Mar 2025
https://github.com/ashvardanian/simsimd
Up to 200x Faster Dot Products & Similarity Metrics β for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 π
arm-neon arm-sve assembly avx2 avx512 bfloat16 blas blas-libraries distance-calculation float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search
Last synced: 13 May 2025
https://github.com/memfreeme/memfree
MemFree - Hybrid AI Search Engine & AI Page Generator
ai ai-search ai-search-engine devfast generate-ui hacktoberfest hacktoberfest-accepted hybrid-ai-search page-generator react search-engine serverless-vector shadcn-ui vector-search
Last synced: 14 May 2025
https://github.com/ashvardanian/SimSIMD
Up to 200x Faster Dot Products & Similarity Metrics β for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 π
arm-neon arm-sve assembly avx2 avx512 bfloat16 blas blas-libraries distance-calculation float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search
Last synced: 23 Mar 2025
https://github.com/unum-cloud/uform
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and π video, up to 5x faster than OpenAI CLIP and LLaVA πΌοΈ & ποΈ
bert clip clustering contrastive-learning cross-attention huggingface-transformers image-search language-vision llava multi-lingual multimodal neural-network openai openclip pretrained-models pytorch representation-learning semantic-search transformer vector-search
Last synced: 14 May 2025
https://github.com/qdrant/qdrant-client
Python client for Qdrant vector search engine
qdrant vector-database vector-search vector-search-engine
Last synced: 08 Jul 2025
https://github.com/myscale/myscaledb
A @ClickHouse fork that supports high-performance vector search and full-text search.
ann big-data embedding image-search llm myscaledb rag search-engine similarity-search sql sql-vector unstructured-analytics vector-search vectordb
Last synced: 01 Jul 2025
https://github.com/tantaraio/voy
πΈοΈπ¦ A WASM vector similarity search written in Rust
k-d-tree nearest-neighbor-search rust similarity-search vector-search wasm wasm-pack webassembly
Last synced: 01 Apr 2025
https://github.com/superlinear-ai/raglite
π₯€ RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite
chainlit colbert evals hybrid-search late-chunking late-interaction llm markdown pdf pgvector postgres postgresql query-adapter rag reranker reranking retrieval-augmented-generation sqlite tsvector vector-search
Last synced: 14 May 2025
https://github.com/myscale/MyScaleDB
A @ClickHouse fork that supports high-performance vector search and full-text search.
ann big-data embedding image-search llm myscaledb rag search-engine similarity-search sql sql-vector unstructured-analytics vector-search vectordb
Last synced: 12 Mar 2025
https://github.com/epsilla-cloud/vectordb
Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
ai chatgpt data data-science database embeddings embeddings-similarity infrastructure llms machine-learning neural-network neural-search rag retrieval search-engine vector-database vector-search
Last synced: 15 May 2025
https://github.com/rapidsai/raft
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
anns building-blocks clustering cuda distance gpu information-retrieval linear-algebra llm machine-learning nearest-neighbors neighborhood-methods primitives random-sampling solvers sparse statistics vector-search vector-similarity vector-store
Last synced: 14 May 2025
https://github.com/tensorchord/vectorchord
Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.
artificial-intelligence llmops postgresql vector-database vector-search
Last synced: 21 Jun 2025
https://github.com/azure/azure-search-vector-samples
A repository of code samples for Vector search capabilities in Azure AI Search.
azure azurecognitivesearch embeddings vector vector-search
Last synced: 14 May 2025
https://github.com/Azure/azure-search-vector-samples
A repository of code samples for Vector search capabilities in Azure AI Search.
azure azurecognitivesearch embeddings vector vector-search
Last synced: 25 Mar 2025
https://github.com/superlinked/superlinked
A compute framework for building Search, RAG, Recommendations and Analytics over complex structured & unstructured data.
data-pipeline deep-learning embeddings etl information-retrieval llm ml mlops natural-language-processing nlp python retrieval retrieval-augmented-generation semantic-search vector-database vector-search vectorization
Last synced: 13 Mar 2025
https://github.com/prithivirajdamodaran/flashrank
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.
cross-encoder full-text-search hybrid-search lexical-search rag ranking reranking retrieval-augmented-generation semantic-search vector-database vector-search
Last synced: 14 May 2025
https://github.com/weaviate/recipes
This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!
function-calling generative-ai llm-frameworks python retrieval-augmented-generation vector-database vector-search
Last synced: 15 May 2025
https://github.com/nuclia/nucliadb
NucliaDB, The AI Search database for RAG
ai-powered-search database language-model machine-learning mlops nuclia python rust search search-engine search-engines semantic semantic-search-engine text-classification unstructured-data vector-search vector-search-engine vectors
Last synced: 14 May 2025
https://github.com/PrithivirajDamodaran/FlashRank
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.
cross-encoder full-text-search hybrid-search lexical-search rag ranking reranking retrieval-augmented-generation semantic-search vector-database vector-search
Last synced: 08 May 2025
https://github.com/jina-ai/vectordb
A Python vector database you just need - no more, no less.
embedding-similarity neural-search sentence-embeddings vector-database vector-database-embedding vector-search
Last synced: 16 May 2025
https://github.com/unum-cloud/ustore
Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang ποΈ
acid apache-arrow arrow big-data bigdata database dataloader document-database graph-database iouring json key-value-store knn-search networkx nosql pandas python search spdk vector-search
Last synced: 11 Apr 2025
https://github.com/arcadedata/arcadedb
ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.
arcadedb database dbms distributed docker document embedded graph k8s key-value kubernetes multi-model orientdb search-engine similarity-search time-series vector-database vector-search
Last synced: 14 May 2025
https://github.com/redis-developer/arxivchatguru
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search
Last synced: 15 May 2025
https://github.com/redis-developer/ArXivChatGuru
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search
Last synced: 11 Apr 2025
https://github.com/redis-developer/ArxivChatGuru
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search
Last synced: 18 Jul 2025
https://github.com/anush008/fastembed-rs
Rust library for generating vector embeddings, reranking. Based on qdrant/fastembed.
embeddings fastembed rag reranker reranking retrieval retrieval-augmented-generation vector-search
Last synced: 02 Jan 2026
https://github.com/ArcadeData/arcadedb
ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.
arcadedb database dbms distributed docker document embedded graph k8s key-value kubernetes multi-model orientdb search-engine similarity-search time-series vector-database vector-search
Last synced: 23 Apr 2025
https://github.com/philippgille/chromem-go
Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence.
chroma chromadb cosine-similarity embedded embeddings go golang in-memory llm llms nearest-neighbor rag retrieval-augmented-generation vector-database vector-search
Last synced: 05 Apr 2025
https://github.com/Anush008/fastembed-rs
Rust library for generating vector embeddings, reranking locally
embeddings fastembed rag reranker reranking retrieval retrieval-augmented-generation vector-search
Last synced: 01 May 2025
https://github.com/kelindar/search
Go library for embedded vector search and semantic embeddings using llama.cpp
ai bert embeddings gguf gpu llamacpp search-engine semantic-search simd vector-search
Last synced: 16 May 2025
https://github.com/microsoft/rag-time
RAG Time: A 5-week Learning Journey to Mastering RAG
ai azure binary-quantization generative-ai gpt hnsw hybrid-search indexing keyword-search language-model llm matryoshka-representation-learning multimodal openai rag responsible-ai retrieval-augmented-generation scalar-quantization vector-search visual-studio-code
Last synced: 15 May 2025
https://github.com/rapidsai/cuvs
cuVS - a library for vector search and clustering on the GPU
anns clustering cuda distance gpu information-retrieval llm machine-learning nearest-neighbors neighborhood-methods similarity-search sparse statistics vector-search vector-similarity vector-store
Last synced: 14 May 2025
https://github.com/m1guelpf/tinyvector
A tiny embedding database in pure Rust.
embeddings embeddings-similarity machine-learning rust search-engines similarity-search vector-database vector-search
Last synced: 23 Oct 2025
https://github.com/bbc-esq/vectordb-plugin
Plugin that lets you ask questions about your documents including audio and video files.
bark database-management embedding-models embedding-vectors embeddings gtts koboldai koboldcpp python rag retrieval-augmented-generation retrieval-chatbot tiledb vector-data-management vector-database vector-search vision whisper whispers2t whisperspeech
Last synced: 16 May 2025
https://github.com/ukplab/gpl
Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
bert domain-adaptation information-retrieval nlp transformers vector-search
Last synced: 18 Jun 2025
https://github.com/UKPLab/gpl
Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
bert domain-adaptation information-retrieval nlp transformers vector-search
Last synced: 14 Jul 2025
https://github.com/qdrant/vector-db-benchmark
Framework for benchmarking vector search engines
benchmark vector-database vector-search vector-search-engine
Last synced: 15 May 2025
https://github.com/weaviate/weaviate-examples
Weaviate vector database β examples
deep-learning examples vector-database vector-search vector-search-engine weaviate
Last synced: 04 Mar 2025
https://github.com/vector-ai/vectorai
Vector AI β A platform for building vector based applications. Encode, query and analyse data using vectors.
artificial-intelligence clustering compare-vectors deep-learning embeddings encodings machine-learning neural-networks python pytorch search search-engine semantic-search tensorflow transformers vector vector-analytics vector-search vector-similarity vector-similarity-database
Last synced: 04 Apr 2025
https://github.com/edwinkys/oasysdb
An embedded vector database designed to run on edge devices. Lightweight and fast with HNSW indexing algorithm.
approximate-nearest-neighbors edge-ai edge-computing hnsw key-value-database open-source rest-api similarity-search vector-database vector-search
Last synced: 10 Apr 2025
https://github.com/superlinked/VectorHub
VectorHub is a free, open-source learning website for people (software developers to senior ML architects) interested in adding vector retrieval to their ML stack.
ai llm llmops ml mlops vector vector-database vector-search vectorops
Last synced: 10 Apr 2025
https://github.com/redis/redis-vl-python
Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.
embedding large-language-models llm llmcache openai python redis retrieval-augmented-generation semantic-cache vector-database vector-search
Last synced: 14 May 2025
https://github.com/geeks-of-data/knowledge-gpt
Extract knowledge from all information sources using gpt and other language models. Index and make Q&A session with information sources.
context embedding embedding-vectors gpt gpt3-turbo gpt4 huggingface huggingface-transformers information-extraction language-model llama llm natural-language-processing openai python question-answering scraper sentence-embeddings sentence-similarity vector-search
Last synced: 04 Apr 2025
https://github.com/BBC-Esq/VectorDB-Plugin
Plugin that lets you ask questions about your documents including audio and video files.
bark database-management embedding-models embedding-vectors embeddings gtts koboldai koboldcpp python rag retrieval-augmented-generation retrieval-chatbot tiledb vector-data-management vector-database vector-search vision whisper whispers2t whisperspeech
Last synced: 25 Oct 2025
https://github.com/esteininger/vector-search
The definitive guide to using Vector Search to solve your semantic search production workload needs.
lucene nlp search-engine vector-search
Last synced: 21 Jul 2025
https://github.com/RelevanceAI/relevanceai
Home of the AI workforce - Multi-agent system, AI agents & tools
clustering computer-vision embeddings natural-language-processing nlp python search search-engine unstructured-data vector-database vector-search
Last synced: 26 Aug 2025
https://github.com/jina-ai/annlite
β‘ A fast embedded library for approximate nearest neighbor search
approximate-nearest-neighbor-search cython hnsw image-search information-retrieval neural-search product-quantization vector-quantization vector-search
Last synced: 31 Jul 2025
https://github.com/nitaiaharoni1/vector-storage
Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses OpenAI embeddings to convert documents into vectors and allows searching for similar documents based on cosine similarity.
cosine-similarity embedding-vectors javascript local-storage localstorage lru-cache npm open-source openai semantic-search semantic-similarity typescript vector-database vector-db vector-search vector-similarity vector-similarity-database vector-similarity-search
Last synced: 16 May 2025
https://github.com/relevanceai/relevanceai
Home of the AI workforce - Multi-agent system, AI agents & tools
clustering computer-vision embeddings natural-language-processing nlp python search search-engine unstructured-data vector-database vector-search
Last synced: 15 May 2025
https://github.com/IngestAI/embedditor
β‘ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
datapreprocessing datascience embedding-vectors embeddings genai laravel llm markup-language ml nlp nltk php vector-database vector-search vectorization veml
Last synced: 28 Mar 2025
https://github.com/habedi/hann
A fast approximate nearest neighbor search library for Go
approximate-nearest-neighbor-search go golang indexing-algorithms nearest-neighbor-search search-algorithms similarity-search vector-search
Last synced: 20 Sep 2025
https://github.com/weaviate/weaviate-python-client
A python native client for easy interaction with a Weaviate instance.
Last synced: 14 May 2025
https://github.com/lyellr88/marm-systems
Turn AI into a memory-powered collaborator. Universal MCP Server enabling cross-platform AI memory, multi-agent coordination, and persistent context sharing. Built with MARM protocol for structured reasoning that evolves with your work.
claude-code context-management-system conversational-ai-chatbot developer-tools docker-image embeddings fastapi gemini-cli knowledge-based-systems mcp-server memory-management openai-api-chatbot semantic-search vector-search
Last synced: 10 Oct 2025