An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with vector-database

A curated list of projects in awesome lists tagged with vector-database.

https://github.com/mintplex-labs/anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

ai-agents custom-ai-agents deepseek kimi llama3 llm lmstudio local-llm localai mcp mcp-servers moonshot multimodal no-code ollama qwen3 rag vector-database web-scraping

Last synced: 18 Feb 2026

https://github.com/run-llama/llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

agents application data fine-tuning framework llamaindex llm multi-agents rag vector-database

Last synced: 18 Feb 2026

https://github.com/jerryjliu/llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

agents application data fine-tuning framework llamaindex llm multi-agents rag vector-database

Last synced: 04 Apr 2025

https://github.com/mem0ai/mem0

Memory for AI Agents; SOTA in AI Agent Memory, beating OpenAI Memory in accuracy by 26% - https://mem0.ai/research

agent ai aiagent application chatbots chatgpt embeddings llm long-term-memory memory memory-management python rag state-management vector-database

Last synced: 15 Jan 2026

https://github.com/pathwaycom/llm-app

Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳 Docker-friendly. ⚡ Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

chatbot hugging-face llm llm-local llm-prompting llm-security llmops machine-learning open-ai pathway rag real-time retrieval-augmented-generation vector-database vector-index

Last synced: 12 May 2025

https://github.com/qdrant/qdrant

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

ai-search ai-search-engine embeddings-similarity hnsw image-search knn-algorithm machine-learning mlops nearest-neighbor-search neural-network neural-search recommender-system search search-engine search-engines similarity-search vector-database vector-search vector-search-engine

Last synced: 12 May 2025

https://github.com/weaviate/weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.

approximate-nearest-neighbor-search generative-search grpc hnsw hybrid-search image-search information-retrieval mlops nearest-neighbor-search neural-search recommender-system search-engine semantic-search semantic-search-engine similarity-search vector-database vector-search vector-search-engine vectors weaviate

Last synced: 24 Feb 2026

https://github.com/memvid/memvid

Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory.

ai context embedded faiss knowledge-base knowledge-graph llm machine-learning memory memvid mv2 nlp offline-first opencv python rag retrieval-augmented-generation semantic-search vector-database video-processing

Last synced: 15 Feb 2026

https://github.com/yichuan-w/leann

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

ai faiss gpt-oss langchain llama-index llm localstorage offline-first ollama privacy python rag retrieval-augmented-generation vector-database vector-search vectors

Last synced: 08 Mar 2026

https://github.com/oramasearch/orama

🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.

algiorithm data-structures full-text javascript node search search-algorithm search-engine typescript typo-tolerance vector vector-database vector-database-embedding vector-search vector-search-engine

Last synced: 12 May 2025

https://github.com/askorama/orama

🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.

algiorithm data-structures full-text javascript node search search-algorithm search-engine typescript typo-tolerance vector vector-database vector-database-embedding vector-search vector-search-engine

Last synced: 09 Apr 2025

https://github.com/langchain4j/langchain4j

LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes implementing RAG, tool calling (including support for MCP), and agents easy. LangChain4j integrates seamlessly with various enterprise Java frameworks.

anthropic chatgpt chroma embeddings gemini gpt huggingface java langchain llama llm llms milvus ollama onnx openai openai-api pgvector pinecone vector-database

Last synced: 08 Oct 2025

https://github.com/oceanbase/oceanbase

OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.

analytics bigquery cloud-native cpp database distributed-database distributed-transactions hacktoberfest htap mysql mysql-compatibility mysql-database oceanbase olap oltp paxos scalable sql vector-database

Last synced: 13 May 2025

https://github.com/activeloopai/deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

ai computer-vision cv data-science datalake datasets deep-learning image-processing langchain large-language-models llm machine-learning ml mlops multi-modal python pytorch tensorflow vector-database vector-search

Last synced: 11 Feb 2026

https://github.com/lancedb/lancedb

Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.

approximate-nearest-neighbor-search image-search nearest-neighbor-search recommender-system search-engine semantic-search similarity-search vector-database

Last synced: 17 Mar 2026

https://github.com/reorproject/reor

Private & local AI personal knowledge management app for high entropy people.

ai lancedb llama llamacpp local-first markdown note-taking ollama pkm rag second-brain vector-database

Last synced: 12 May 2025

https://zilliztech.github.io/deep-searcher/

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

agent agentic-rag claude deep-research deepseek deepseek-r1 grok grok3 llama4 llm milvus openai qwen3 rag reasoning-models vector-database zilliz

Last synced: 22 Jul 2025

https://lancedb.github.io/lancedb/

Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.

approximate-nearest-neighbor-search image-search nearest-neighbor-search recommender-system search-engine semantic-search similarity-search vector-database

Last synced: 04 May 2025

https://github.com/mariadb/server

MariaDB server is a community developed fork of MySQL server. Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry.

amazon-web-services database fulltext-search galera geographical-information-system innodb json mariadb mysql nearest-neighbor-search rdbms relational-databases sql storage-engine vector-database

Last synced: 11 May 2025

https://github.com/redisearch/redisearch

A query and indexing engine for Redis, providing secondary indexing, full-text search, vector similarity search and aggregations.

fulltext geospatial gis inverted-index redis redis-module search search-engine vector-database

Last synced: 13 May 2025

https://github.com/RediSearch/RediSearch

A query and indexing engine for Redis, providing secondary indexing, full-text search, vector similarity search and aggregations.

fulltext geospatial gis inverted-index redis redis-module search search-engine vector-database

Last synced: 24 Mar 2025

https://github.com/MariaDB/server

MariaDB server is a community developed fork of MySQL server. Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry.

amazon-web-services database fulltext-search galera geographical-information-system innodb json mariadb mysql rdbms relational-databases sql storage-engine vector-database

Last synced: 28 Mar 2025

https://github.com/crate/crate

CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.

analytics big-data cratedb database dbms distributed distributed-database distributed-sql-database elasticsearch industrial-iot iot iot-analytics iot-database lucene olap postgresql sql time-series tsdb vector-database

Last synced: 16 Jan 2026

https://github.com/infiniflow/infinity

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

ai-native approximate-nearest-neighbor-search bm25 cpp20 cpp20-modules embedding full-text-search hnsw hybrid-search information-retrival nearest-neighbor-search rag search-engine tensor-database vector vector-database vector-search vectordatabase

Last synced: 12 May 2025

https://github.com/pinecone-io/examples

Jupyter Notebooks to help you get hands-on with Pinecone vector databases

ai jupyter-notebook llm python semantic-search vector-database

Last synced: 13 May 2025

https://github.com/pingcap/autoflow

pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tidb.ai

chatbot cot graphrag knowledge-graph mysql rag serverless vector-database

Last synced: 13 May 2025

https://github.com/tensorchord/pgvecto.rs

Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.

chatgpt faiss gpt hacktoberfest llm nearest-neighbor-search postgres rust vector vector-database

Last synced: 13 May 2025

https://github.com/volcengine/MineContext

MineContext is your proactive context-aware AI partner (Context-Engineering + ChatGPT Pulse)

agent context-engineering electron embedding-models memory proactive-ai python python3 rag react vector-database vision-language-model

Last synced: 21 Oct 2025

https://github.com/featureform/featureform

The Virtual Feature Store. Turn your existing data infrastructure into a feature store.

data-quality data-science embeddings embeddings-similarity feature-engineering feature-store hacktoberfest machine-learning ml mlops python vector-database

Last synced: 14 Dec 2025

https://github.com/firebase/genkit

An open source framework for building AI-powered apps with familiar code-centric patterns. Genkit makes it easy to develop, integrate, and test AI features with observability and evaluations. Genkit works with various models and platforms.

agents ai embedders genkit llm machine-learning multimodal rag vector-database

Last synced: 13 Mar 2026

https://github.com/zilliztech/attu

Web UI for Milvus Vector Database

attu milvus vector-database

Last synced: 24 Dec 2025

https://github.com/mintplex-labs/vector-admin

The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

ai ai-agents aitools chroma database-management document-retrieval embeddings flowise langchain langchain-js llms pinecone qdrant vector-data-management vector-database vector-database-embedding vector-search vectordatabase vectorspace weaviate

Last synced: 31 Oct 2025

https://github.com/Mintplex-Labs/vector-admin

The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

ai ai-agents aitools chroma database-management document-retrieval embeddings flowise langchain langchain-js llms pinecone qdrant vector-data-management vector-database vector-database-embedding vector-search vectordatabase vectorspace weaviate

Last synced: 24 Mar 2025

https://github.com/neuron-core/neuron-ai

The PHP Agentic Framework to build production-ready AI driven applications. Connect components (LLMs, vector DBs, memory) to agents that can interact with your data. With its modular architecture it's best suited for building RAG, multi-agent workflows, or business process automations.

agent agentic-ai agentic-framework agents ai llm llm-inference llms php vector-database

Last synced: 06 Mar 2026

https://github.com/pixeltable/pixeltable

Pixeltable — Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.

ai artificial-intelligence chatbot computer-vision data-science database feature-engineering feature-store genai llm machine-learning ml mlops multimodal vector-database

Last synced: 01 Mar 2026

https://github.com/superlinked/superlinked

Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.

data-pipeline deep-learning embeddings etl information-retrieval llm ml mlops natural-language-processing nlp python retrieval retrieval-augmented-generation semantic-search vector-database vector-search vectorization

Last synced: 16 Jan 2026

https://github.com/doobidoo/mcp-memory-service

Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.

agent-memory agentic-ai ai-agents autogen claude crewai knowledge-graph langgraph long-term-memory mcp mcp-server memory model-context-protocol multi-agent open-source rag semantic-search sqlite-vec vector-database vector-storage

Last synced: 10 Mar 2026

https://github.com/llphant/llphant

LLPhant - A comprehensive PHP Generative AI Framework using OpenAI GPT 4. Inspired by Langchain

agent autophp embeddings genai generative-ai gpt4 langchain laravel llamaindex openai php symfony vector-database

Last synced: 07 Mar 2026

https://github.com/qdrant/qdrant-client

Python client for Qdrant vector search engine

qdrant vector-database vector-search vector-search-engine

Last synced: 08 Jul 2025

https://github.com/pinecone-io/canopy

Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone

generative-ai llm rag vector-database

Last synced: 09 Jul 2025

https://github.com/LLPhant/LLPhant

LLPhant - A comprehensive PHP Generative AI Framework using OpenAI GPT 4. Inspired by Langchain

agent autophp embeddings genai generative-ai gpt4 langchain laravel llamaindex openai php symfony vector-database

Last synced: 20 Sep 2025

https://github.com/epsilla-cloud/vectordb

Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/

ai chatgpt data data-science database embeddings embeddings-similarity infrastructure llms machine-learning neural-network neural-search rag retrieval search-engine vector-database vector-search

Last synced: 15 May 2025

https://github.com/skywalkerdarren/chatweb

ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.

ai chatgpt crawler docx embedding faiss gpt gpt-35-turbo news-extractor newspaper openai pdf pgvector postgresql vector-database

Last synced: 25 Oct 2025

https://github.com/SkywalkerDarren/chatWeb

ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.

ai chatgpt crawler docx embedding faiss gpt gpt-35-turbo news-extractor newspaper openai pdf pgvector postgresql vector-database

Last synced: 30 Mar 2025

https://github.com/tensorchord/vectorchord

Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.

artificial-intelligence llmops postgresql vector-database vector-search

Last synced: 21 Jun 2025

https://github.com/neumtry/neumai

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

ai chatgpt data data-engineering database embeddings etl llm llmops mlops ops pipeline python rag retrieval vector-database vectors

Last synced: 29 Oct 2025

https://github.com/JSv4/OpenContracts

Enterprise-grade and API-first LLM workspace for unstructured documents, including data extraction, redaction, rights management, prompt playground, and more!

agent agentic-ai etl etl-pipeline llm prompt-engineering unstructured-data vector-database

Last synced: 08 May 2025

https://github.com/NeumTry/NeumAI

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

ai chatgpt data data-engineering database embeddings etl llm llmops mlops ops pipeline python rag retrieval vector-database vectors

Last synced: 11 Apr 2025

https://github.com/prithivirajdamodaran/flashrank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

cross-encoder full-text-search hybrid-search lexical-search rag ranking reranking retrieval-augmented-generation semantic-search vector-database vector-search

Last synced: 14 May 2025

https://github.com/pchunduri6/rag-demystified

An LLM-powered advanced RAG pipeline built from scratch

ai chatgpt gpt llm question-answering rag retrieval-augmented-generation vector-database

Last synced: 25 Mar 2025

https://github.com/lancedb/vectordb-recipes

High quality resources & applications for LLMs, multi-modal models and VectorDBs

agents ai deep-learning embeddings fine-tuning gpt gpt-4-vision langchain llama-index llms machine-learning multimodal openai rag vector-database

Last synced: 17 Oct 2025

https://github.com/weaviate/recipes

This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!

function-calling generative-ai llm-frameworks python retrieval-augmented-generation vector-database vector-search

Last synced: 15 May 2025

https://github.com/inspector-apm/neuron-ai

The PHP Agent Development Kit - powered by Inspector.dev

agent agentic-ai agentic-framework agents ai llm llm-inference llms php vector-database

Last synced: 21 Jun 2025

https://github.com/kossakovsky/n8n-install

🚀 Self-hosted AI automation platform. Deploy n8n, Ollama, Flowise, RAG, Supabase & 30+ tools with one command. Auto HTTPS. Free Zapier/Make alternative.

ai ai-agents automation chatgpt-alternative dify docker flowise homelab llm local-llm make-alternative n8n no-code ollama open-webui qdrant rag self-hosted vector-database zapier-alternative

Last synced: 16 Mar 2026

https://github.com/PrithivirajDamodaran/FlashRank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

cross-encoder full-text-search hybrid-search lexical-search rag ranking reranking retrieval-augmented-generation semantic-search vector-database vector-search

Last synced: 08 May 2025

https://github.com/skyzh/write-you-a-vector-db

A Vector Database Tutorial (over CMU-DB's BusTub system)

bustub database tutorial vector-database

Last synced: 05 Apr 2025

https://github.com/christopherkarani/Wax

🍯 Memory layer for on-device AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer.

ai-agents cli coreml coreml-framework data-science machine-learning mcp mcp-server memory memory-cache memory-hacking metal on-device-ai rag rag-pipeline swift vector-database vector-embeddings vector-search vectordb

Last synced: 04 Mar 2026

https://github.com/pgalko/bambooai

A Python library powered by Language Models (LLMs) for conversational data discovery and analysis.

ai ai-agents anthropic data-analysis data-science docker gemini groq llm mistral ollama openai-api pandas pinecone python vector-database vllm

Last synced: 15 May 2025

https://github.com/arcadedata/arcadedb

ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.

arcadedb database dbms distributed docker document embedded graph k8s key-value kubernetes multi-model orientdb search-engine similarity-search time-series vector-database vector-search

Last synced: 14 May 2025

https://github.com/hhblaze/dbreeze

C# .NET NOSQL (key value, object store embedded TextSearch SemanticSearch Vector layer) ACID multi-paradigm database management system.

acid android c-sharp clustering database dotnet embedded key net netcore netstandard nosql search search-engine similarity-search text transaction value vector-database xamarin

Last synced: 20 Feb 2026

https://github.com/redis-developer/arxivchatguru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search

Last synced: 15 May 2025

https://github.com/redis-developer/ArXivChatGuru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search

Last synced: 11 Apr 2025

https://github.com/neonwatty/meme-search

The open source Meme Search Engine and Finder. Free and built to self-host locally with Python, Ruby, and Docker.

docker machine-learning python ruby-on-rails self-hosted vector-database vision-language-model

Last synced: 15 May 2025

https://github.com/redis-developer/ArxivChatGuru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search

Last synced: 18 Jul 2025

https://github.com/hhblaze/DBreeze

C# .NET NOSQL (key value store embedded) ACID multi-paradigm database management system.

acid android c-sharp clustering database dotnet embedded key net netcore netstandard nosql search search-engine similarity-search text transaction value vector-database xamarin

Last synced: 14 Mar 2025

https://github.com/danny-avila/rag_api

ID-based RAG FastAPI: Integration with Langchain and PostgreSQL/pgvector

api api-rest embeddings fastapi langchain pgvector postgresql psql python rag vector vector-database

Last synced: 15 May 2025

https://github.com/ArcadeData/arcadedb

ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.

arcadedb database dbms distributed docker document embedded graph k8s key-value kubernetes multi-model orientdb search-engine similarity-search time-series vector-database vector-search

Last synced: 23 Apr 2025

https://github.com/philippgille/chromem-go

Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence.

chroma chromadb cosine-similarity embedded embeddings go golang in-memory llm llms nearest-neighbor rag retrieval-augmented-generation vector-database vector-search

Last synced: 05 Apr 2025

https://github.com/upstash/wikipedia-semantic-search

Semantic Search on Wikipedia with Upstash Vector

ai search semantic vector vector-database

Last synced: 26 Jun 2025

https://github.com/azure-samples/aisearch-openai-rag-audio

A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Search and Azure OpenAI's gpt-4o-realtime-preview model.

ai-azd-templates azd-templates azure azure-ai-search generative-ai gpt language-model openai rag retrieval-augmented-generation search vector-database

Last synced: 15 May 2025

https://github.com/pgalko/BambooAI

A lightweight library that leverages Language Models (LLMs) to enable natural language interactions, allowing you to source and converse with data.

ai ai-agents data-analysis data-science gemini groq llm mistral ollama openai-api pandas pinecone python vector-database

Last synced: 23 Mar 2025

https://github.com/verygoodplugins/automem

AutoMem is a graph-vector memory service that gives AI assistants durable, relational memory:

ai ai-memory anthropic falkordb graph-database llm memory qdrant vector-database

Last synced: 21 Jan 2026