An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with vector-database

A curated list of projects in awesome lists tagged with vector-database.

https://github.com/mintplex-labs/anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

agent-framework-javascript ai-agents custom-ai-agents deepseek deepseek-r1 llama3 llm llm-webui lmstudio local-llm localai mcp mcp-servers multimodal no-code ollama qwen3 rag vector-database

Last synced: 31 Oct 2025

https://github.com/run-llama/llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

agents application data fine-tuning framework llamaindex llm multi-agents rag vector-database

Last synced: 09 Sep 2025

https://github.com/jerryjliu/llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

agents application data fine-tuning framework llamaindex llm multi-agents rag vector-database

Last synced: 04 Apr 2025

https://github.com/mem0ai/mem0

Memory for AI Agents; SOTA in AI Agent Memory, beating OpenAI Memory in accuracy by 26% - https://mem0.ai/research

agent ai aiagent application chatbots chatgpt embeddings llm long-term-memory memory memory-management python rag state-management vector-database

Last synced: 12 May 2025

https://github.com/pathwaycom/llm-app

Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳 Docker-friendly. ⚡ Always in sync with SharePoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

chatbot hugging-face llm llm-local llm-prompting llm-security llmops machine-learning open-ai pathway rag real-time retrieval-augmented-generation vector-database vector-index

Last synced: 12 May 2025

https://github.com/qdrant/qdrant

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

ai-search ai-search-engine embeddings-similarity hnsw image-search knn-algorithm machine-learning mlops nearest-neighbor-search neural-network neural-search recommender-system search search-engine search-engines similarity-search vector-database vector-search vector-search-engine

Last synced: 12 May 2025

https://github.com/weaviate/weaviate

Weaviate is an open-source vector database that stores both objects and vectors, combining vector search and structured filtering with the fault tolerance and scalability of a cloud-native database.

approximate-nearest-neighbor-search generative-search grpc hnsw hybrid-search image-search information-retrieval mlops nearest-neighbor-search neural-search recommender-system search-engine semantic-search semantic-search-engine similarity-search vector-database vector-search vector-search-engine vectors weaviate

Last synced: 12 May 2025

https://github.com/oramasearch/orama

🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.

algiorithm data-structures full-text javascript node search search-algorithm search-engine typescript typo-tolerance vector vector-database vector-database-embedding vector-search vector-search-engine

Last synced: 12 May 2025

https://github.com/askorama/orama

🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.

algiorithm data-structures full-text javascript node search search-algorithm search-engine typescript typo-tolerance vector vector-database vector-database-embedding vector-search vector-search-engine

Last synced: 09 Apr 2025

https://github.com/langchain4j/langchain4j

LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes implementing RAG, tool calling (including support for MCP), and agents easy. LangChain4j integrates seamlessly with various enterprise Java frameworks.

anthropic chatgpt chroma embeddings gemini gpt huggingface java langchain llama llm llms milvus ollama onnx openai openai-api pgvector pinecone vector-database

Last synced: 08 Oct 2025

https://github.com/oceanbase/oceanbase

OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.

analytics bigquery cloud-native cpp database distributed-database distributed-transactions hacktoberfest htap mysql mysql-compatibility mysql-database oceanbase olap oltp paxos scalable sql vector-database

Last synced: 13 May 2025

https://github.com/activeloopai/deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

ai computer-vision cv data-science datalake datasets deep-learning image-processing langchain large-language-models llm machine-learning ml mlops multi-modal python pytorch tensorflow vector-database vector-search

Last synced: 13 May 2025

https://github.com/reorproject/reor

Private & local AI personal knowledge management app for high entropy people.

ai lancedb llama llamacpp local-first markdown note-taking ollama pkm rag second-brain vector-database

Last synced: 12 May 2025

https://zilliztech.github.io/deep-searcher/

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

agent agentic-rag claude deep-research deepseek deepseek-r1 grok grok3 llama4 llm milvus openai qwen3 rag reasoning-models vector-database zilliz

Last synced: 22 Jul 2025

https://github.com/lancedb/lancedb

Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.

approximate-nearest-neighbor-search image-search nearest-neighbor-search recommender-system search-engine semantic-search similarity-search vector-database

Last synced: 12 May 2025

https://lancedb.github.io/lancedb/

Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.

approximate-nearest-neighbor-search image-search nearest-neighbor-search recommender-system search-engine semantic-search similarity-search vector-database

Last synced: 04 May 2025

https://github.com/mariadb/server

MariaDB server is a community developed fork of MySQL server. Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry.

amazon-web-services database fulltext-search galera geographical-information-system innodb json mariadb mysql nearest-neighbor-search rdbms relational-databases sql storage-engine vector-database

Last synced: 11 May 2025

https://github.com/redisearch/redisearch

A query and indexing engine for Redis, providing secondary indexing, full-text search, vector similarity search and aggregations.

fulltext geospatial gis inverted-index redis redis-module search search-engine vector-database

Last synced: 13 May 2025

https://github.com/RediSearch/RediSearch

A query and indexing engine for Redis, providing secondary indexing, full-text search, vector similarity search and aggregations.

fulltext geospatial gis inverted-index redis redis-module search search-engine vector-database

Last synced: 24 Mar 2025

https://github.com/MariaDB/server

MariaDB server is a community developed fork of MySQL server. Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry.

amazon-web-services database fulltext-search galera geographical-information-system innodb json mariadb mysql rdbms relational-databases sql storage-engine vector-database

Last synced: 28 Mar 2025

https://github.com/crate/crate

CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.

analytics big-data cratedb database dbms distributed distributed-database distributed-sql-database elasticsearch industrial-iot iot iot-analytics iot-database lucene olap postgresql sql time-series tsdb vector-database

Last synced: 13 May 2025

https://github.com/infiniflow/infinity

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

ai-native approximate-nearest-neighbor-search bm25 cpp20 cpp20-modules embedding full-text-search hnsw hybrid-search information-retrival nearest-neighbor-search rag search-engine tensor-database vector vector-database vector-search vectordatabase

Last synced: 12 May 2025

https://github.com/pinecone-io/examples

Jupyter Notebooks to help you get hands-on with Pinecone vector databases

ai jupyter-notebook llm python semantic-search vector-database

Last synced: 13 May 2025

https://github.com/pingcap/autoflow

pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tidb.ai

chatbot cot graphrag knowledge-graph mysql rag serverless vector-database

Last synced: 13 May 2025

https://github.com/tensorchord/pgvecto.rs

Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.

chatgpt faiss gpt hacktoberfest llm nearest-neighbor-search postgres rust vector vector-database

Last synced: 13 May 2025

https://github.com/volcengine/MineContext

MineContext is your proactive context-aware AI partner (Context-Engineering + ChatGPT Pulse)

agent context-engineering electron embedding-models memory proactive-ai python python3 rag react vector-database vision-language-model

Last synced: 21 Oct 2025

https://github.com/featureform/featureform

The Virtual Feature Store. Turn your existing data infrastructure into a feature store.

data-quality data-science embeddings embeddings-similarity feature-engineering feature-store hacktoberfest machine-learning ml mlops python vector-database

Last synced: 14 Dec 2025

https://github.com/firebase/genkit

An open source framework for building AI-powered apps with familiar code-centric patterns. Genkit makes it easy to develop, integrate, and test AI features with observability and evaluations. Genkit works with various models and platforms.

agents ai embedders genkit llm machine-learning multimodal rag vector-database

Last synced: 12 May 2025

https://github.com/zilliztech/attu

Web UI for Milvus Vector Database

attu milvus vector-database

Last synced: 24 Dec 2025

https://github.com/mintplex-labs/vector-admin

The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

ai ai-agents aitools chroma database-management document-retrieval embeddings flowise langchain langchain-js llms pinecone qdrant vector-data-management vector-database vector-database-embedding vector-search vectordatabase vectorspace weaviate

Last synced: 31 Oct 2025

https://github.com/Mintplex-Labs/vector-admin

The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

ai ai-agents aitools chroma database-management document-retrieval embeddings flowise langchain langchain-js llms pinecone qdrant vector-data-management vector-database vector-database-embedding vector-search vectordatabase vectorspace weaviate

Last synced: 24 Mar 2025

https://github.com/pixeltable/pixeltable

Pixeltable — Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.

ai artificial-intelligence chatbot computer-vision data-science database feature-engineering feature-store genai llm machine-learning ml mlops multimodal vector-database

Last synced: 16 Dec 2025

https://github.com/llphant/llphant

LLPhant - A comprehensive PHP Generative AI Framework using OpenAI GPT 4. Inspired by Langchain

agent autophp embeddings genai generative-ai gpt4 langchain laravel llamaindex openai php symfony vector-database

Last synced: 08 Oct 2025

https://github.com/qdrant/qdrant-client

Python client for Qdrant vector search engine

qdrant vector-database vector-search vector-search-engine

Last synced: 08 Jul 2025

https://github.com/pinecone-io/canopy

Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone

generative-ai llm rag vector-database

Last synced: 09 Jul 2025

https://github.com/LLPhant/LLPhant

LLPhant - A comprehensive PHP Generative AI Framework using OpenAI GPT 4. Inspired by Langchain

agent autophp embeddings genai generative-ai gpt4 langchain laravel llamaindex openai php symfony vector-database

Last synced: 20 Sep 2025

https://github.com/epsilla-cloud/vectordb

Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/

ai chatgpt data data-science database embeddings embeddings-similarity infrastructure llms machine-learning neural-network neural-search rag retrieval search-engine vector-database vector-search

Last synced: 15 May 2025

https://github.com/skywalkerdarren/chatweb

ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.

ai chatgpt crawler docx embedding faiss gpt gpt-35-turbo news-extractor newspaper openai pdf pgvector postgresql vector-database

Last synced: 25 Oct 2025

https://github.com/SkywalkerDarren/chatWeb

ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.

ai chatgpt crawler docx embedding faiss gpt gpt-35-turbo news-extractor newspaper openai pdf pgvector postgresql vector-database

Last synced: 30 Mar 2025

https://github.com/tensorchord/vectorchord

Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.

artificial-intelligence llmops postgresql vector-database vector-search

Last synced: 21 Jun 2025

https://github.com/neumtry/neumai

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

ai chatgpt data data-engineering database embeddings etl llm llmops mlops ops pipeline python rag retrieval vector-database vectors

Last synced: 29 Oct 2025

https://github.com/JSv4/OpenContracts

Enterprise-grade and API-first LLM workspace for unstructured documents, including data extraction, redaction, rights management, prompt playground, and more!

agent agentic-ai etl etl-pipeline llm prompt-engineering unstructured-data vector-database

Last synced: 08 May 2025

https://github.com/NeumTry/NeumAI

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

ai chatgpt data data-engineering database embeddings etl llm llmops mlops ops pipeline python rag retrieval vector-database vectors

Last synced: 11 Apr 2025

https://github.com/superlinked/superlinked

A compute framework for building Search, RAG, Recommendations and Analytics over complex structured & unstructured data.

data-pipeline deep-learning embeddings etl information-retrieval llm ml mlops natural-language-processing nlp python retrieval retrieval-augmented-generation semantic-search vector-database vector-search vectorization

Last synced: 13 Mar 2025

https://github.com/prithivirajdamodaran/flashrank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

cross-encoder full-text-search hybrid-search lexical-search rag ranking reranking retrieval-augmented-generation semantic-search vector-database vector-search

Last synced: 14 May 2025

https://github.com/pchunduri6/rag-demystified

An LLM-powered advanced RAG pipeline built from scratch

ai chatgpt gpt llm question-answering rag retrieval-augmented-generation vector-database

Last synced: 25 Mar 2025

https://github.com/lancedb/vectordb-recipes

High quality resources & applications for LLMs, multi-modal models and VectorDBs

agents ai deep-learning embeddings fine-tuning gpt gpt-4-vision langchain llama-index llms machine-learning multimodal openai rag vector-database

Last synced: 17 Oct 2025

https://github.com/weaviate/recipes

This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!

function-calling generative-ai llm-frameworks python retrieval-augmented-generation vector-database vector-search

Last synced: 15 May 2025

https://github.com/inspector-apm/neuron-ai

The PHP Agent Development Kit - powered by Inspector.dev

agent agentic-ai agentic-framework agents ai llm llm-inference llms php vector-database

Last synced: 21 Jun 2025

https://github.com/PrithivirajDamodaran/FlashRank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

cross-encoder full-text-search hybrid-search lexical-search rag ranking reranking retrieval-augmented-generation semantic-search vector-database vector-search

Last synced: 08 May 2025

https://github.com/skyzh/write-you-a-vector-db

A Vector Database Tutorial (over CMU-DB's BusTub system)

bustub database tutorial vector-database

Last synced: 05 Apr 2025

https://github.com/pgalko/bambooai

A Python library powered by Language Models (LLMs) for conversational data discovery and analysis.

ai ai-agents anthropic data-analysis data-science docker gemini groq llm mistral ollama openai-api pandas pinecone python vector-database vllm

Last synced: 15 May 2025

https://github.com/arcadedata/arcadedb

ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.

arcadedb database dbms distributed docker document embedded graph k8s key-value kubernetes multi-model orientdb search-engine similarity-search time-series vector-database vector-search

Last synced: 14 May 2025

https://github.com/hhblaze/dbreeze

C# .NET NoSQL (key-value and object store, embedded, with TextSearch, SemanticSearch, and Vector layer) ACID multi-paradigm database management system.

acid android c-sharp clustering database dotnet embedded key net netcore netstandard nosql search search-engine similarity-search text transaction value vector-database xamarin

Last synced: 14 May 2025

https://github.com/redis-developer/arxivchatguru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search

Last synced: 15 May 2025

https://github.com/redis-developer/ArXivChatGuru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search

Last synced: 11 Apr 2025

https://github.com/neonwatty/meme-search

The open source Meme Search Engine and Finder. Free and built to self-host locally with Python, Ruby, and Docker.

docker machine-learning python ruby-on-rails self-hosted vector-database vision-language-model

Last synced: 15 May 2025

https://github.com/redis-developer/ArxivChatGuru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

ai arxiv langchain machine-learning openai python question-answering rag redis retrieval retrieval-augmented-generation streamlit vector-database vector-search

Last synced: 18 Jul 2025

https://github.com/hhblaze/DBreeze

C# .NET NoSQL (embedded key-value store) ACID multi-paradigm database management system.

acid android c-sharp clustering database dotnet embedded key net netcore netstandard nosql search search-engine similarity-search text transaction value vector-database xamarin

Last synced: 14 Mar 2025

https://github.com/danny-avila/rag_api

ID-based RAG FastAPI: Integration with Langchain and PostgreSQL/pgvector

api api-rest embeddings fastapi langchain pgvector postgresql psql python rag vector vector-database

Last synced: 15 May 2025

https://github.com/ArcadeData/arcadedb

ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.

arcadedb database dbms distributed docker document embedded graph k8s key-value kubernetes multi-model orientdb search-engine similarity-search time-series vector-database vector-search

Last synced: 23 Apr 2025

https://github.com/philippgille/chromem-go

Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence.

chroma chromadb cosine-similarity embedded embeddings go golang in-memory llm llms nearest-neighbor rag retrieval-augmented-generation vector-database vector-search

Last synced: 05 Apr 2025

https://github.com/upstash/wikipedia-semantic-search

Semantic Search on Wikipedia with Upstash Vector

ai search semantic vector vector-database

Last synced: 26 Jun 2025

https://github.com/azure-samples/aisearch-openai-rag-audio

A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Search and Azure OpenAI's gpt-4o-realtime-preview model.

ai-azd-templates azd-templates azure azure-ai-search generative-ai gpt language-model openai rag retrieval-augmented-generation search vector-database

Last synced: 15 May 2025

https://github.com/pgalko/BambooAI

A lightweight library that leverages Language Models (LLMs) to enable natural language interactions, allowing you to source and converse with data.

ai ai-agents data-analysis data-science gemini groq llm mistral ollama openai-api pandas pinecone python vector-database

Last synced: 23 Mar 2025

https://github.com/guangzhengli/vectorhub

Quickly and easily build an AI website or application using embeddings!

chatgpt chatpdf embedding embeddings gpt gpt-3 nextjs supabase vector vector-database

Last synced: 05 Apr 2025

https://github.com/neurocult/agency

🕵️‍♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach.

agents ai artificial-general-intelligence artificial-intelligence artificial-neural-networks autonomous-agents chatgpt generative-ai go golang gpt language-models llm llmops machine-learning neural-network nlp openai rag vector-database

Last synced: 09 Apr 2025

https://github.com/superagent-ai/super-rag

Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.

agents ai embeddings inference rag vector-database

Last synced: 04 Apr 2025

https://github.com/HelixDB/helix-db

HelixDB is a powerful, graph-vector database built in Rust for millisecond query latency and ease of use.

ai cli database databases graph-database helix helixdb neo4j qdrant rag rust rust-crate rust-lang surrealdb vector vector-database vector-db vectorsearch

Last synced: 02 May 2025

https://github.com/souvikmajumder26/Multi-Agent-Medical-Assistant

⚕️GenAI powered multi-agentic medical diagnostics and healthcare research assistance chatbot. 🏥 Designed for healthcare professionals, researchers and patients.

agent agentic-ai agents chatbot computer-vision disease-detection genai genai-chatbot generative-ai guardrails langchain langgraph large-language-models llm medical-image-processing medical-imaging python rag retrieval-augmented-generation vector-database

Last synced: 02 May 2025

https://github.com/Azure-Samples/aisearch-openai-rag-audio

A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Search and Azure OpenAI's gpt-4o-realtime-preview model.

ai-azd-templates azd-templates azure azure-ai-search generative-ai gpt language-model openai rag retrieval-augmented-generation search vector-database

Last synced: 12 Oct 2025