An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with llmops

A curated list of projects in awesome lists tagged with llmops .

https://github.com/langgenius/dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

agent ai anthropic backend-as-a-service chatbot gemini genai gpt gpt-4 llama3 llm llmops nextjs openai orchestration python rag workflow workflows

Last synced: 17 Mar 2026

https://github.com/vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

amd cuda deepseek gpt hpu inference inferentia llama llm llm-serving llmops mlops model-serving pytorch qwen rocm tpu trainium transformer xpu

Last synced: 29 Jan 2026

https://github.com/berriai/litellm

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM]

ai-gateway anthropic azure-openai bedrock gateway langchain litellm llm llm-gateway llmops mcp-gateway openai openai-proxy vertex-ai

Last synced: 24 May 2026

https://github.com/composiohq/composio

Composio powers 1000+ toolkits, tool search, context management, authentication, and a sandboxed workbench to help you build AI agents that turn intent into action.

agentic-ai agents ai ai-agents aiagents developer-tools function-calling gpt-4 javascript js llm llmops mcp python remote-mcp-server sse typescript

Last synced: 12 May 2026

https://github.com/ComposioHQ/composio

Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling

agents ai ai-agents aiagents developer-tools function-calling gpt-4 gpt-4o hacktoberfest hacktoberfest2024 javascript js llm llmops python typescript

Last synced: 24 Mar 2025

https://github.com/pathwaycom/llm-app

Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

chatbot hugging-face llm llm-local llm-prompting llm-security llmops machine-learning open-ai pathway rag real-time retrieval-augmented-generation vector-database vector-index

Last synced: 12 May 2025

https://github.com/mlflow/mlflow

The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

agentops agents ai ai-governance apache-spark evaluation langchain llm-evaluation llmops machine-learning ml mlflow mlops model-management observability open-source openai prompt-engineering

Last synced: 06 May 2026

https://github.com/BerriAI/litellm

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

ai-gateway anthropic azure-openai bedrock gateway langchain llm llm-gateway llmops openai openai-proxy vertex-ai

Last synced: 23 Mar 2025

https://github.com/comet-ml/opik

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

evaluation hacktoberfest hacktoberfest2025 langchain llama-index llm llm-evaluation llm-observability llmops open-source openai playground prompt-engineering

Last synced: 14 May 2026

https://github.com/liguodongiot/llm-action

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

llm llm-inference llm-serving llm-training llmops

Last synced: 15 May 2025

https://github.com/transformeroptimus/superagi

<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.

agents agi ai artificial-general-intelligence artificial-intelligence autonomous-agents gpt-4 hacktoberfest llm llmops nextjs openai pinecone python superagi

Last synced: 12 May 2025

https://github.com/TransformerOptimus/SuperAGI

<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.

agents agi ai artificial-general-intelligence artificial-intelligence autonomous-agents gpt-4 hacktoberfest llm llmops nextjs openai pinecone python superagi

Last synced: 27 Mar 2025

https://github.com/bentoml/openllm

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

bentoml fine-tuning llama llama2 llama3-1 llama3-2 llama3-2-vision llm llm-inference llm-ops llm-serving llmops mistral mlops model-inference open-source-llm openllm vicuna

Last synced: 23 Oct 2025

https://github.com/langfuse/langfuse

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

analytics autogen evaluation langchain large-language-models llama-index llm llm-evaluation llm-observability llmops monitoring observability open-source openai playground prompt-engineering prompt-management self-hosted ycombinator

Last synced: 13 Mar 2026

https://github.com/tensorzero/tensorzero

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

ai ai-engineering anthropic artificial-intelligence deep-learning genai generative-ai gpt large-language-models llama llm llmops llms machine-learning ml ml-engineering mlops openai python rust

Last synced: 16 Jan 2026

https://github.com/explodinggradients/ragas

Supercharge Your LLM Application Evaluations 🚀

evaluation llm llmops

Last synced: 22 Aug 2025

https://github.com/bentoml/OpenLLM

Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.

bentoml fine-tuning llama llama2 llama3-1 llama3-2 llama3-2-vision llm llm-inference llm-ops llm-serving llmops mistral mlops model-inference open-source-llm openllm vicuna

Last synced: 14 Mar 2025

https://github.com/explodinggradients/ragas?tab=readme-ov-file

Supercharge Your LLM Application Evaluations 🚀

evaluation llm llmops

Last synced: 04 Apr 2025

https://github.com/dataelement/bisheng

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.

agent ai chatbot enterprise finetune genai gpt langchian llama llm llmdevops llmops ocr openai orchestration python rag react sft workflow

Last synced: 28 Jan 2026

https://github.com/portkey-ai/gateway

A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.

ai-gateway gateway generative-ai hacktoberfest langchain llama-index llmops llms openai prompt-engineering router

Last synced: 13 May 2025

https://github.com/bentoml/bentoml

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

ai-inference deep-learning generative-ai inference-platform llm llm-inference llm-serving llmops machine-learning ml-engineering mlops model-inference-service model-serving multimodal python

Last synced: 06 Mar 2026

https://github.com/bentoml/BentoML

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and much more!

ai-inference deep-learning generative-ai inference-platform llm llm-inference llm-serving llmops machine-learning ml-engineering mlops model-inference-service model-serving multimodal python

Last synced: 12 Mar 2025

https://github.com/evidentlyai/evidently

Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

data-drift data-quality data-science data-validation generative-ai hacktoberfest html-report jupyter-notebook llm llmops machine-learning mlops model-monitoring pandas-dataframe

Last synced: 13 May 2025

https://github.com/0xplaygrounds/rig

⚙️🦀 Build modular and scalable LLM Applications in Rust

agent ai artificial-intelligence automation generative-ai large-language-model llm llmops rust scalable-ai

Last synced: 14 Apr 2026

https://github.com/promptfoo/promptfoo

Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

ci ci-cd cicd evaluation evaluation-framework llm llm-eval llm-evaluation llm-evaluation-framework llmops pentesting prompt-engineering prompt-testing prompts rag red-teaming testing vulnerability-scanners

Last synced: 03 Mar 2026

https://github.com/googlecloudplatform/agent-starter-pack

Ship AI Agents to Google Cloud in minutes, not months. Production-ready templates with built-in CI/CD, evaluation, and observability.

agents gcp gemini genai-agents generative-ai llmops mlops observability

Last synced: 21 Jan 2026

https://github.com/Portkey-AI/gateway

A Blazing Fast AI Gateway. Route to 200+ LLMs with 1 fast & friendly API.

ai-gateway gateway generative-ai langchain llama-index llmops llms openai prompt-engineering router

Last synced: 24 Mar 2025

https://github.com/maximhq/bifrost

Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead at 5k RPS.

ai-gateway gateway gateway-services generative-ai guardrails llm llm-cost llm-gateway llm-observability llmops load-balancing mcp-client mcp-gateway mcp-server model-router token-management

Last synced: 07 Jun 2026

https://github.com/agenta-ai/agenta

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

agents evaluation llm-as-a-judge llm-evaluation llm-framework llm-monitoring llm-observability llm-platform llm-playground llm-tools llmops observability prompt-engineering prompt-management rag-evaluation

Last synced: 11 Mar 2026

https://github.com/decodingml/llm-twin-course

🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 12 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 𝘭𝘦𝘴𝘴𝘰𝘯𝘴

aws bytewax comet-ml course docker generative-ai infrastructure-as-code large-language-models llmops machine-learning-engineering ml-system-design mlops pulumi qdrant qwak rag superlinked

Last synced: 13 May 2025

https://github.com/tencentmusic/cube-studio

cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式

ai aihub argo automl gpt inference kubeflow kubernetes llmops mlops notebook pipeline pytorch spark vgpu workflow

Last synced: 06 Feb 2026

https://github.com/iusztinpaul/hands-on-llms

🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training, and deploying a real-time financial advisor LLM system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 𝘷𝘪𝘥𝘦𝘰 & 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴

3-pipeline-design aws beam bytewax cicd comet-ml docker fine-tuning generative-ai huggingface langchain llmops llms mlops qdrant qlora streaming transformers

Last synced: 10 Aug 2025

https://josh-xt.github.io/AGiXT/

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python

Last synced: 25 Sep 2025

https://github.com/josh-xt/agixt

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python

Last synced: 15 Mar 2026

https://github.com/predibase/lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

fine-tuning gpt llama llm llm-inference llm-serving llmops lora model-serving pytorch transformers

Last synced: 12 May 2025

https://github.com/PacktPublishing/LLM-Engineers-Handbook

The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices

aws fine-tuning-llm genai llm llm-evaluation llmops ml-system-design mlops rag

Last synced: 27 Jul 2025

https://github.com/pezzolabs/pezzo

🕹️ Open-source, developer-first LLMOps platform designed to streamline prompt design, version management, instant delivery, collaboration, troubleshooting, observability and more.

ai devtools gpt-3 gpt-4 hacktoberfest javascript langchain llm llmops monitoring nestjs nodejs observability openai platform prompt prompt-engineering prompt-management python typescript

Last synced: 13 May 2025

https://github.com/langwatch/langwatch

The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨

ai analytics datasets dspy evaluation gpt llm llm-ops llmops low-code observability openai prompt-engineering

Last synced: 28 Apr 2026

https://github.com/ianarawjo/ChainForge?tab=readme-ov-file

An open-source visual programming environment for battle-testing prompts to LLMs.

ai evaluation large-language-models llmops llms prompt-engineering

Last synced: 08 May 2025

https://github.com/ianarawjo/chainforge

An open-source visual programming environment for battle-testing prompts to LLMs.

ai evaluation large-language-models llmops llms prompt-engineering

Last synced: 14 May 2025

https://github.com/openpipe/openpipe

Turn expensive prompts into cheap fine-tuned models

ai llm llmops prompt-engineering

Last synced: 14 May 2025

https://github.com/OpenPipe/OpenPipe

Turn expensive prompts into cheap fine-tuned models

ai llm llmops prompt-engineering

Last synced: 04 Apr 2025

https://github.com/Josh-XT/AGiXT

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

agent-llm agi agixt ai artificial automation chromadb intelligence llama llamacpp llm llmops openai python

Last synced: 24 Mar 2025

https://github.com/openlit/openlit

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers, VectorDBs, Agent Frameworks and GPUs.

ai-observability amd-gpu clickhouse distributed-tracing genai gpu-monitoring grafana langchain llmops llms metrics monitoring-tool nvidia-smi observability open-source openai opentelemetry otlp python tracing

Last synced: 01 May 2026

https://github.com/bionic-gpt/bionic-gpt

Bionic is an on-premise replacement for ChatGPT, offering the advantages of Generative AI while maintaining strict data confidentiality

architecture full-stack llmops llms rust

Last synced: 30 Jan 2026

https://github.com/uptrain-ai/uptrain

UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.

autoevaluation evaluation experimentation hallucination-detection jailbreak-detection llm-eval llm-prompting llm-test llmops machine-learning monitoring openai-evals prompt-engineering root-cause-analysis

Last synced: 14 May 2025

https://github.com/ianarawjo/ChainForge

An open-source visual programming environment for battle-testing prompts to LLMs.

ai evaluation large-language-models llmops llms prompt-engineering

Last synced: 27 Mar 2025

https://github.com/GoogleCloudPlatform/agent-starter-pack

A collection of production-ready Generative AI Agent templates built for Google Cloud. It accelerates development by providing a holistic, production-ready solution, addressing common challenges (Deployment & Operations, Evaluation, Customization, Observability) in building and deploying GenAI agents.

agents gcp gemini genai-agents generative-ai llmops mlops observability

Last synced: 28 Jun 2025

https://github.com/DAGWorks-Inc/hamilton

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

dag data-analysis data-engineering data-science dataframe etl etl-framework etl-pipeline feature-engineering hacktoberfest lineage llmops machine-learning mlops orchestration pandas python rag software-engineering

Last synced: 26 Mar 2025

https://github.com/Helicone/helicone

🧊 Open source LLM-Observability Platform for Developers. One-line integration for monitoring, metrics, evals, agent tracing, prompt management, playground, etc. Supports OpenAI SDK, Vercel AI SDK, Anthropic SDK, LiteLLM, LLamaIndex, LangChain, and more. 🍓 YC W23

agent-monitoring analytics evaluation gpt langchain large-language-models llama-index llm llm-cost llm-evaluation llm-observability llmops monitoring open-source openai playground prompt-engineering prompt-management ycombinator

Last synced: 31 Mar 2025

https://github.com/lmnr-ai/lmnr

Laminar - open-source all-in-one platform for engineering AI products. Crate data flywheel for you AI app. Traces, Evals, Datasets, Labels. YC S24.

agents ai ai-observability aiops analytics developer-tools evals evaluation llm-evaluation llm-observability llm-workflow llmops monitoring observability open-source pipeline-builder rag rust-lang self-hosted

Last synced: 14 Apr 2026

https://github.com/apache/burr

Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.

ai burr chatbot-framework dags generative-ai graphs hacktoberfest llmops llms mlops persistent-data-structure state-machine state-management visibility

Last synced: 08 Jul 2025

https://github.com/agentera/agently

[GenAI Application Development Framework] 🚀 Build GenAI application quick and easy 💬 Easy to interact with GenAI agent in code using structure data and chained-calls syntax 🧩 Use Event-Driven Flow *TriggerFlow* to manage complex GenAI working logic 🔀 Switch to any model without rewrite application code

agent agent-based-framework agent-framework chatglm claude deepseek ernie framework gemini google-gemini gpt llm-agent llm-application llm-apps llm-framework llmops llms minimax python

Last synced: 25 May 2026

https://github.com/trypromptly/LLMStack

No-code multi-agent framework to build LLM Agents, workflows and applications with your data

agents ai ai-agents-framework generative-ai llm-agents llm-chain llm-framework llmops llms no-code-ai platform

Last synced: 28 Mar 2025

https://github.com/decodingai-magazine/second-brain-ai-assistant-course

Learn to build your Second Brain AI assistant with LLMs, agents, RAG, fine-tuning, LLMOps and AI systems techniques.

agents ai-systems data-engineering fine-tuning huggingface llm llmops mlops openai python rag

Last synced: 18 Jan 2026

https://github.com/Maplemx/Agently

[GenAI Application Development Framework] 🚀 Build GenAI application quick and easy 💬 Easy to interact with GenAI agent in code using structure data and chained-calls syntax 🧩 Use Agently Workflow to manage complex GenAI working logic 🔀 Switch to any model without rewrite application code

agent agent-based-framework agent-framework chatglm claude ernie framework gemini google-gemini gpt llm-agent llm-application llm-apps llm-framework llmops llms minimax python wenxinyiyan

Last synced: 06 May 2025

https://github.com/ThousandBirdsInc/chidori

A reactive runtime for building durable AI agents

agents ai debugging framework llmops llms orchestration

Last synced: 31 Mar 2025

https://github.com/thousandbirdsinc/chidori

A reactive runtime for building durable AI agents

agents ai debugging framework llmops llms orchestration

Last synced: 14 May 2025

https://github.com/Agenta-AI/agenta

The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.

human-annotation langchain large-language-models llama-index llm llm-evaluation llm-framework llm-tools llmops llms prompt-engineering prompt-management prompt-toolkit rag rag-evaluation

Last synced: 13 Mar 2025

https://github.com/DAGWorks-Inc/burr

Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.

ai burr chatbot-framework dags generative-ai graphs hacktoberfest llmops llms mlops persistent-data-structure state-machine state-management visibility

Last synced: 16 Apr 2025

https://github.com/prometheus-eval/prometheus-eval

Evaluate your LLM's response with Prometheus and GPT4 💯

evaluation gpt4 litellm llm llm-as-a-judge llm-as-evaluator llmops python vllm

Last synced: 06 Apr 2026

https://github.com/dillionverma/llm.report

📊 llm.report is an open-source logging and analytics platform for OpenAI: Log your ChatGPT API requests, analyze costs, and improve your prompts.

aiops gpt-3 gpt-4 llm llmops mlops nextjs nodejs open-source openai react shadcn-ui typescript

Last synced: 16 May 2025

https://github.com/ajndkr/lanarky

The web framework for building LLM microservices

fastapi llmops microservices python3 web

Last synced: 15 May 2025

https://github.com/msoedov/langcorn

⛓️ Serving LangChain LLM apps and agents automagically with FastApi. LLMops

api fastapi langchain langchain-python large-language-models llm llmops openai-api rest-api vercel vercel-serverless-functions

Last synced: 15 May 2025

https://github.com/scale3-labs/langtrace

Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorDBs and more.. Integrate using Typescript, Python. 🚀💻📊

ai datasets evaluations gpt langchain llm llm-framework llmops observability open-source open-telemetry openai prompt-engineering tracing

Last synced: 15 May 2025

https://github.com/alibaba/rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

gpt inference llama llm llm-serving llmops model-serving

Last synced: 14 Oct 2025

https://github.com/tensorchord/vectorchord

Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.

artificial-intelligence llmops postgresql vector-database vector-search

Last synced: 21 Jun 2025

https://github.com/getmetal/motorhead

🧠 Motorhead is a memory and information retrieval server for LLMs.

llmops llms machine-learning ml mlops rust

Last synced: 28 Mar 2025

https://github.com/neumtry/neumai

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

ai chatgpt data data-engineering database embeddings etl llm llmops mlops ops pipeline python rag retrieval vector-database vectors

Last synced: 29 Oct 2025

https://github.com/NeumTry/NeumAI

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

ai chatgpt data data-engineering database embeddings etl llm llmops mlops ops pipeline python rag retrieval vector-database vectors

Last synced: 11 Apr 2025

https://github.com/onlyphantom/llm-python

Large Language Models (LLMs) tutorials & sample scripts, ft. langchain, openai, llamaindex, gpt, chromadb & pinecone

chromadb gpt-3 langchain langchain-python llamaindex llm llmops openai-api pinecone tutorial

Last synced: 15 May 2025

https://github.com/intentee/paddler

Stateful load balancer custom-tailored for llama.cpp 🏓🦙

ai llamacpp llm llmops load-balancer

Last synced: 06 Jan 2026

https://github.com/vllm-project/vllm-ascend

Community maintained hardware plugin for vLLM on Ascend

ascend inference llm llm-serving llmops mlops model-serving transformer vllm

Last synced: 27 Feb 2026

https://github.com/katanemo/archgw

Arch is an intelligent prompt gateway. Engineered with (fast) LLMs for the secure handling, robust observability, and seamless integration of prompts with your APIs - outside business logic. Built by the core contributors of Envoy proxy, on Envoy.

ai-gateway envoy envoyproxy gateway generative-ai llm-gateway llm-inference llm-routing llmops llms openai prompt proxy proxy-server routing

Last synced: 21 Oct 2025