Awesome-LLMOps
🎉 An awesome & curated list of best LLMOps tools.
https://github.com/InftyAI/Awesome-LLMOps
-
Inference
-
Inference Engine
- DeepSpeed-MII - MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
- ipex-llm - Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
- LMDeploy - LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
- llama.cpp - LLM inference in C/C++.
- MInference - To speed up long-context LLMs' inference, MInference uses approximate and dynamic sparse attention calculation, which reduces inference latency by up to 10x for pre-filling on an A100 while maintaining accuracy.
- MLC LLM - Universal LLM deployment engine with ML compilation.
- Ollama - Get up and running with DeepSeek-R1, Phi-4, Gemma 3, and other large language models.
- OpenLLM - Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI-compatible API endpoints in the cloud.
- Ratchet - A cross-platform browser ML framework.
- MLServer - An inference server for your machine learning models, including support for multi-model serving and more.
- SGLang - SGLang is a fast serving framework for large language models and vision language models.
- Triton Inference Server - The Triton Inference Server provides an optimized cloud and edge inferencing solution.
- vLLM - A high-throughput and memory-efficient inference and serving engine for LLMs.
- zml - High-performance AI inference stack, written in Zig.
- Text Generation Inference - Large Language Model Text Generation Inference.
- web-llm - High-performance in-browser LLM inference engine.
- TinyGrad - You like pytorch? You like micrograd? You love tinygrad! ❤️
- Xinference - Run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
- Nvidia Dynamo - A datacenter-scale distributed inference serving framework.
- Llumnix - Efficient and easy multi-instance LLM serving.
- OpenVINO - An open-source toolkit for optimizing and deploying AI inference.
- transformers.js - State-of-the-art machine learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
- LoRAX - Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs.
- Cortex.cpp - A local AI API platform.
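Several of the local engines above are driven by small config files rather than code. As an illustration, a minimal Ollama Modelfile (the base model, parameter value, and system prompt here are hypothetical choices, not taken from this list) that packages a base model with a system prompt:

```
# Modelfile (illustrative): derive a custom model from a base model
FROM llama3.2
PARAMETER temperature 0.7
SYSTEM You are a concise assistant for Kubernetes questions.
```

Building and running it would then look like `ollama create my-assistant -f Modelfile` followed by `ollama run my-assistant`.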
-
Benchmark
- Inference Benchmark - A benchmarking tool for LLM inference performance.
- Inference Perf - GenAI inference performance benchmarking tool.
-
LLM Router
- LiteLLM - Python SDK and proxy server (LLM gateway) to call 100+ LLM APIs in OpenAI format [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq].
- RouteLLM - A framework for serving and evaluating LLM routers; save LLM costs without compromising quality.
- AI Gateway - A fast AI gateway that routes requests to 250+ LLMs through a unified API.
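As a sketch of what routing through a gateway like LiteLLM can look like (model names and environment-variable references are illustrative, not taken from this list): one public model name is mapped to multiple providers so the proxy can load-balance or fail over between them.

```yaml
# litellm proxy config (illustrative)
model_list:
  - model_name: gpt-4o               # the name clients request
    litellm_params:
      model: openai/gpt-4o
      api_key: os.environ/OPENAI_API_KEY
  - model_name: gpt-4o               # same public name, second backend
    litellm_params:
      model: azure/my-gpt4o-deployment   # hypothetical Azure deployment
      api_base: os.environ/AZURE_API_BASE
      api_key: os.environ/AZURE_API_KEY
```

Starting the proxy with `litellm --config config.yaml` then exposes an OpenAI-compatible endpoint in front of both backends.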
-
Inference Platform
- llmaz - Easy, advanced inference platform for large language models on Kubernetes.
- llm-d - llm-d is a Kubernetes-native high-performance distributed LLM inference framework.
- Kserve - Standardized Serverless ML Inference Platform on Kubernetes.
- Mooncake - Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
- LMCache - Fast long-context LLM serving via smart KV cache optimizations.
- AIBrix - Cost-efficient and pluggable infrastructure components for GenAI inference.
- KubeAI - AI inference operator for Kubernetes, serving LLMs, embeddings, and speech-to-text.
- Kaito - Kubernetes AI toolchain operator for large-model inference and fine-tuning, with GPU auto-provisioning, container-based hosting, and CRD-based orchestration.
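For Kubernetes-native platforms like those above, deployment is typically declarative. A rough sketch of a KServe InferenceService using its Hugging Face serving runtime (the resource name, model ID, and resource values are illustrative assumptions, not from this list):

```yaml
# Illustrative KServe InferenceService for an LLM
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: llama-demo                  # hypothetical name
spec:
  predictor:
    model:
      modelFormat:
        name: huggingface
      args:
        - --model_name=llama3
        - --model_id=meta-llama/meta-llama-3-8b-instruct
      resources:
        limits:
          nvidia.com/gpu: "1"
```

Applying this with `kubectl apply -f` would have the platform pull the model and expose it behind a generated endpoint.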
-
- Inference - An easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
- Nanoflow - A throughput-oriented high-performance serving framework for LLMs.
- RayServe - Ray is a unified framework for scaling AI and Python applications, consisting of a core distributed runtime and a set of AI libraries for accelerating ML workloads.
- TensorRT-LLM - TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
-
AI Gateway
- Kong - The Cloud-Native API Gateway and AI Gateway.
- gateway-api-inference-extension - A Kubernetes SIGs project extending the Gateway API for inference workloads.
- APISIX - The Cloud-Native API Gateway and AI Gateway with extensive plugin system and AI capabilities.
- Envoy AI Gateway - An open-source project for using Envoy Gateway to handle request traffic from application clients to GenAI services.
- Higress - AI-native API gateway.
- kgateway - The Cloud-Native API Gateway and AI Gateway.
-
Output
- Instructor - Structured outputs for LLMs.
- Outlines - Structured text generation.
-
-
Orchestration
-
Agent
- Qwen-Agent - Agent framework and applications built upon Qwen, featuring function calling, MCP, code interpreter, RAG, browser extension, etc.
- LangChain - 🦜🔗 Build context-aware reasoning applications.
- LlamaIndex - LlamaIndex is the leading framework for building LLM-powered agents over your data.
- AutoGPT - AutoGPT is the vision of accessible AI for everyone, to use and to build on.
- autogen - A programming framework for agentic AI. PyPi: autogen-agentchat; Discord: https://aka.ms/autogen-discord; Office Hour: https://aka.ms/autogen-officehour
- SWE-agent - SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
- fast-agent - Define, prompt, and test MCP-enabled agents and workflows.
- Magentic-UI - A research prototype of a human-centered web agent.
- crewAI - Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
- MetaGPT - The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming.
- PydanticAI - Agent framework / shim to use Pydantic with LLMs.
- Agno - Build multimodal AI agents with memory, knowledge, and tools; simple, fast, and model-agnostic.
- LangGraph - Build resilient language agents as graphs.
- OpenManus - An open-source framework for building general AI agents.
- Swarm - Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
- Semantic Kernel - Integrate cutting-edge LLM technology quickly and easily into your apps.
- CAMEL - The first and the best multi-agent framework. Finding the Scaling Law of Agents.
- kagent - Cloud-native agentic AI.
- Agent Development Kit (ADK) - An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
- Codex - Lightweight coding agent that runs in your terminal.
- OpenAI Agents SDK - A lightweight, powerful framework for multi-agent workflows.
- Suna - Open-source generalist AI agent.
-
Workflow
- FastGPT - FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.
- Dify - Dify is an open-source LLM app development platform. Its intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
- Flowise - Drag & drop UI to build your customized LLM flow.
- Haystack - AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
- Inference - An easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
-
Tools
- Mem0 - The memory layer for AI agents.
- Browser Use - Make websites accessible for AI agents.
- Graphiti - Build real-time knowledge graphs for AI agents.
- OpenAI CUA - Sample app demonstrating OpenAI's Computer-Using Agent.
-
RAG
- GraphRAG - A modular graph-based Retrieval-Augmented Generation (RAG) system.
- RAGFlow - RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
- LightRAG - "LightRAG: Simple and Fast Retrieval-Augmented Generation".
-
-
Runtime
-
Chatbot
- FastChat - An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
- AnythingLLM - The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, no-code agent builder, MCP compatibility, and more.
- kubectl-ai - AI-powered Kubernetes assistant.
- LLM - Access large language models from the command-line.
- PrivateGPT - Interact with your documents using the power of GPT, 100% privately, with no data leaks.
- NextChat - Light and fast AI assistant with cross-platform support (Web, iOS, macOS, Android, Linux, Windows).
- 5ire - 5ire is a cross-platform desktop AI assistant and MCP client. It is compatible with major service providers and supports local knowledge bases and tools via Model Context Protocol servers.
- Chatbot UI - An open-source AI chat UI.
- Cherry Studio - A desktop client that supports multiple LLM providers, including models such as DeepSeek-R1.
- Jan - Jan is an open-source alternative to ChatGPT that runs 100% offline on your computer.
- Lobe Chat - An open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), knowledge base (file upload / knowledge management / RAG), multi-modals (plugins / artifacts) and thinking. One-click FREE deployment of your private ChatGPT / Claude / DeepSeek application.
- Gradio - Build and share delightful machine learning apps, all in Python.
- Open WebUI - User-friendly AI interface (supports Ollama, OpenAI API, ...).
- Chat SDK - A full-featured, hackable Next.js AI chatbot built by Vercel.
-
Database
- Faiss - A library for efficient similarity search and clustering of dense vectors.
- weaviate - Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
- milvus - Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search.
- deeplake - Database for AI: store vectors, images, texts, videos, and more, and stream data in real time to PyTorch/TensorFlow.
- chroma - The AI-native open-source embedding database.
-
Observation
- OpenLLMetry - Open-source observability for your LLM application, based on OpenTelemetry.
- OpenLIT - OpenTelemetry-native LLM observability and GPU monitoring.
- phoenix - AI observability and evaluation.
- Helicone - Open-source LangSmith alternative for logging, monitoring, and debugging AI applications.
- wandb - The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
- Langfuse - Open-source LLM engineering platform: observability, metrics, evals, and prompt management.
-
Code Assistant
- Auto-dev - AutoDev: The AI-powered coding wizard with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Document/Agent feature 🧪 included! 🚀
- Codefuse-chatbot - An intelligent assistant serving the entire software development lifecycle, powered by a multi-agent framework, working with DevOps toolkits, code & doc repo RAG, etc.
- Cody - AI coding assistant from Sourcegraph.
- Continue - Create, share, and use custom AI code assistants with open-source IDE extensions and a hub of models, rules, prompts, docs, and other building blocks.
- Sweep - Open-source AI-powered software developer for small features and bug fixes.
- Tabby - Self-hosted AI coding assistant.
-
Development Environment
- E2B - Secure open-source cloud runtime for AI apps and AI agents.
- Daytona - Secure and elastic infrastructure for running AI-generated code.
-
-
Training
-
Workflow
- Flyte - Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML, and analytics stacks.
- BentoML - The easiest way to serve AI apps and models: build model inference APIs, job queues, LLM apps, multi-model pipelines, and more!
- Kubeflow - Machine Learning Toolkit for Kubernetes.
- MLflow - Open-source platform for the machine learning lifecycle.
- ZenML - Build portable, production-ready MLOps pipelines.
- Ray - Ray is a unified framework for scaling AI and Python applications, consisting of a core distributed runtime and a set of AI libraries for accelerating ML workloads.
- Metaflow - Build and manage real-life ML, AI, and data science projects with ease.
- Polyaxon - MLOps tools for managing and orchestrating the machine learning lifecycle.
- Seldon-Core - An MLOps framework to package, deploy, monitor, and manage thousands of production machine learning models.
-
Framework
- MaxText - A simple, performant, and scalable JAX LLM.
- ColossalAI - Making large AI models cheaper, faster, and more accessible.
- Ludwig - Low-code framework for building custom LLMs, neural networks, and other AI models.
- MLX - MLX: An array framework for Apple silicon.
- AXLearn - An extensible deep learning library.
- Candle - Minimalist ML framework for Rust.
- DLRover - DLRover: An automatic distributed deep learning system.
-
FineTune
- torchtune - PyTorch-native post-training library.
- unsloth - Fine-tune DeepSeek-R1 and other reasoning LLMs 2x faster with 70% less memory! 🦥
- Axolotl - Go ahead and axolotl questions.
- LLaMa-Factory - Unified efficient fine-tuning of 100+ LLMs & VLMs (ACL 2024).
- Swift - Use PEFT or full-parameter training to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
- maestro - A streamlined tool to accelerate the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL.
- EasyLM - A one-stop solution for pre-training, finetuning, evaluating, and serving LLMs in JAX/Flax.
- LMFlow - An extensible toolkit for finetuning and inference of large foundation models.
- MLX-VLM - MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
- Transformer Lab - Open-source application to interact with, train, fine-tune, and evaluate large language models on your own computer.
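Most of the fine-tuning tools above are configuration-driven. As a sketch of what this looks like with a tool such as Axolotl (the base model, dataset path, and hyperparameters below are illustrative assumptions, not recommendations):

```yaml
# Axolotl config sketch: QLoRA fine-tune of a small base model
base_model: meta-llama/Llama-3.2-1B
load_in_4bit: true
adapter: qlora
lora_r: 16
lora_alpha: 32
lora_dropout: 0.05
lora_target_modules: [q_proj, k_proj, v_proj, o_proj]
datasets:
  - path: ./data/train.jsonl   # hypothetical dataset
    type: alpaca
sequence_len: 2048
micro_batch_size: 2
gradient_accumulation_steps: 4
num_epochs: 3
learning_rate: 0.0002
output_dir: ./outputs/llama-qlora
```

A run would then be launched with something like `accelerate launch -m axolotl.cli.train config.yaml`, with the adapter weights written to `output_dir`.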
-
Evaluation
- AgentBench - A comprehensive benchmark to evaluate LLMs as agents (ICLR'24).
- LongBench - A bilingual, multitask benchmark for long-context understanding.
- lm-evaluation-harness - A framework for few-shot evaluation of language models.
- LiveBench - A challenging, contamination-free LLM benchmark.
- OpenCompass - OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc) over 100+ datasets.
- opik - Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
-
-
Alignment
- OpenRLHF - An easy-to-use, scalable, and high-performance RLHF framework (70B+ PPO full tuning & iterative DPO & LoRA & RingAttention & RFT).
- Self-RLHF - Safe RLHF: constrained value alignment via safe reinforcement learning from human feedback.
-
-
GPU
-
Scheduling
- Project-HAMi - Heterogeneous AI Computing Virtualization Middleware (CNCF Sandbox Project).
- KAI Scheduler - KAI Scheduler is an open-source Kubernetes-native scheduler for AI workloads at large scale.
-
Management
- NVIDIA GPU Operator - NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes.
-
-
-
MCP
-
MCP Client
- awesome-mcp-clients - A collection of MCP clients.
-
MCP Server
- awesome-mcp-servers - A collection of MCP servers.
- mcp-directory - A directory of MCP servers.
- Cline MCP Marketplace
- BaiLian MCP
- Docker MCP Catalog - A curated catalog of high-quality MCP servers as Docker images, spanning database solutions, developer tools, productivity platforms, and API integrations.
- Higress MCP Marketplace
- MCPMarket
- ModelScope MCP
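MCP servers from catalogs like these are typically wired into a client via a small JSON config. A representative sketch in the widely used `mcpServers` format (the server name, package, and allowed path are illustrative):

```json
{
  "mcpServers": {
    "filesystem": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-filesystem", "/path/to/allowed/dir"]
    }
  }
}
```

The client launches each configured server as a subprocess and exposes its tools to the model over the Model Context Protocol.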
-
7