An open API service indexing awesome lists of open source software.

llmops

🚀 The Ultimate Curated List of LLMOps Tools, Frameworks, and Resources - A comprehensive collection of the best tools for Large Language Model Operations
https://github.com/pmady/llmops

Last synced: about 20 hours ago
JSON representation

  • Development Tools

    • Notebooks & Workspaces

    • IDEs & Code Assistants

      • GitHub Copilot
      • Cody
      • Continue - source AI code assistant | ![Stars](https://img.shields.io/github/stars/continuedev/continue?style=flat-square) |
      • Tabby - hosted AI coding assistant | ![Stars](https://img.shields.io/github/stars/TabbyML/tabby?style=flat-square) |
      • Cursor - first code editor | N/A |
  • LLMOps Platforms

    • Notebooks & Workspaces

      • Humanloop
      • Agenta - AI/agenta?style=flat-square) |
      • Dify - square) |
      • Pezzo - source LLMOps platform | ![Stars](https://img.shields.io/github/stars/pezzolabs/pezzo?style=flat-square) |
  • Resources & Learning

  • Prompt Engineering

    • Resources

      • Pinecone
      • pgvector - square) |
      • Weaviate - square) |
      • Milvus - native vector database | ![Stars](https://img.shields.io/github/stars/milvus-io/milvus?style=flat-square) |
      • Qdrant - square) |
      • Chroma - native embedding database | ![Stars](https://img.shields.io/github/stars/chroma-core/chroma?style=flat-square) |
      • FAISS - square) |
      • LanceDB - friendly vector database | ![Stars](https://img.shields.io/github/stars/lancedb/lancedb?style=flat-square) |
  • Models

    • Large Language Models

      • Falcon - performance open models | N/A | Apache 2.0 |
      • Gemma
      • Phi
      • Vicuna - tuning LLaMA | ![Stars](https://img.shields.io/github/stars/lm-sys/FastChat?style=flat-square) | Apache 2.0 |
      • Alpaca - following model | ![Stars](https://img.shields.io/github/stars/tatsu-lab/stanford_alpaca?style=flat-square) | Apache 2.0 |
      • BELLE - square) | Apache 2.0 |
      • ChatGLM - 6B?style=flat-square) | Apache 2.0 |
      • Qwen - square) | Apache 2.0 |
      • DeepSeek - effective open-source LLMs | ![Stars](https://img.shields.io/github/stars/deepseek-ai/DeepSeek-LLM?style=flat-square) | MIT |
      • Bloom - workshop/model_card?style=flat-square) | RAIL |
      • LLaMA - square) | Research |
      • Mistral - performance open models from Mistral AI | ![Stars](https://img.shields.io/github/stars/mistralai/mistral-src?style=flat-square) | Apache 2.0 |
    • Audio Foundation Models

    • Multimodal Models

      • LLaVA - liu/LLaVA?style=flat-square) |
      • Qwen-VL - language model from Alibaba | ![Stars](https://img.shields.io/github/stars/QwenLM/Qwen-VL?style=flat-square) |
      • MiniCPM-V - V?style=flat-square) |
  • Inference & Serving

    • Inference Platforms

      • LM Studio
      • LocalAI - compatible API for local models | ![Stars](https://img.shields.io/github/stars/mudler/LocalAI?style=flat-square) |
      • Ollama - square) |
      • Ray Serve - project/ray?style=flat-square) |
      • OpenLLM - square) |
      • GPUStack - square) |
    • Model Serving Frameworks

    • Inference Engines

      • vLLM - throughput and memory-efficient inference engine | ![Stars](https://img.shields.io/github/stars/vllm-project/vllm?style=flat-square) |
      • TensorRT-LLM - LLM?style=flat-square) |
      • LMDeploy - square) |
      • LoRAX - LoRA inference server | ![Stars](https://img.shields.io/github/stars/predibase/lorax?style=flat-square) |
      • CTranslate2 - square) |
      • Cortex.cpp - square) |
      • MInference - context LLM inference | ![Stars](https://img.shields.io/github/stars/microsoft/minference?style=flat-square) |
      • DeepSpeed-MII - latency inference powered by DeepSpeed | ![Stars](https://img.shields.io/github/stars/microsoft/DeepSpeed-MII?style=flat-square) |
      • llama.cpp - square) |
  • What's New

    • 🆕 Recently Added (January 2026)

      • Modal - Serverless platform for AI/ML workloads
      • PromptFoo - Test and evaluate LLM outputs
      • Composio - Integration platform for AI agents
      • Skypilot - Run LLMs on any cloud with one command
      • Traceloop - OpenTelemetry for LLMs
      • Ragas - Evaluation framework for RAG pipelines
      • LangWatch - LLM monitoring and analytics
      • Phidata - Build AI assistants with memory and knowledge
  • Acknowledgments

  • Training & Fine-Tuning

    • Fine-Tuning Tools

      • PEFT - Efficient Fine-Tuning | ![Stars](https://img.shields.io/github/stars/huggingface/peft?style=flat-square) |
      • LLaMA-Factory - tuning framework | ![Stars](https://img.shields.io/github/stars/hiyouga/LLaMA-Factory?style=flat-square) |
      • TRL - square) |
      • Unsloth - tuning | ![Stars](https://img.shields.io/github/stars/unslothai/unsloth?style=flat-square) |
      • LitGPT - tune, deploy LLMs | ![Stars](https://img.shields.io/github/stars/Lightning-AI/litgpt?style=flat-square) |
    • Experiment Tracking

      • MLflow - source ML lifecycle platform | ![Stars](https://img.shields.io/github/stars/mlflow/mlflow?style=flat-square) |
      • TensorBoard - square) |
      • Aim - to-use experiment tracker | ![Stars](https://img.shields.io/github/stars/aimhubio/aim?style=flat-square) |
      • Weights & Biases - square) |
    • Training Frameworks

  • Orchestration

    • Application Frameworks

      • LangChain - ai/langchain?style=flat-square) |
      • LlamaIndex - llama/llama_index?style=flat-square) |
      • Langfuse - source LLM engineering platform | ![Stars](https://img.shields.io/github/stars/langfuse/langfuse?style=flat-square) |
      • Semantic Kernel - kernel?style=flat-square) |
      • Haystack - to-end NLP framework | ![Stars](https://img.shields.io/github/stars/deepset-ai/haystack?style=flat-square) |
      • Neurolink - square) |
    • Agent Frameworks

      • AutoGPT - Gravitas/AutoGPT?style=flat-square) |
      • AutoGen - agent conversation framework | ![Stars](https://img.shields.io/github/stars/microsoft/autogen?style=flat-square) |
      • LangGraph - actor applications | ![Stars](https://img.shields.io/github/stars/langchain-ai/langgraph?style=flat-square) |
      • AgentMark - safe Markdown-based agents | ![Stars](https://img.shields.io/github/stars/puzzlet-ai/agentmark?style=flat-square) |
      • CrewAI - square) |
    • Workflow Management

      • Airflow - square) |
      • Flowise - square) |
      • Prefect - square) |
      • Flyte - native workflow automation | ![Stars](https://img.shields.io/github/stars/flyteorg/flyte?style=flat-square) |
  • Data Management

  • Optimization & Performance

    • Resources

      • ONNX Runtime - platform ML accelerator | ![Stars](https://img.shields.io/github/stars/microsoft/onnxruntime?style=flat-square) |
      • TVM - square) |
      • GPTQ-for-LLaMa - bit quantization for LLaMA | ![Stars](https://img.shields.io/github/stars/qwopqwop200/GPTQ-for-LLaMa?style=flat-square) |
      • BitsAndBytes - bit optimizers and quantization | ![Stars](https://img.shields.io/github/stars/TimDettmers/bitsandbytes?style=flat-square) |
      • AutoGPTQ - to-use LLM quantization | ![Stars](https://img.shields.io/github/stars/PanQiWei/AutoGPTQ?style=flat-square) |
  • Observability & Monitoring

    • Resources

      • PostHog - square) |
      • Evidently - square) |
      • DeepEval - ai/deepeval?style=flat-square) |
      • Phoenix - ai/phoenix?style=flat-square) |
      • Helicone - source LLM observability | ![Stars](https://img.shields.io/github/stars/Helicone/helicone?style=flat-square) |
      • OpenLIT - native LLM observability | ![Stars](https://img.shields.io/github/stars/openlit/openlit?style=flat-square) |
      • Lunary
  • Security & Safety

  • Star History