awesome-llmops
  
  
    A curated list of tools, frameworks, platforms, and resources for Large Language Model Operations (LLMOps). 
    https://github.com/awesomelistsio/awesome-llmops
  
    
Overview & Learning
- LLMOps Guide (Weights & Biases) - High-level overview of LLMOps concepts and tools.
- LLMOps Field Guide (Fiddler)
- LangChain Cookbook
- Full Stack Deep Learning
 
Model Training & Fine-Tuning
- Hugging Face Transformers - Pre-trained and fine-tunable LLMs.
- PEFT - Parameter-Efficient Fine-Tuning methods for LLMs.
- LoRA - Fine-tuning strategy for large models.
- Colossal-AI
 
Evaluation & Benchmarking

Serving & Inference
- vLLM - Efficient inference for LLMs with continuous batching.
- TGI (Text Generation Inference) - High-performance inference server by Hugging Face.
- DeepSpeed MII - Low-latency inference for Hugging Face models.
- Ray Serve
 
Monitoring & Observability

Prompt Engineering & Management

Data Management
- Weaviate
- Pinecone - Vector database for retrieval-augmented generation.
- ChromaDB - Open-source embeddings DB built for LLMs.
- Label Studio - Open-source data labeling for fine-tuning and RAG pipelines.
 
Security & Safety
- Guardrails AI
- Rebuff - Open-source framework for prompt injection defense.
- Giskard
- OpenAI Moderation API
 
Platforms & Frameworks
- LangChain - Framework for end-to-end LLM-powered apps.
- LlamaIndex
- RAGStack (Haystack) - Retrieval-augmented generation framework.
- FastChat - Fine-tuning chat LLMs.
 
Tooling Ecosystem
- MLflow
- PromptLayer
- OpenLLM - Open-source platform to deploy and manage LLMs in production.
 
Related Awesome Lists
 