awesome-llmops
  
  
    A curated list of tools, frameworks, platforms, and resources for Large Language Model Operations (LLMOps). 
    https://github.com/awesomelistsio/awesome-llmops
  
        Last synced: about 19 hours ago 
        JSON representation
    
- 
            Overview & Learning- LLMOps Guide (Weights & Biases) - level overview of LLMOps concepts and tools.
- LLMOps Field Guide (Fiddler)
- LangChain Cookbook
- Full Stack Deep Learning
 
- 
            Model Training & Fine-Tuning- Hugging Face Transformers - trained and fine-tunable LLMs.
- PEFT - Efficient Fine-Tuning methods for LLMs.
- LoRA - tuning strategy for large models.
- Colossal-AI
 
- 
            Evaluation & Benchmarking
- 
            Serving & Inference- vLLM - efficient inference for LLMs with continuous batching.
- TGI (Text Generation Inference) - performance inference server by Hugging Face.
- DeepSpeed MII - latency inference for Hugging Face models.
- Ray Serve
 
- 
            Monitoring & Observability
- 
            Prompt Engineering & Management
- 
            Data Management- Weaviate
- Pinecone - augmented generation.
- ChromaDB - source embeddings DB built for LLMs.
- Label Studio - source data labeling for fine-tuning and RAG pipelines.
 
- 
            Security & Safety- Guardrails AI
- Rebuff - source framework for prompt injection defense.
- Giskard
- OpenAI Moderation API
 
- 
            Platforms & Frameworks- LangChain - to-end LLM-powered apps.
- LLamaIndex
- RAGStack (Haystack) - augmented generation framework.
- FastChat - tuning chat LLMs.
 
- 
            Tooling Ecosystem- MLflow
- PromptLayer
- OpenLLM - source platform to deploy and manage LLMs in production.
 
- 
            Related Awesome Lists
            Programming Languages
          
          
        
            Categories
          
          
        
            Sub Categories
          
          
            Keywords
          
          
              
                llm
                7
              
              
                deep-learning
                6
              
              
                llmops
                5
              
              
                mlops
                5
              
              
                pytorch
                5
              
              
                inference
                4
              
              
                transformer
                4
              
              
                language-model
                3
              
              
                machine-learning
                3
              
              
                fine-tuning
                2
              
              
                evaluation-framework
                2
              
              
                gpt
                2
              
              
                llama
                2
              
              
                llm-serving
                2
              
              
                rag
                2
              
              
                python
                2
              
              
                prompt-engineering
                2
              
              
                llm-eval
                2
              
              
                nlp
                2
              
              
                llm-evaluation
                2
              
              
                ci
                1
              
              
                explainable-ml
                1
              
              
                openai
                1
              
              
                neural-networks
                1
              
              
                amd
                1
              
              
                cuda
                1
              
              
                deepseek
                1
              
              
                hpu
                1
              
              
                inferentia
                1
              
              
                chatgpt
                1
              
              
                starcoder
                1
              
              
                model-serving
                1
              
              
                falcon
                1
              
              
                qwen
                1
              
              
                rocm
                1
              
              
                bloom
                1
              
              
                tpu
                1
              
              
                trainium
                1
              
              
                xpu
                1
              
              
                bert
                1
              
              
                flax
                1
              
              
                jax
                1
              
              
                language-models
                1
              
              
                model-hub
                1
              
              
                natural-language-processing
                1
              
              
                nlp-library
                1
              
              
                pretrained-models
                1
              
              
                pytorch-transformers
                1
              
              
                seq2seq
                1
              
              
                speech-recognition
                1