awesome-LLMs-finetuning
  
  
    Collection of resources for finetuning Large Language Models (LLMs).  
    https://github.com/pdaicode/awesome-LLMs-finetuning
  
        Last synced: 6 days ago 
        JSON representation
    
- 
            1. LLM Performance & Concepts- 
                    
                    - Chatbot Arena Leaderboard - a benchmark platform for large language models (LLMs) that features anonymous, randomized battles in a crowdsourced manner.
- AlpacaEval Leaderboard - An Automatic Evaluator for Instruction-following Language Models
- Open Ko-LLM Leaderboard - The Open Ko-LLM Leaderboard objectively evaluates the performance of Korean Large Language Model (LLM).
- Yet Another LLM Leaderboard - Leaderboard made with LLM AutoEval using Nous benchmark suite.
- Open LLM Leaderboard - aims to track, rank and evaluate LLMs and chatbots as they are released.
- OpenCompass 2.0 LLM Leaderboard - OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
 
- 
                    Kaggle & Colab Notebooks
- 
                    Courses & Lectures
 
- 
                    
                    
- 
            3. LLM and Applications
- 
            2. LLM Backbones- 
                    Blogs- InternVL
- InternLM2
- Qwen
- Vicuna-13B - source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT.
- OPT - trained Transformer Language Models by Meta AI. OPT is a series of open-sourced large causal language models which perform similar in performance to GPT3.
- Gemma - based large language model developed by Google AI (2B, 7B).
- Chinchilla
- Adept
- Fuyu
- PaLM - based large language model developed by Google AI.
- Mistral AI
- PanGu-α - PanGu-α is a 200B parameter autoregressive pretrained Chinese language model develped by Huawei Noah's Ark Lab, MindSpore Team and Peng Cheng Laboratory.
 
- 
                    Multi-Modal LLMs
 
- 
                    
- 
            4. Fine-Tuning- 
                    Papers
- 
                    Frameworks- Ollama
- LlamaIndex
- Petals - style. Fine-tuning and inference up to 10x faster than offloading. (7768 stars)
- LLaMA-Factory - to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM3). (5532 stars)
- H2O LLM Studio - code GUI for fine-tuning LLMs. Documentation: [https://h2oai.github.io/h2o-llmstudio/](https://h2oai.github.io/h2o-llmstudio/) (2880 stars)
- Phoenix - Evaluate, troubleshoot, and fine tune your LLM, CV, and NLP models in a notebook. (1596 stars)
- LLM-Adapters - Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models". (769 stars)
- Platypus - tuning Platypus fam LLMs using LoRA. (589 stars)
- xtuner - tuning LLM (InternLM, Llama, Baichuan, QWen, ChatGLM2). (540 stars)
- DB-GPT-Hub - tuning techniques for DB-GPT, with the purpose of enhancing model performance, especially in Text-to-SQL, and achieved higher exec acc than GPT-4 in spider eval with 13B LLM used this project. (422 stars)
- LLM-Finetuning-Hub - tuning and deployment scripts along with our research findings. :star: 416
- Finetune_LLMs - tuning Casual LLMs. :star: 391
- llmware - grade LLM-based development framework, tools, and fine-tuned models. :star: 289
- LLM-Kit
- h2o-wizardlm - Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning. :star: 228
- llm_qlora - tuning LLMs using QLoRA. :star: 136
- awesome-llm-human-preference-datasets - tuning, RLHF, and eval. :star: 124
- llm_finetuning - tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes). :star: 114
- MFTCoder - task fine-tuning framework for Code LLMs; 业内首个高精度、高效率、多任务、多模型支持、多训练算法,大模型代码能力微调框架. :star: 337
- hcgf - model Fine-tuning | LLM微调. :star: 196
 
 
- 
                    
- 
            This repo is based on the following resources- 
                    Frameworks
 
- 
                    
- 
            5. Tools & Software- 
                    Frameworks- NVIDIA NeMo - of-the-art conversational AI models and specifically designed for Linux. 🚀
- H2O LLM Studio - code GUI tool for fine-tuning large language models on Windows. 🎛️
- Petals - 176B collaboratively, allowing you to load a small part of the model and team up with others for inference or fine-tuning. 🌸
- LLaMA Efficient Tuning - to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon).
- H2O LLM Studio - code GUI for fine-tuning LLMs.
- PEFT - Efficient Fine-Tuning (PEFT) methods for efficient adaptation of pre-trained language models to downstream applications.
- ChatGPT-like model - like model locally on your device.
- promptfoo
- bert4torch - source large model weights for reasoning and fine-tuning. 🔥
- Alpaca.cpp - like model locally on your device. A combination of the LLaMA foundation model and an open reproduction of Stanford Alpaca for instruction-tuned fine-tuning. 🦙
- Ludwig AI - code framework for building custom LLMs and other deep neural networks. Easily train state-of-the-art LLMs with a declarative YAML configuration file. 🤖
 
 
- 
                    
            Programming Languages
          
          
        
            Categories
          
          
        
            Sub Categories
          
          
        
            Keywords
          
          
              
                llm
                16
              
              
                gpt
                8
              
              
                large-language-models
                8
              
              
                fine-tuning
                8
              
              
                llama
                7
              
              
                chatbot
                6
              
              
                llama2
                6
              
              
                nlp
                4
              
              
                rlhf
                4
              
              
                chatgpt
                3
              
              
                pretrained-models
                3
              
              
                datasets
                3
              
              
                llama3
                3
              
              
                llm-training
                3
              
              
                llava
                3
              
              
                qwen
                3
              
              
                falcon
                3
              
              
                agents
                3
              
              
                qlora
                2
              
              
                finetuning
                2
              
              
                phi3
                2
              
              
                peft
                2
              
              
                lora
                2
              
              
                mistral
                2
              
              
                llms
                2
              
              
                chatglm
                2
              
              
                ai
                2
              
              
                agent
                2
              
              
                chatglm2
                2
              
              
                llamaindex
                2
              
              
                multimodal
                2
              
              
                mixtral
                2
              
              
                machine-learning
                2
              
              
                deep-learning
                2
              
              
                instruction-tuning
                2
              
              
                gpt-4
                2
              
              
                flash-attention
                2
              
              
                foundation-models
                2
              
              
                chinese
                2
              
              
                vision-language-model
                2
              
              
                image-text-retrieval
                1
              
              
                application
                1
              
              
                multi-modal
                1
              
              
                semantic-segmentation
                1
              
              
                video-classification
                1
              
              
                vit-22b
                1
              
              
                phi4
                1
              
              
                vit-6b
                1
              
              
                ollama
                1
              
              
                fine-tuning-llm
                1