# The Practical Guides for Large Language Models



A curated (and still actively updated) list of practical guide resources for LLMs. It is based on our survey paper, [Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond](https://arxiv.org/abs/2304.13712), and on efforts from [@xinyadu](https://github.com/xinyadu). The survey is partially based on the second half of this [blog post](https://jingfengyang.github.io/gpt). We also build an evolutionary tree of modern Large Language Models (LLMs) to trace the development of language models in recent years and highlight some of the most well-known models.

These resources aim to help practitioners navigate the vast landscape of large language models (LLMs) and their use in natural language processing (NLP) applications. We also include usage restrictions based on model and data licensing information.
If you find any resources in our repository helpful, please feel free to use them (and don't forget to cite our paper! 😃). We welcome pull requests to refine this figure!



```bibtex
@article{yang2023harnessing,
title={Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond},
author={Jingfeng Yang and Hongye Jin and Ruixiang Tang and Xiaotian Han and Qizhang Feng and Haoming Jiang and Bing Yin and Xia Hu},
year={2023},
eprint={2304.13712},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
```

## Latest News 💥
- We added a usage and restrictions section.
- We used PowerPoint to plot the figure and released the source file [pptx](./source/figure_gif.pptx) for our GIF figure. [4/27/2023]
- We released the source file for the still version [pptx](./source/figure_still.pptx) and replaced the figure in this repo with the still version. [4/29/2023]
- We added AlexaTM, UniLM, and UniLMv2 to the figure, and corrected the logo for Tk. [4/29/2023]
- We added the usage and restrictions (for commercial and research purposes) section. Credits to [Dr. Du](https://github.com/xinyadu). [5/8/2023]

## Other Practical Guides for LLMs

- **Why did all of the public reproduction of GPT-3 fail? In which tasks should we use GPT-3.5/ChatGPT?** 2023, [Blog](https://jingfengyang.github.io/gpt)
- **Building LLM applications for production**, 2023, [Blog](https://huyenchip.com/2023/04/11/llm-engineering.html)
- **Data-centric Artificial Intelligence**, 2023, [Repo](https://github.com/daochenzha/data-centric-AI)/[Blog](https://towardsdatascience.com/what-are-the-data-centric-ai-concepts-behind-gpt-models-a590071bb727)/[Paper](https://arxiv.org/abs/2303.10158)

## Catalog
* [The Practical Guides for Large Language Models ](#the-practical-guides-for-large-language-models-)
* [Practical Guide for Models](#practical-guide-for-models)
* [BERT-style Language Models: Encoder-Decoder or Encoder-only](#bert-style-language-models-encoder-decoder-or-encoder-only)
* [GPT-style Language Models: Decoder-only](#gpt-style-language-models-decoder-only)
* [Practical Guide for Data](#practical-guide-for-data)
* [Pretraining data](#pretraining-data)
* [Finetuning data](#finetuning-data)
* [Test data/user data](#test-datauser-data)
* [Practical Guide for NLP Tasks](#practical-guide-for-nlp-tasks)
* [Traditional NLU tasks](#traditional-nlu-tasks)
* [Generation tasks](#generation-tasks)
* [Knowledge-intensive tasks](#knowledge-intensive-tasks)
* [Abilities with Scaling](#abilities-with-scaling)
* [Specific tasks](#specific-tasks)
   * [Real-World "Tasks"](#real-world-tasks)
* [Efficiency](#efficiency)
* [Trustworthiness](#trustworthiness)
* [Benchmark Instruction Tuning](#benchmark-instruction-tuning)
* [Alignment](#alignment)
* [Safety Alignment (Harmless)](#safety-alignment-harmless)
* [Truthfulness Alignment (Honest)](#truthfulness-alignment-honest)
* [Practical Guides for Prompting (Helpful)](#practical-guides-for-prompting-helpful)
     * [Alignment Efforts of Open-source Community](#alignment-efforts-of-open-source-community)
   * [Usage and Restrictions (Models and Data)](#usage-and-restrictions)

## Practical Guide for Models

### BERT-style Language Models: Encoder-Decoder or Encoder-only

- BERT **BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding**, 2018, [Paper](https://aclanthology.org/N19-1423.pdf)
- RoBERTa **RoBERTa: A Robustly Optimized BERT Pretraining Approach**, 2019, [Paper](https://arxiv.org/abs/1907.11692)
- DistilBERT **DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter**, 2019, [Paper](https://arxiv.org/abs/1910.01108)
- ALBERT **ALBERT: A Lite BERT for Self-supervised Learning of Language Representations**, 2019, [Paper](https://arxiv.org/abs/1909.11942)
- UniLM **Unified Language Model Pre-training for Natural Language Understanding and Generation**, 2019 [Paper](https://arxiv.org/abs/1905.03197)
- ELECTRA **ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators**, 2020, [Paper](https://openreview.net/pdf?id=r1xMH1BtvB)
- T5 **"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"**. *Colin Raffel et al.* JMLR 2019. [Paper](https://arxiv.org/abs/1910.10683)
- GLM **"GLM-130B: An Open Bilingual Pre-trained Model"**. 2022. [Paper](https://arxiv.org/abs/2210.02414)
- AlexaTM **"AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model"**. *Saleh Soltan et al.* arXiv 2022. [Paper](https://arxiv.org/abs/2208.01448)
- ST-MoE **ST-MoE: Designing Stable and Transferable Sparse Expert Models**. 2022 [Paper](https://arxiv.org/abs/2202.08906)

### GPT-style Language Models: Decoder-only

- GPT **Improving Language Understanding by Generative Pre-Training**. 2018. [Paper](https://s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf)
- GPT-2 **Language Models are Unsupervised Multitask Learners**. 2019. [Paper](https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf)
- GPT-3 **"Language Models are Few-Shot Learners"**. NeurIPS 2020. [Paper](https://arxiv.org/abs/2005.14165)
- OPT **"OPT: Open Pre-trained Transformer Language Models"**. 2022. [Paper](https://arxiv.org/abs/2205.01068)
- PaLM **"PaLM: Scaling Language Modeling with Pathways"**. *Aakanksha Chowdhery et al.* arXiv 2022. [Paper](https://arxiv.org/abs/2204.02311)
- BLOOM **"BLOOM: A 176B-Parameter Open-Access Multilingual Language Model"**. 2022. [Paper](https://arxiv.org/abs/2211.05100)
- MT-NLG **"Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model"**. 2022. [Paper](https://arxiv.org/abs/2201.11990)
- GLaM **"GLaM: Efficient Scaling of Language Models with Mixture-of-Experts"**. ICML 2022. [Paper](https://arxiv.org/abs/2112.06905)
- Gopher **"Scaling Language Models: Methods, Analysis & Insights from Training Gopher"**. 2021. [Paper](http://arxiv.org/abs/2112.11446v2)
- Chinchilla **"Training Compute-Optimal Large Language Models"**. 2022. [Paper](https://arxiv.org/abs/2203.15556)
- LaMDA **"LaMDA: Language Models for Dialog Applications"**. 2022. [Paper](https://arxiv.org/abs/2201.08239)
- LLaMA **"LLaMA: Open and Efficient Foundation Language Models"**. 2023. [Paper](https://arxiv.org/abs/2302.13971v1)
- GPT-4 **"GPT-4 Technical Report"**. 2023. [Paper](http://arxiv.org/abs/2303.08774v2)
- BloombergGPT **BloombergGPT: A Large Language Model for Finance**, 2023, [Paper](https://arxiv.org/abs/2303.17564)
- GPT-NeoX-20B: **"GPT-NeoX-20B: An Open-Source Autoregressive Language Model"**. 2022. [Paper](https://arxiv.org/abs/2204.06745)
- PaLM 2: **"PaLM 2 Technical Report"**. 2023. [Tech.Report](https://arxiv.org/abs/2305.10403)
- LLaMA 2: **"Llama 2: Open foundation and fine-tuned chat models"**. 2023. [Paper](https://arxiv.org/pdf/2307.09288)
- Claude 2: **"Model Card and Evaluations for Claude Models"**. 2023. [Model Card](https://www-files.anthropic.com/production/images/Model-Card-Claude-2.pdf)

## Practical Guide for Data

### Pretraining data
- **RedPajama**, 2023. [Repo](https://github.com/togethercomputer/RedPajama-Data)
- **The Pile: An 800GB Dataset of Diverse Text for Language Modeling**, Arxiv 2020. [Paper](https://arxiv.org/abs/2101.00027)
- **How does the pre-training objective affect what large language models learn about linguistic properties?**, ACL 2022. [Paper](https://aclanthology.org/2022.acl-short.16/)
- **Scaling laws for neural language models**, 2020. [Paper](https://arxiv.org/abs/2001.08361)
- **Data-centric artificial intelligence: A survey**, 2023. [Paper](https://arxiv.org/abs/2303.10158)
- **How does GPT Obtain its Ability? Tracing Emergent Abilities of Language Models to their Sources**, 2022. [Blog](https://yaofu.notion.site/How-does-GPT-Obtain-its-Ability-Tracing-Emergent-Abilities-of-Language-Models-to-their-Sources-b9a57ac0fcf74f30a1ab9e3e36fa1dc1)
### Finetuning data
- **Benchmarking zero-shot text classification: Datasets, evaluation and entailment approach**, EMNLP 2019. [Paper](https://arxiv.org/abs/1909.00161)
- **Language Models are Few-Shot Learners**, NeurIPS 2020. [Paper](https://proceedings.neurips.cc/paper/2020/hash/1457c0d6bfcb4967418bfb8ac142f64a-Abstract.html)
- **Does Synthetic Data Generation of LLMs Help Clinical Text Mining?** Arxiv 2023 [Paper](https://arxiv.org/abs/2303.04360)
### Test data/user data
- **Shortcut learning of large language models in natural language understanding: A survey**, Arxiv 2023. [Paper](https://arxiv.org/abs/2208.11857)
- **On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective** Arxiv, 2023. [Paper](https://arxiv.org/abs/2302.12095)
- **SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems** Arxiv 2019. [Paper](https://arxiv.org/abs/1905.00537)

## Practical Guide for NLP Tasks
We build a decision flow for choosing LLMs or fine-tuned models for users' NLP applications. The decision flow helps users assess whether their downstream NLP applications meet specific conditions and, based on that assessment, determine whether LLMs or fine-tuned models are the more suitable choice.
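
As a rough illustration of how such a decision flow can be operationalized, the following is a minimal, hypothetical Python sketch; the `TaskProfile` fields, their ordering, and the `choose_model` heuristic are illustrative assumptions, not the exact conditions from our figure or paper.

```python
# Hypothetical sketch of an LLM-vs-fine-tuned-model decision flow.
# The condition names and their ordering are illustrative only.
from dataclasses import dataclass


@dataclass
class TaskProfile:
    has_abundant_labeled_data: bool   # thousands of in-domain labeled examples?
    is_out_of_distribution: bool      # test data differs from available training data?
    needs_world_knowledge: bool       # knowledge-intensive QA, reasoning, etc.?
    needs_free_form_generation: bool  # summarization, translation, dialogue, etc.?
    cost_sensitive: bool              # tight latency or inference-budget constraints?


def choose_model(task: TaskProfile) -> str:
    """Return a coarse recommendation: prompt an LLM or fine-tune a smaller model."""
    if task.cost_sensitive and task.has_abundant_labeled_data:
        # Small fine-tuned models are usually cheaper to serve at scale.
        return "fine-tuned model"
    if task.needs_world_knowledge or task.needs_free_form_generation:
        return "LLM"
    if task.is_out_of_distribution or not task.has_abundant_labeled_data:
        # Zero-/few-shot prompting tends to cope better without in-domain labels.
        return "LLM"
    # Traditional NLU with plenty of in-domain labels: fine-tuning often still wins.
    return "fine-tuned model"


print(choose_model(TaskProfile(True, False, False, False, True)))   # fine-tuned model
print(choose_model(TaskProfile(False, True, True, True, False)))    # LLM
```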



### Traditional NLU tasks

- **A benchmark for toxic comment classification on civil comments dataset** Arxiv 2023 [Paper](https://arxiv.org/abs/2301.11125)
- **Is ChatGPT a general-purpose natural language processing task solver?** Arxiv 2023 [Paper](https://arxiv.org/abs/2302.06476)
- **Benchmarking large language models for news summarization** Arxiv 2023 [Paper](https://arxiv.org/abs/2301.13848)
### Generation tasks
- **News summarization and evaluation in the era of GPT-3** Arxiv 2022 [Paper](https://arxiv.org/abs/2209.12356)
- **Is ChatGPT a good translator? Yes with GPT-4 as the engine** Arxiv 2023 [Paper](https://arxiv.org/abs/2301.08745)
- **Multilingual machine translation systems from Microsoft for WMT21 shared task**, WMT 2021 [Paper](https://aclanthology.org/2021.wmt-1.54/)
- **Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT**, Arxiv 2023, [Paper](https://arxiv.org/pdf/2302.10198.pdf)

### Knowledge-intensive tasks
- **Measuring massive multitask language understanding**, ICLR 2021 [Paper](https://arxiv.org/abs/2009.03300)
- **Beyond the imitation game: Quantifying and extrapolating the capabilities of language models**, Arxiv 2022 [Paper](https://arxiv.org/abs/2206.04615)
- **Inverse scaling prize**, 2022 [Link](https://github.com/inverse-scaling/prize)
- **Atlas: Few-shot Learning with Retrieval Augmented Language Models**, Arxiv 2022 [Paper](https://arxiv.org/abs/2208.03299)
- **Large Language Models Encode Clinical Knowledge**, Arxiv 2022 [Paper](https://arxiv.org/abs/2212.13138)

### Abilities with Scaling

- **Training Compute-Optimal Large Language Models**, NeurIPS 2022 [Paper](https://openreview.net/pdf?id=iBBcRUlOAPR)
- **Scaling Laws for Neural Language Models**, Arxiv 2020 [Paper](https://arxiv.org/abs/2001.08361)
- **Solving math word problems with process- and outcome-based feedback**, Arxiv 2022 [Paper](https://arxiv.org/abs/2211.14275)
- **Chain of thought prompting elicits reasoning in large language models**, NeurIPS 2022 [Paper](https://arxiv.org/abs/2201.11903)
- **Emergent abilities of large language models**, TMLR 2022 [Paper](https://arxiv.org/abs/2206.07682)
- **Inverse scaling can become U-shaped**, Arxiv 2022 [Paper](https://arxiv.org/abs/2211.02011)
- **Towards Reasoning in Large Language Models: A Survey**, Arxiv 2022 [Paper](https://arxiv.org/abs/2212.10403)

### Specific tasks
- **Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks**, Arxiv 2022 [Paper](https://arxiv.org/abs/2208.10442)
- **PaLI: A Jointly-Scaled Multilingual Language-Image Model**, Arxiv 2022 [Paper](https://arxiv.org/abs/2209.06794)
- **AugGPT: Leveraging ChatGPT for Text Data Augmentation**, Arxiv 2023 [Paper](https://arxiv.org/abs/2302.13007)
- **Is gpt-3 a good data annotator?**, Arxiv 2022 [Paper](https://arxiv.org/abs/2212.10450)
- **Want To Reduce Labeling Cost? GPT-3 Can Help**, EMNLP findings 2021 [Paper](https://aclanthology.org/2021.findings-emnlp.354/)
- **GPT3Mix: Leveraging Large-scale Language Models for Text Augmentation**, EMNLP findings 2021 [Paper](https://aclanthology.org/2021.findings-emnlp.192/)
- **LLM for Patient-Trial Matching: Privacy-Aware Data Augmentation Towards Better Performance and Generalizability**, Arxiv 2023 [Paper](https://arxiv.org/abs/2303.16756)
- **ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks**, Arxiv 2023 [Paper](https://arxiv.org/abs/2303.15056)
- **G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment**, Arxiv 2023 [Paper](https://arxiv.org/abs/2303.16634)
- **GPTScore: Evaluate as You Desire**, Arxiv 2023 [Paper](https://arxiv.org/abs/2302.04166)
- **Large Language Models Are State-of-the-Art Evaluators of Translation Quality**, Arxiv 2023 [Paper](https://arxiv.org/abs/2302.14520)
- **Is ChatGPT a Good NLG Evaluator? A Preliminary Study**, Arxiv 2023 [Paper](https://arxiv.org/abs/2303.04048)

### Real-World "Tasks"
- **Sparks of Artificial General Intelligence: Early experiments with GPT-4**, Arxiv 2023 [Paper](https://arxiv.org/abs/2303.12712)

### Efficiency
1. Cost
- **OpenAI's GPT-3 language model: A technical overview**, 2020. [Blog Post](https://lambdalabs.com/blog/demystifying-gpt-3)
- **Measuring the carbon intensity of AI in cloud instances**, FAccT 2022. [Paper](https://dl.acm.org/doi/abs/10.1145/3531146.3533234)
- **In AI, is bigger always better?**, Nature Article 2023. [Article](https://www.nature.com/articles/d41586-023-00641-w)
- **Language Models are Few-Shot Learners**, NeurIPS 2020. [Paper](https://proceedings.neurips.cc/paper_files/paper/2020/file/1457c0d6bfcb4967418bfb8ac142f64a-Paper.pdf)
- **Pricing**, OpenAI. [Blog Post](https://openai.com/pricing)
2. Latency
- HELM: **Holistic evaluation of language models**, Arxiv 2022. [Paper](https://arxiv.org/abs/2211.09110)
3. Parameter-Efficient Fine-Tuning (a minimal LoRA usage sketch follows this list)
- **LoRA: Low-Rank Adaptation of Large Language Models**, Arxiv 2021. [Paper](https://arxiv.org/abs/2106.09685)
- **Prefix-Tuning: Optimizing Continuous Prompts for Generation**, ACL 2021. [Paper](https://aclanthology.org/2021.acl-long.353/)
- **P-Tuning: Prompt Tuning Can Be Comparable to Fine-tuning Across Scales and Tasks**, ACL 2022. [Paper](https://aclanthology.org/2022.acl-short.8/)
- **P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks**, Arxiv 2022. [Paper](https://arxiv.org/abs/2110.07602)
4. Pretraining System
- **ZeRO: Memory Optimizations Toward Training Trillion Parameter Models**, Arxiv 2019. [Paper](https://arxiv.org/abs/1910.02054)
- **Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism**, Arxiv 2019. [Paper](https://arxiv.org/abs/1909.08053)
- **Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM**, Arxiv 2021. [Paper](https://arxiv.org/abs/2104.04473)
- **Reducing Activation Recomputation in Large Transformer Models**, Arxiv 2022. [Paper](https://arxiv.org/abs/2205.05198)
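
To make the parameter-efficient fine-tuning entries above concrete, here is a minimal LoRA sketch using the Hugging Face `transformers` and `peft` libraries; the base checkpoint, target modules, and hyperparameters are illustrative placeholders rather than recommendations from the papers above.

```python
# Minimal LoRA sketch (assumes `transformers` and `peft` are installed).
# The checkpoint and hyperparameters below are illustrative placeholders.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = "EleutherAI/gpt-neo-125m"  # any causal LM checkpoint works here
model = AutoModelForCausalLM.from_pretrained(base)

lora_config = LoraConfig(
    r=8,                                  # rank of the low-rank update matrices
    lora_alpha=16,                        # scaling factor applied to the update
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the small LoRA matrices require gradients
```

The wrapped model can then be trained with a standard `transformers` training loop, typically with far fewer trainable parameters than full fine-tuning.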

### Trustworthiness
1. Robustness and Calibration
- **Calibrate before use: Improving few-shot performance of language models**, ICML 2021. [Paper](http://proceedings.mlr.press/v139/zhao21c.html)
- **SPeC: A Soft Prompt-Based Calibration on Mitigating Performance Variability in Clinical Notes Summarization**, Arxiv 2023. [Paper](https://arxiv.org/abs/2303.13035)

2. Spurious biases
- **Large Language Models Can be Lazy Learners: Analyze Shortcuts in In-Context Learning**, Findings of ACL 2023 [Paper](https://aclanthology.org/2023.findings-acl.284/)
- **Shortcut learning of large language models in natural language understanding: A survey**, 2023 [Paper](https://arxiv.org/abs/2208.11857)
- **Mitigating gender bias in captioning system**, WWW 2020 [Paper](https://dl.acm.org/doi/abs/10.1145/3442381.3449950)
- **Calibrate Before Use: Improving Few-Shot Performance of Language Models**, ICML 2021 [Paper](https://arxiv.org/abs/2102.09690)
- **Shortcut Learning in Deep Neural Networks**, Nature Machine Intelligence 2020 [Paper](https://www.nature.com/articles/s42256-020-00257-z)
- **Do Prompt-Based Models Really Understand the Meaning of Their Prompts?**, NAACL 2022 [Paper](https://aclanthology.org/2022.naacl-main.167/)

3. Safety issues
- **GPT-4 System Card**, 2023 [Paper](https://cdn.openai.com/papers/gpt-4-system-card.pdf)
- **The science of detecting llm-generated texts**, Arxiv 2023 [Paper](https://arxiv.org/pdf/2303.07205.pdf)
- **How stereotypes are shared through language: a review and introduction of the social categories and stereotypes communication (SCSC) framework**, Review of Communication Research, 2019 [Paper](https://research.vu.nl/en/publications/how-stereotypes-are-shared-through-language-a-review-and-introduc)
- **Gender shades: Intersectional accuracy disparities in commercial gender classification**, FAccT 2018 [Paper](https://proceedings.mlr.press/v81/buolamwini18a/buolamwini18a.pdf)

### Benchmark Instruction Tuning

- FLAN: **Finetuned Language Models Are Zero-Shot Learners**, Arxiv 2021 [Paper](https://arxiv.org/abs/2109.01652)
- T0: **Multitask Prompted Training Enables Zero-Shot Task Generalization**, Arxiv 2021 [Paper](https://arxiv.org/abs/2110.08207)
- **Cross-task generalization via natural language crowdsourcing instructions**, ACL 2022 [Paper](https://aclanthology.org/2022.acl-long.244.pdf)
- Tk-INSTRUCT: **Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks**, EMNLP 2022 [Paper](https://aclanthology.org/2022.emnlp-main.340/)
- FLAN-T5/PaLM: **Scaling Instruction-Finetuned Language Models**, Arxiv 2022 [Paper](https://arxiv.org/abs/2210.11416)
- **The Flan Collection: Designing Data and Methods for Effective Instruction Tuning**, Arxiv 2023 [Paper](https://arxiv.org/abs/2301.13688)
- **OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization**, Arxiv 2023 [Paper](https://arxiv.org/abs/2212.12017)

### Alignment

- **Deep Reinforcement Learning from Human Preferences**, NIPS 2017 [Paper](https://arxiv.org/abs/1706.03741)
- **Learning to summarize from human feedback**, Arxiv 2020 [Paper](https://arxiv.org/abs/2009.01325)
- **A General Language Assistant as a Laboratory for Alignment**, Arxiv 2021 [Paper](https://arxiv.org/abs/2112.00861)
- **Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback**, Arxiv 2022 [Paper](https://arxiv.org/abs/2204.05862)
- **Teaching language models to support answers with verified quotes**, Arxiv 2022 [Paper](https://arxiv.org/abs/2203.11147)
- InstructGPT: **Training language models to follow instructions with human feedback**, Arxiv 2022 [Paper](https://arxiv.org/abs/2203.02155)
- **Improving alignment of dialogue agents via targeted human judgements**, Arxiv 2022 [Paper](https://arxiv.org/abs/2209.14375)
- **Scaling Laws for Reward Model Overoptimization**, Arxiv 2022 [Paper](https://arxiv.org/abs/2210.10760)
- Scalable Oversight: **Measuring Progress on Scalable Oversight for Large Language Models**, Arxiv 2022 [Paper](https://arxiv.org/pdf/2211.03540.pdf)

#### Safety Alignment (Harmless)

- **Red Teaming Language Models with Language Models**, Arxiv 2022 [Paper](https://arxiv.org/abs/2202.03286)
- **Constitutional AI: Harmlessness from AI feedback**, Arxiv 2022 [Paper](https://arxiv.org/abs/2212.08073)
- **The Capacity for Moral Self-Correction in Large Language Models**, Arxiv 2023 [Paper](https://arxiv.org/abs/2302.07459)
- **OpenAI: Our approach to AI safety**, 2023 [Blog](https://openai.com/blog/our-approach-to-ai-safety)

#### Truthfulness Alignment (Honest)

- **Reinforcement Learning for Language Models**, 2023 [Blog](https://gist.github.com/yoavg/6bff0fecd65950898eba1bb321cfbd81)

#### Practical Guides for Prompting (Helpful)

- **OpenAI Cookbook**. [Blog](https://github.com/openai/openai-cookbook/blob/main/techniques_to_improve_reliability.md)
- **Prompt Engineering**. [Blog](https://lilianweng.github.io/posts/2023-03-15-prompt-engineering/)
- **ChatGPT Prompt Engineering for Developers!** [Course](https://www.deeplearning.ai/short-courses/chatgpt-prompt-engineering-for-developers/)
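
As a small, self-contained illustration of the techniques covered in these guides, the sketch below assembles a few-shot chain-of-thought prompt; the `FEW_SHOT_COT` template, the `build_prompt` helper, and the example questions are hypothetical, and the resulting string can be sent to any LLM API of your choice.

```python
# Hypothetical few-shot chain-of-thought prompt template; the examples are illustrative.
FEW_SHOT_COT = """\
Q: A cafe sold 23 coffees in the morning and 18 in the afternoon. How many in total?
A: Let's think step by step. Morning sales are 23 and afternoon sales are 18.
23 + 18 = 41. The answer is 41.

Q: {question}
A: Let's think step by step."""


def build_prompt(question: str) -> str:
    """Fill the few-shot template with the user's question."""
    return FEW_SHOT_COT.format(question=question)


print(build_prompt("A train travels 60 km per hour for 2.5 hours. How far does it go?"))
```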

#### Alignment Efforts of Open-source Community

- **Self-Instruct: Aligning Language Model with Self Generated Instructions**, Arxiv 2022 [Paper](https://arxiv.org/abs/2212.10560)
- **Alpaca**. [Repo](https://github.com/tatsu-lab/stanford_alpaca)
- **Vicuna**. [Repo](https://github.com/lm-sys/FastChat)
- **Dolly**. [Blog](https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm)
- **DeepSpeed-Chat**. [Blog](https://github.com/microsoft/DeepSpeedExamples/tree/master/applications/DeepSpeed-Chat)
- **GPT4All**. [Repo](https://github.com/nomic-ai/gpt4all)
- **OpenAssistant**. [Repo](https://github.com/LAION-AI/Open-Assistant)
- **ChatGLM**. [Repo](https://github.com/THUDM/ChatGLM-6B)
- **MOSS**. [Repo](https://github.com/OpenLMLab/MOSS)
- **Lamini**. [Repo](https://github.com/lamini-ai/lamini/)/[Blog](https://lamini.ai/blog/introducing-lamini)

## Usage and Restrictions

We build a table summarizing the usage restrictions of LLMs (e.g., for commercial and research purposes). In particular, we provide this information from the perspective of both the models and their pretraining data.
We urge users in the community to refer to the licensing information for public models and data, and to use them in a responsible manner.
We urge developers to pay special attention to licensing, and to make it transparent and comprehensive, to prevent any unwanted or unforeseen usage.

| LLMs | License | Commercial use | Other notable restrictions | Data license | Corpus |
|---|---|---|---|---|---|
| **Encoder-only** | | | | | |
| BERT series of models (general domain) | Apache 2.0 | ✅ | | Public | BooksCorpus, English Wikipedia |
| RoBERTa | MIT license | ✅ | | Public | BookCorpus, CC-News, OpenWebText, STORIES |
| ERNIE | Apache 2.0 | ✅ | | Public | English Wikipedia |
| SciBERT | Apache 2.0 | ✅ | | Public | BERT corpus, 1.14M papers from Semantic Scholar |
| LegalBERT | CC BY-SA 4.0 | ❌ | | Public (except data from the Case Law Access Project) | EU legislation, US court cases, etc. |
| BioBERT | Apache 2.0 | ✅ | | PubMed | PubMed, PMC |
| **Encoder-Decoder** | | | | | |
| T5 | Apache 2.0 | ✅ | | Public | C4 |
| Flan-T5 | Apache 2.0 | ✅ | | Public | C4, mixture of tasks (Fig. 2 in paper) |
| BART | Apache 2.0 | ✅ | | Public | RoBERTa corpus |
| GLM | Apache 2.0 | ✅ | | Public | BooksCorpus and English Wikipedia |
| ChatGLM | ChatGLM License | ❌ | No use for illegal purposes or military research; no harm to the public interest of society | N/A | 1T tokens of Chinese and English corpus |
| **Decoder-only** | | | | | |
| GPT-2 | Modified MIT License | ✅ | Use GPT-2 responsibly and clearly indicate your content was created using GPT-2 | Public | WebText |
| GPT-Neo | MIT license | ✅ | | Public | Pile |
| GPT-J | Apache 2.0 | ✅ | | Public | Pile |
| → Dolly | CC BY NC 4.0 | ❌ | | CC BY NC 4.0; subject to the terms of use of the data generated by OpenAI | Pile, Self-Instruct |
| → GPT4All-J | Apache 2.0 | ✅ | | Public | GPT4All-J dataset |
| Pythia | Apache 2.0 | ✅ | | Public | Pile |
| → Dolly v2 | MIT license | ✅ | | Public | Pile, databricks-dolly-15k |
| OPT | OPT-175B LICENSE AGREEMENT | ❌ | No development relating to surveillance research and military; no harm to the public interest of society | Public | RoBERTa corpus, the Pile, PushShift.io Reddit |
| → OPT-IML | OPT-175B LICENSE AGREEMENT | ❌ | Same as OPT | Public | OPT corpus, extended version of Super-NaturalInstructions |
| YaLM | Apache 2.0 | ✅ | | Unspecified | Pile, texts in Russian collected by the team |
| BLOOM | The BigScience RAIL License | ✅ | No generating verifiably false information with the purpose of harming others; no generating content without expressly disclaiming that the text is machine generated | Public | ROOTS corpus (Laurençon et al., 2022) |
| → BLOOMZ | The BigScience RAIL License | ✅ | Same as BLOOM | Public | ROOTS corpus, xP3 |
| Galactica | CC BY-NC 4.0 | ❌ | | N/A | The Galactica Corpus |
| LLaMA | Non-commercial bespoke license | ❌ | No development relating to surveillance research and military; no harm to the public interest of society | Public | CommonCrawl, C4, GitHub, Wikipedia, etc. |
| → Alpaca | CC BY NC 4.0 | ❌ | | CC BY NC 4.0; subject to the terms of use of the data generated by OpenAI | LLaMA corpus, Self-Instruct |
| → Vicuna | CC BY NC 4.0 | ❌ | | Subject to the terms of use of the data generated by OpenAI; privacy practices of ShareGPT | LLaMA corpus, 70K conversations from ShareGPT.com |
| → GPT4All | GPL-licensed LLaMA | ❌ | | Public | GPT4All dataset |
| OpenLLaMA | Apache 2.0 | ✅ | | Public | RedPajama |
| CodeGeeX | The CodeGeeX License | ❌ | No use for illegal purposes or military research | Public | Pile, CodeParrot, etc. |
| StarCoder | BigCode OpenRAIL-M v1 license | ✅ | No generating verifiably false information with the purpose of harming others; no generating content without expressly disclaiming that the text is machine generated | Public | The Stack |
| MPT-7B | Apache 2.0 | ✅ | | Public | mC4 (English), The Stack, RedPajama, S2ORC |
| Falcon | TII Falcon LLM License | ✅/❌ | Available under a license allowing commercial use | Public | RefinedWeb |

Rows marked with → are models derived (e.g., by instruction tuning) from the base model listed above them. "License" and "Commercial use" refer to the model; "Data license" and "Corpus" refer to its pretraining data.

## Star History

[![Star History Chart](https://api.star-history.com/svg?repos=Mooler0410/LLMsPracticalGuide&type=Date)](https://star-history.com/#Mooler0410/LLMsPracticalGuide&Date)