Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/tigerchen52/awesome_role_of_small_models

a curated list of the role of small models in the LLM era
https://github.com/tigerchen52/awesome_role_of_small_models

List: awesome_role_of_small_models

Last synced: about 2 months ago
JSON representation

a curated list of the role of small models in the LLM era

Awesome Lists containing this project

README

        

# The Role of Small Models
[![Awesome](https://awesome.re/badge.svg)]()
[![PDF](https://img.shields.io/badge/PDF-2409.06857-green)](https://arxiv.org/abs/2409.06857)
![GitHub License](https://img.shields.io/github/license/tigerchen52/role_of_small_models)
![](https://img.shields.io/badge/PRs-Welcome-red)

This work is ongoing, and we welcome any comments or suggestions.

Please feel free to reach out if you find we have overlooked any relevant papers.


What is the Role of Small Models in the LLM Era: A Survey


Lihu Chen1&nbsp&nbsp
Gaël Varoquaux2&nbsp&nbsp


1 Imperial College London, UK &nbsp&nbsp
2 Soda, Inria Saclay, France &nbsp&nbsp







## Content List
- [Collaboration](#collaboration)
- [SMs Enhance LLMs](#sms-enhance-llms)
- [Data Curation](#data-curation)
- [Curating pre-training data](#curating-pre-training-data)
- [Curating Instruction-tuning Data](#curating-instruction-tuning-data)
- [Weak-to-Strong Paradigm](#weak-to-strong-paradigm)
- [Efficient Inference](#efficient-inference)
- [Ensembling different-size models to reduce inference costs](#ensembling-different-size-models-to-reduce-inference-costs)
- [Speculative Decoding](#speculative-decoding)
- [Evaluating LLMs](#evaluating-llms)
- [Domain Adaptation](#domain-adaptation)
- [Using domain-specific SMs to generate knowledge for LLMs at reasoning time](#using-domain-specific-sms-to-generate-knowledge-for-llms-at-reasoning-time)
- [Using domain-specific SMs to adjust token probability of LLMs at decoding time](#using-domain-specific-sms-to-adjust-token-probability-of-llms-at-decoding-time)
- [Retrieval Augmented Generation](#retrieval-augmented-generation)
- [Prompt-based Reasoning](#prompt-based-reasoning)
- [Deficiency Repair](#deficiency-repair)
- [Developing SM plugins to repair deficiencies](#developing-sm-plugins-to-repair-deficiencies)
- [Contrasting LLMs and SMs for better generations](#contrasting-llms-and-sms-for-better-generations)
- [LLMs Enhance SMs](#llms-enhance-sms)
- [Knowledge Distillation](#knowledge-distillation)
- [Black-box Distillation](#black-box-distillation)
- [White-box distillation](#white-box-distillation)
- [Data Synthesis](#data-synthesis)
- [Data Augmentation](#data-augmentation)
- [Training Data Generation](#training-data-generation)
- [Competition](#competition)
- [Computation-constrained Environment](#computation-constrained-environment)
- [Task-specific Environment](#task-specific-environment)
- [Interpretability-required Environment](#interpretability-required-environment)

# Collaboration


## SMs Enhance LLMs

### Data Curation

#### Curating pre-training data


Title
Topic
Venue
Code


Data selection for language models via importance resampling
Data Selection

PDF Badge


PDF Badge



When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale
Data Selection

PDF Badge




DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining
Data Reweighting

PDF Badge


PDF Badge

#### Curating Instruction-tuning Data


Title
Topic
Venue
Code


MoDS: Model-oriented Data Selection for Instruction Tuning
Data Selection

PDF Badge


PDF Badge



LESS: Selecting Influential Data for Targeted Instruction Tuning
Data Selection

PDF Badge


PDF Badge

### Weak-to-Strong Paradigm

#### Using weaker (smaller) models to align stronger (larger) models


Title
Topic
Venue
Code


Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Weak-to-Strong

PDF Badge


PDF Badge

Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
Weak-to-Strong

PDF Badge


PDF Badge

Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts
Weak-to-Strong

PDF Badge


PDF Badge

Improving Weak-to-Strong Generalization with Reliability-Aware Alignment
Weak-to-Strong

PDF Badge


PDF Badge

Aligner: Efficient Alignment by Learning to Correct
Weak-to-Strong

PDF Badge


PDF Badge

Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Weak-to-Strong

PDF Badge


PDF Badge

### Efficient Inference

#### Ensembling different-size models to reduce inference costs


Title
Topic
Venue
Code


Efficient Edge Inference by Selective Query
Model Cascading

PDF Badge


PDF Badge



FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance
Model Cascading

PDF Badge




Data Shunt: Collaboration of Small and Large Models for Lower Costs and Better Performance
Model Cascading

PDF Badge


PDF Badge

AutoMix: Automatically Mixing Language Models
Model Cascading

PDF Badge


PDF Badge

Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models
Model Routing

PDF Badge

Tryage: Real-time, intelligent Routing of User Prompts to Large Language Models
Model Routing

PDF Badge

OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking
Model Routing

PDF Badge

RouteLLM: Learning to Route LLMs with Preference Data
Model Routing

PDF Badge


PDF Badge

Fly-Swat or Cannon? Cost-Effective Language Model Choice via Meta-Modeling
Model Routing

PDF Badge


PDF Badge

#### Speculative Decoding


Title
Topic
Venue
Code


Fast Inference from Transformers via Speculative Decoding
Speculative Decoding

PDF Badge


PDF Badge



Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding
Speculative Decoding

PDF Badge


PDF Badge

Accelerating Large Language Model Decoding with Speculative Sampling
Speculative Decoding

PDF Badge


PDF Badge

### Evaluating LLMs

#### Using SMs to evaluate LLM's generations


Title
Topic
Venue
Code


BERTScore: Evaluating Text Generation with BERT
General Evaluation

PDF Badge


PDF Badge



BARTScore: Evaluating Generated Text as Text Generation
General Evaluation

PDF Badge


PDF Badge

Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation
Uncertainty

PDF Badge


PDF Badge

Selfcheckgpt: Zero-resource black-box hallucination detection for generative large language models
Uncertainty

PDF Badge


PDF Badge

ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models
Performance Prediction

PDF Badge


PDF Badge

### Domain Adaptation

#### Using domain-specific SMs to adjust token probability of LLMs at decoding time


Title
Topic
Venue
Code


CombLM: Adapting Black-Box Language Models through Small Fine-Tuned Models
White-box Domain Adaptation

PDF Badge




Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning
White-box Domain Adaptation

PDF Badge


PDF Badge



Tuning Language Models by Proxy
White-box Domain Adaptation

PDF Badge


PDF Badge

#### Using domain-specific SMs to generate knowledge for LLMs at reasoning time


Title
Topic
Venue
Code


Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models
Black-box Domain Adaptation

PDF Badge


PDF Badge



BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models
Black-box Domain Adaptation

PDF Badge


### Retrieval Augmented Generation

#### Using SMs to retrieve knowledge for enhancing generations:


Title
Topic
Venue
Code


Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Documents

PDF Badge




KnowledGPT: Enhancing Large Language Models with Retrieval and Storage Access on Knowledge Bases
Knowledge Bases

PDF Badge




End-to-End Table Question Answering via Retrieval-Augmented Generation
Tables

PDF Badge




DocPrompting: Generating Code by Retrieving the Docs
Codes

PDF Badge


PDF Badge



Toolformer: Language Models Can Teach Themselves to Use Tools
Tools

PDF Badge


PDF Badge



Retrieval-Augmented Multimodal Language Modeling
Images

PDF Badge


### Prompt-based Reasoning

#### Using SMs to augment prompts for LLMs


Title
Topic
Venue
Code


UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation
Retrieving Prompts

PDF Badge


PDF Badge



Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning
Decomposing Complex Problems

PDF Badge


PDF Badge



Small Models are Valuable Plug-ins for Large Language Models
Generating Pseudo Labels

PDF Badge


PDF Badge



Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought
Generating Pseudo Labels

PDF Badge




CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation

Generating Feedback

PDF Badge




Small Language Models Improve Giants by Rewriting Their Outputs
Generating Feedback

PDF Badge


PDF Badge

### Deficiency Repair

#### Developing SM plugins to repair deficiencies:


Title
Topic
Venue
Code


Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector
Hallucinations

PDF Badge


PDF Badge



Reconfidencing LLMs from the Grouping Loss Perspective
Hallucinations

PDF Badge




Imputing Out-of-Vocabulary Embeddings with LOVE Makes LanguageModels Robust with Little Cost
Out-Of-Vocabulary Words

PDF Badge


PDF Badge

#### Contrasting LLMs and SMs for better generations:


Title
Topic
Venue
Code


Contrastive Decoding: Open-ended Text Generation as Optimization
Reducing Repeated Texts

PDF Badge


PDF Badge



Alleviating Hallucinations of Large Language Models through Induced Hallucinations
Mitigating Hallucinations

PDF Badge




Contrastive Decoding Improves Reasoning in Large Language Models
Augmenting Reasoning Capabilities

PDF Badge




CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following
Safeguarding Privacy

PDF Badge


## LLMs Enhance SMs

### Knowledge Distillation

#### Black-box Distillation:


Title
Topic
Venue
Code


Explanations from Large Language Models Make Small Reasoners Better
Chain-Of-Thought Distillation

PDF Badge




Distilling Step-by-Step! Outperforming Larger Language Models
with Less Training Data and Smaller Model Sizes
Chain-Of-Thought Distillation

PDF Badge


PDF Badge



Distilling Reasoning Capabilities into Smaller Language Models
Chain-Of-Thought Distillation

PDF Badge


PDF Badge



Teaching Small Language Models to Reason
Chain-Of-Thought Distillation

PDF Badge




Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think" Step-by-Step

Chain-Of-Thought Distillation

PDF Badge


PDF Badge



Specializing Smaller Language Models towards Multi-Step Reasoning
Chain-Of-Thought Distillation

PDF Badge




TinyLLM: Learning a Small Student from Multiple Large Language Models

Chain-Of-Thought Distillation

PDF Badge




Lion: Adversarial Distillation of Proprietary Large Language Models
Instruction Following Distillation

PDF Badge


PDF Badge



Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
Instruction Following Distillation

PDF Badge


PDF Badge

#### White-box Distillation:


Title
Topic
Venue
Code


DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Logits

PDF Badge




ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers
Intermediate Features

PDF Badge


PDF Badge



Less is More: Task-aware Layer-wise Distillation for Language Model Compression
Intermediate Features

PDF Badge


PDF Badge



MiniLLM: Knowledge Distillation of Large Language Models
Intermediate Features

PDF Badge


PDF Badge



LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
Intermediate Features

PDF Badge


### Data Synthesis

#### Data Augmentation:


Title
Topic
Venue
Code


Improving data augmentation for low resource speech-to-text translation with diverse paraphrasing
Text Paraphrase

PDF Badge




Paraphrasing with Large Language Models
Text Paraphrase

PDF Badge




Query Rewriting for Retrieval-Augmented Large Language Models
Query Rewriting

PDF Badge


PDF Badge



LLMvsSmall Model? Large Language Model Based Text Augmentation Enhanced Personality Detection Model
Specific Tasks

PDF Badge




Data Augmentation for Intent Classification with Off-the-shelf Large Language Models
Specific Tasks

PDF Badge


PDF Badge



Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding
Specific Tasks

PDF Badge


#### Training Data Generation:


Title
Topic
Venue
Code


Want To Reduce Labeling Cost? GPT-3 Can Help
Label Annotation

PDF Badge




Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning
Label Annotation

PDF Badge




ZeroGen: Efficient Zero-shot Learning via Dataset Generation
Dataset Generation

PDF Badge


PDF Badge



Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
Dataset Generation

PDF Badge


PDF Badge



Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions
Dataset Generation

PDF Badge




Synthetic Data Generation with Large Language Models for Text Classification: Potential and Limitations
Dataset Generation

PDF Badge




Does Synthetic Data Generation of LLMs Help Clinical Text Mining?
Dataset Generation

PDF Badge




Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction
Dataset Generation

PDF Badge


PDF Badge



ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection
Dataset Generation

PDF Badge


PDF Badge

# Competition

## Computation-constrained Environment
## Task-specific Environment
## Interpretability-required Environment

## Citation

```
@misc{chen2024rolesmallmodelsllm,
title={What is the Role of Small Models in the LLM Era: A Survey},
author={Lihu Chen and Gaël Varoquaux},
year={2024},
eprint={2409.06857},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2409.06857},
}
``````