awesome-llm-constrained-decoding
A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.
https://github.com/saibo-creator/awesome-llm-constrained-decoding
Last synced: 4 days ago
JSON representation
-
Benchmark & Datasets & Evaluation
- JSON-mode Eval dataset
- JSON-mode Eval dataset
- Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?
- NLV corpus
- Spider: A large-scale human-labeled dataset for complex and cross-domain semantic parsing and text-to-SQL task
- JSON-mode Eval Cleaned/Extended, SMILES Eval and HumanEval MRI C++
- JsonSchemaBench: Generating Structured Outputs from Language Models: Benchmark and Studies
- COLLIE: Systematic Construction of Constrained Text Generation Tasks
- BenchCLAMP: A Benchmark for Evaluating Language Models on Syntactic and Semantic Parsing
- Evaluating Large Language Models on Controlled Generation Tasks
- CommonGen: A Constrained Text Generation Challenge for Generative Commonsense Reasoning
-
Blog Posts
- Coding For Structured Generation with LLMs
- Beating GPT-4 with Open Source
- Prompt Efficiency - Using Structured Generation to get 8-shot performance from 1-shot.
- How fast can grammar-structured generation be?
- Coding For Structured Generation with LLMs
- Prompt Efficiency - Using Structured Generation to get 8-shot performance from 1-shot.
- Beating GPT-4 with Open Source
- Structured Generation Improves LLM performance: GSM8K Benchmark
- Coalescence: making LLM inference 5x faster
- Constrained Decoding with Arbitrary Constraints is NP-hard
- LLMs are bad at returning code in JSON
- Say what I mean
- Outlines
- The good, the bad, and the ugly of Gemini’s structured outputs
- Leveraging Constrained Sampling for Fill-in-the-Middle Code Completion
- LLM Decoding with Regex Constraints
- Constrained Decoding is Posterior Inference - creator
- Making Structured Generation Faster Than Unstructured
- LLGuidance: Making Structured Outputs Go Brrr
- Proper Well-Formedness for Finite LLM Sampling
-
Papers
- PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models
- Constrained Language Models Yield Few-Shot Semantic Parsers
- Controlled Text Generation as Continuous Optimization with Multiple Constraints
- NEUROLOGIC DECODING:(Un)supervised Neural Text Generation with Predicate Logic Constraints
- A General-Purpose Algorithm for Constrained Sequential Inference
- Improved Lexically Constrained Decoding for Translation and Monolingual Rewriting
- CGMH: Constrained Sentence Generation by Metropolis-Hastings Sampling
- Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation
- Constrained Decoding of Diffusion LLMs with Context-Free Grammars
- Earley-Driven Dynamic Pruning for Efficient Structured Decoding
- Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling
- Flexible and Efficient Grammar-Constrained Decoding
- Automata-based constraints for language model decoding
- Lazy-k Decoding: Constrained Decoding for Information Extraction
- CodePAD: Sequence-based Code Generation with Pushdown Automaton
- Controllable Text Generation with Neurally-Decomposed Oracle
- Gradient-Based Constrained Sampling from Language Models
- COLD Decoding: Energy-based Constrained Text Generation with Langevin Dynamics
- Synchromesh: Reliable code generation from pre-trained language models
- Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting
- KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection
- Efficient Guided Generation for Large Language Models
- Grammar Prompting for Domain-Specific Language Generation with Large Language Models
- Grammar-Constrained Decoding for Structured NLP Tasks without Finetuning
- Prompting Is Programming: A Query Language for Large Language Models
- Measuring and Mitigating Constraint Violations of In-Context Learning for Utterance-to-API Semantic Parsing
- Tractable Control for Autoregressive Language Generation
- Validating Large Language Models with ReLM
- CRANE: Reasoning with constrained LLM generation
- Generating Structured Outputs from Language Models: Benchmark and Studies
- Guiding LLMs The Right Way: Fast, Non-Invasive Constrained Generation
- Grammar-Constrained Decoding Makes Large Language Models Better Logical Parsers
- GRAMMAR-LLM: Grammar-Constrained Natural Language Generation
- Pre3: Enabling Deterministic Pushdown Automata for Faster Structured LLM Generation
- Syntactic Control of Language Models by Posterior Inference
- Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo
- Efficient and Asymptotically Unbiased Constrained Decoding for Large Language Models
- Type-Constrained Code Generation with Language Models
- Lost in Space: Optimizing Tokens for Grammar-Constrained Decoding
- Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence
- Circuit Transformer: A Transformer That Preserves Logical Equivalence
- XGRAMMAR: FLEXIBLE AND EFFICIENT STRUCTURED GENERATION ENGINE FOR LARGE LANGUAGE MODELS
- IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking
- Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models
- FANTAstic SEquences and Where to Find Them: Faithful and Efficient API Call Generation through State-tracked Constrained Decoding and Reranking
- Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access
- Grammar-Aligned Decoding
- SynCode: LLM Generation with Grammar Augmentation
- Constrained Decoding for Cross-lingual Label Projection
- Constrained Decoding for Code Language Models via Efficient Left and Right Quotienting of Context-Sensitive Grammars
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents
- SGLang: Efficient Execution of Structured Language Model Programs
- Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context
- Prompt Sketching for Large Language Models
- Sequential Monte Carlo Steering of Large Language Models using Probabilistic Programs
- Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained Decoding
- Amortizing intractable inference in large language models
- Incorporating Discriminator in Sentence Generation: a Gibbs Sampling Method
- Guided Open Vocabulary Image Captioning with Constrained Beam Search
- Lexically Constrained Decoding for Sequence Generation Using Grid Beam Search
- DINGO: Constrained Inference for Diffusion LLMs
- Constrained Language Generation with Discrete Diffusion Models
-
Libraries
- genlm/genlm-control - control) |
- epfl-dlab/transformers-CFG - dlab/transformers-CFG) |
- structuredllm/itergen
- noamgat/lm-format-enforcer - format-enforcer) |
- eth-sri/type-constrained-code-generation - safety. |  |
- outlines-dev/outlines - dev/outlines) |
- guidance-ai/guidance - CPP |  |
- eth-sri/lmql - sri/lmql) |
- jxnl/instructor - Reject-Repeat approach to ensure constraints are met |  |
- microsoft/aici
- mlc-ai/xgrammar - ai/xgrammar) |
- Dan-wanna-M/formatron - wanna-M/formatron) |
- snowkylin/circuit-transformer - transformer) |
- epfl-dlab/jsonschemabench - dlab/jsonschemabench) |
- eth-sri/constrained-diffusion - Region Infilling (CFG) |  |
- grammarllm
-
Survey
-
Related Awesome Lists
Programming Languages
Categories
Sub Categories
Keywords
llm
6
language-model
3
llm-inference
2
large-language-models
2
structured-generation
2
constrained-decoding
2
llmops
1
llm-serving
1
llm-framework
1
inference
1
ai
1
programming-language
1
huggingface
1
chatgpt
1
gpt-actions
1
function-calling
1
awesome-list
1
model-serving
1
rust
1
transformer
1
wasm
1
wasmtime
1
code-synthesis
1
type-systems
1
probabilistic-programming
1
grammar
1
parser
1
diffusion
1
diffusion-model
1
fill-in-the-middle
1
llms
1
llms-benchmarking
1
multi-region-infilling
1