Efficient Guided Generation for Large Language Models

作者： Brandon T. Willard, Remi Louf (2023)

领域

推理应用

TLDR（中文）

提出高效的约束解码方法，让大语言模型在生成过程中实时遵守 JSON Schema、正则表达式或上下文无关文法。通过将语法约束转化为有限状态自动机，在几乎不增加延迟的情况下保证输出格式正确。

TLDR (English)

Proposes efficient constrained decoding that enforces JSON Schema, regular expressions, or context-free grammars during generation. Converts syntax constraints into finite-state automata, guaranteeing correct output format with minimal latency overhead.

出现在这些文章里

Sampling 与 Decoding：从概率到文字
Sampling and Decoding: From Probabilities to Text

同被引用

这些论文与本文出现在同一篇文章中

Efficient Guided Generation for Large Language Models

领域

TLDR（中文）

TLDR (English)

出现在这些文章里

同被引用

相关论文