Efficient Guided Generation for Large Language Models
arXiv: 2307.09702
领域
TLDR(中文)
提出高效的约束解码方法,让大语言模型在生成过程中实时遵守 JSON Schema、正则表达式或上下文无关文法。通过将语法约束转化为有限状态自动机,在几乎不增加延迟的情况下保证输出格式正确。
TLDR (English)
Proposes efficient constrained decoding that enforces JSON Schema, regular expressions, or context-free grammars during generation. Converts syntax constraints into finite-state automata, guaranteeing correct output format with minimal latency overhead.
相关论文
同一领域的其他论文