chen2023-spec-sampling
arXiv: 2302.01318
TLDR(中文)
DeepMind 同期独立提出 speculative sampling,理论上证明可在保持采样分布不变的前提下加速。和 Leviathan 一起为这条路线定调;另见 Medusa、EAGLE 等后续。
TLDR (English)
DeepMind's concurrent independent proposal of speculative sampling, theoretically proving acceleration while preserving sampling distribution. Together with Leviathan sets the direction; see also later Medusa, EAGLE.