跳转到内容

stiennon2020-summarize

arXiv: 2009.01325

TLDR(中文)

OpenAI 把 RLHF 第一次用到大型语言模型(摘要),证明 RLHF 比 SFT/MLE 在人类偏好上系统性更好。是 InstructGPT 的直接前身。

TLDR (English)

OpenAI's first application of RLHF to large language models (summarization), proving RLHF systematically better than SFT/MLE on human preferences. Direct predecessor to InstructGPT.