touvron2023-llama2

arXiv: 2307.09288

TLDR (English)

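Among the first high-quality open chat models released under a license that permits commercial use, with a publicly documented RLHF recipe (rejection sampling and PPO, plus Ghost Attention (GAtt) for multi-turn consistency). It moved the open-source ecosystem to a "near-ChatGPT experience" stage.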

TLDR(中文)

第一个商用许可的高质量开源 chat 模型,并公开了 RLHF 配方(PPO + GAtt)。直接把开源生态推进到"接近 ChatGPT 体验"的阶段。