touvron2023-llama2
arXiv: 2307.09288
TLDR(中文)
第一个商用许可的高质量开源 chat 模型,并公开了 RLHF 配方(PPO + GAtt)。直接把开源生态推进到"接近 ChatGPT 体验"的阶段。
TLDR (English)
First commercially licensed high-quality open-source chat model, publicly shares RLHF recipe (PPO + GAtt). Directly advances open-source ecosystem to "near ChatGPT experience" stage.