wang2022-self-instruct
arXiv: 2212.10560
TLDR (English)
Uses GPT-3 to generate instruction-output data and distill to itself. Stanford Alpaca/Vicuna both based on this, opening "use large models to generate data for training small models" synthetic data era.
TLDR(中文)
用 GPT-3 自己生成指令-输出数据再蒸馏到自己。Stanford Alpaca / Vicuna 都基于这套,开启"用大模型造数据训小模型"的合成数据时代。