Skip to content

shinn2023-reflexion

arXiv: 2303.11366

TLDR (English)

Makes agent do natural language "post-mortem" after failure, injecting reflection into next round's prompt. "Gradient-free self-improvement" approach widely reused in coding agents, SWE-agent.

TLDR(中文)

让 agent 在失败后用自然语言做"复盘",下一轮把反思塞进 prompt。"无梯度的自我改进"思路被广泛复用于 coding agent、SWE-agent。