Reflexion: Language Agents with Verbal Reinforcement Learning

Authors: Noah Shinn, Federico Cassano, Edward Berman, Ashwin Gopinath, Karthik Narasimhan, Shunyu Yao (2023)

Domains

Applications

TLDR (English)

Makes agent do natural language "post-mortem" after failure, injecting reflection into next round's prompt. "Gradient-free self-improvement" approach widely reused in coding agents, SWE-agent.

TLDR（中文）

让 agent 在失败后用自然语言做"复盘"，下一轮把反思塞进 prompt。"无梯度的自我改进"思路被广泛复用于 coding agent、SWE-agent。

Appears in These Articles

Agent 与工具使用：模型不只是聊天
Agents and Tool Use: Models Are More Than Chat

Co-cited Papers

These papers appear in the same articles as this one

Related Papers

Other papers in the same domain