Self-Refine: Iterative Refinement with Self-Feedback

Madaan, Aman; Tandon, Niket; Gupta, Prakhar; Hallinan, Skyler; Gao, Luyu; Wiegreffe, Sarah; Alon, Uri; Dziri, Nouha; Prabhumoye, Shrimai; Yang, Yiming; Gupta, Shashank; Majumder, Bodhisattwa Prasad; Hermann, Katherine; Welleck, Sean; Yazdanbakhsh, Amir; Clark, Peter

doi:10.48550/arxiv.2303.17651

Preprint, arXiv (Cornell University), Mar 30, 2023, Green OA

Indexed in: arXiv, DataCite

Abstract

Like humans, large language models (LLMs) do not always generate the best output on their first try. Motivated by how humans refine their written text, we introduce Self-Refine, an approach for improving initial outputs from LLMs through iterative feedback and refinement. The main idea is to generate an initial output using an LLM; then, the same LLM provides feedback for its output and uses it to refine itself, iteratively. Self-Refine does not require any supervised training data, additional training, or reinforcement learning, and instead uses a single LLM as the generator, refiner, and feedback provider. We evaluate Self-Refine across 7 diverse tasks, ranging from dialog response generation to…
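The generate, feedback, refine loop described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `self_refine`, the prompt wording, the `max_iters` cap, and the `NO_ISSUES` stop marker are all assumptions introduced here, and `toy_llm` is a hypothetical stand-in for a real model call.

```python
from typing import Callable, List, Tuple


def self_refine(
    task: str,
    llm: Callable[[str], str],
    max_iters: int = 3,             # assumed iteration cap (not from the abstract)
    stop_marker: str = "NO_ISSUES"  # assumed stop signal (not from the abstract)
) -> Tuple[str, List[str]]:
    """Run one LLM as generator, feedback provider, and refiner."""
    # Step 1: the model produces an initial output.
    output = llm(f"Task: {task}\nWrite an initial answer.")
    drafts = [output]

    for _ in range(max_iters):
        # Step 2: the same model critiques its own output.
        critique = llm(
            f"Task: {task}\nAnswer: {output}\n"
            f"Give feedback, or reply {stop_marker} if none is needed."
        )
        if stop_marker in critique:
            break
        # Step 3: the same model rewrites the output using that feedback.
        output = llm(
            f"Task: {task}\nAnswer: {output}\nFeedback: {critique}\n"
            "Rewrite the answer using the feedback."
        )
        drafts.append(output)

    return output, drafts


def toy_llm(prompt: str) -> str:
    """Hypothetical deterministic stub standing in for a real LLM call."""
    if "Rewrite the answer" in prompt:
        return "hello world!"
    if "Give feedback" in prompt:
        done = "Answer: hello world!" in prompt
        return "NO_ISSUES" if done else "Add punctuation."
    return "hello world"


final, drafts = self_refine("Greet the reader.", toy_llm)
print(final)
print(drafts)
```

No training step or second model appears anywhere in the loop; the only moving part is the prompt sent to the single `llm` callable, which matches the abstract's claim that one LLM plays all three roles. How the loop should terminate is not stated in the abstract, so the stop marker and iteration cap here are placeholders.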

Citation impact

Total citations: 208

FWCI: —
Percentile: —
References: 0

Authors: 16

Topics & keywords

Keywords

Task (project management)
Computer science
Reinforcement learning
Generator (circuit theory)
Dialog box
Iterative learning control
Artificial intelligence
Machine learning

UN Sustainable Development Goals

Quality Education