preprintarXiv (Cornell University)Mar 30, 2023GREEN OA

Self-Refine: Iterative Refinement with Self-Feedback

Indexed inarxivdatacite

Abstract

Like humans, large language models (LLMs) do not always generate the best output on their first try. Motivated by how humans refine their written text, we introduce Self-Refine, an approach for improving initial outputs from LLMs through iterative feedback and refinement. The main idea is to generate an initial output using an LLMs; then, the same LLMs provides feedback for its output and uses it to refine itself, iteratively. Self-Refine does not require any supervised training data, additional training, or reinforcement learning, and instead uses a single LLM as the generator, refiner, and feedback provider. We evaluate Self-Refine across 7 diverse tasks, ranging from dialog response generation to…

Citation impact

208
total citations
FWCI
Percentile
References
0
Citations per year

Authors

16

Topics & keywords

Keywords
  • Task (project management)
  • Computer science
  • Reinforcement learning
  • Generator (circuit theory)
  • Dialog box
  • Iterative learning control
  • Artificial intelligence
  • Machine learning
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.