Self-Refine: Iterative Refinement with Self-Feedback
Indexed inarxivdatacite
Abstract
Like humans, large language models (LLMs) do not always generate the best output on their first try. Motivated by how humans refine their written text, we introduce Self-Refine, an approach for improving initial outputs from LLMs through iterative feedback and refinement. The main idea is to generate an initial output using an LLMs; then, the same LLMs provides feedback for its output and uses it to refine itself, iteratively. Self-Refine does not require any supervised training data, additional training, or reinforcement learning, and instead uses a single LLM as the generator, refiner, and feedback provider. We evaluate Self-Refine across 7 diverse tasks, ranging from dialog response generation to…
Citation impact
208
total citations
- FWCI
- —
- Percentile
- —
- References
- 0
Citations per year
Authors
16Topics & keywords
Keywords
- Task (project management)
- Computer science
- Reinforcement learning
- Generator (circuit theory)
- Dialog box
- Iterative learning control
- Artificial intelligence
- Machine learning
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.