Self-Consistency Improves Chain of Thought Reasoning in Language Models
Indexed inarxivdatacite
Abstract
Chain-of-thought prompting combined with pre-trained large language models has achieved encouraging results on complex reasoning tasks. In this paper, we propose a new decoding strategy, self-consistency, to replace the naive greedy decoding used in chain-of-thought prompting. It first samples a diverse set of reasoning paths instead of only taking the greedy one, and then selects the most consistent answer by marginalizing out the sampled reasoning paths. Self-consistency leverages the intuition that a complex reasoning problem typically admits multiple different ways of thinking leading to its unique correct answer. Our extensive empirical evaluation shows that self-consistency boosts the performance of…
Citation impact
688
total citations
- FWCI
- —
- Percentile
- —
- References
- 0
Citations per year
Authors
8Topics & keywords
Keywords
- Intuition
- Consistency (knowledge bases)
- Commonsense reasoning
- Computer science
- Decoding methods
- Margin (machine learning)
- Artificial intelligence
- Set (abstract data type)
UN Sustainable Development Goals
- Reduced inequalities
No related works found for this paper.