preprintarXiv (Cornell University)Mar 21, 2022GREEN OA

Self-Consistency Improves Chain of Thought Reasoning in Language Models

Indexed inarxivdatacite

Abstract

Chain-of-thought prompting combined with pre-trained large language models has achieved encouraging results on complex reasoning tasks. In this paper, we propose a new decoding strategy, self-consistency, to replace the naive greedy decoding used in chain-of-thought prompting. It first samples a diverse set of reasoning paths instead of only taking the greedy one, and then selects the most consistent answer by marginalizing out the sampled reasoning paths. Self-consistency leverages the intuition that a complex reasoning problem typically admits multiple different ways of thinking leading to its unique correct answer. Our extensive empirical evaluation shows that self-consistency boosts the performance of…

Citation impact

688
total citations
FWCI
Percentile
References
0
Citations per year

Authors

8

Topics & keywords

Keywords
  • Intuition
  • Consistency (knowledge bases)
  • Commonsense reasoning
  • Computer science
  • Decoding methods
  • Margin (machine learning)
  • Artificial intelligence
  • Set (abstract data type)
UN Sustainable Development Goals
  • Reduced inequalities
No related works found for this paper.