Self-Consistency Improves Chain of Thought Reasoning in Language Models

Wang, Xuezhi; Jason, Wei,; Schuurmans, Dale; Le, Quoc V.; H., Ed; Sharan, Narang,; Aakanksha, Chowdhery,; Denny, Zhou,

doi:10.48550/arxiv.2203.11171

preprintarXiv (Cornell University)Mar 21, 2022GREEN OA

Self-Consistency Improves Chain of Thought Reasoning in Language Models

XWXuezhi WangWJWei, JasonDSDale Schuurmans QVQuoc V. Le EHEd H.

Indexed inarxivdatacite

Abstract

Chain-of-thought prompting combined with pre-trained large language models has achieved encouraging results on complex reasoning tasks. In this paper, we propose a new decoding strategy, self-consistency, to replace the naive greedy decoding used in chain-of-thought prompting. It first samples a diverse set of reasoning paths instead of only taking the greedy one, and then selects the most consistent answer by marginalizing out the sampled reasoning paths. Self-consistency leverages the intuition that a complex reasoning problem typically admits multiple different ways of thinking leading to its unique correct answer. Our extensive empirical evaluation shows that self-consistency boosts the performance of…

Citation impact

688

total citations

FWCI: —
Percentile: —
References: 0

Citations per year

Authors

8

Topics & keywords

Topics

Keywords

Intuition
Consistency (knowledge bases)
Commonsense reasoning
Computer science
Decoding methods
Margin (machine learning)
Artificial intelligence
Set (abstract data type)

UN Sustainable Development Goals

Reduced inequalities

No related works found for this paper.