The Illusion of Thinking

Apple (Israel) · Apple (United States) · +1 more institution

Indexed incrossref

Abstract

Recent generations of frontier language models have introduced Large Reasoning Models (LRMs) that generate detailed thinking processes before providing answers. While these models demonstrate improved performance on reasoning benchmarks, their fundamental capabilities, scaling properties, and limitations remain insufficiently understood. Current evaluations primarily focus on established mathematical and coding benchmarks, emphasizing final answer accuracy. However, this evaluation paradigm often suffers from data contamination and does not provide insights into the reasoning traces’ structure and quality. In this work, we systematically investigate these gaps with the help of controllable puzzle environments…

Citation impact

109
total citations
FWCI
207.72
Percentile
100%
References
0
Citations per year

Authors

6

Topics & keywords

Keywords
  • Inference
  • Counterintuitive
  • Focus (optics)
  • Analytic reasoning
  • Scaling
  • Qualitative reasoning
  • Illusion
  • Visual reasoning
No related works found for this paper.