Bayesian teaching enables probabilistic reasoning in large language models

Qiu, Linlu; Sha, Fei; Allen, Kelsey R.; Kim, Yoon; Linzen, Tal; Steenkiste, Sjoerd van

doi:10.1038/s41467-025-67998-6

articleNature CommunicationsJan 7, 2026GOLD OA

Bayesian teaching enables probabilistic reasoning in large language models

LQLinlu Qiu FSFei Sha KRKelsey R. Allen YKYoon Kim TLTal Linzen

Massachusetts Institute of Technology · Menlo School · +3 more institutions

PubMed

Indexed inarxivcrossrefdoajpubmed

Abstract

Large language models (LLMs) are increasingly used as agents that interact with users and with the world. To do so successfully, LLMs must construct representations of the world and form probabilistic beliefs about them. To provide personalized recommendations, for example, the LLM needs to infer a user’s preferences from their behavior over multiple interactions. The Bayesian inference framework lays out the optimal way for an agent to update its beliefs as it receives new information. We first show that LLMs fall far short of the standard defined by the Bayesian framework. We then show that by teaching LLMs to mimic the predictions of the normative Bayesian model, we can dramatically improve their ability to…

Citation impact

5

total citations

FWCI: 134.31
Percentile: 100%
References: 31

Too recent for citation history.

Authors

6

Topics & keywords

Topics

Keywords

Probabilistic logic
Normative
Construct (python library)
Inference
Bayesian inference
Bayesian probability

UN Sustainable Development Goals

Quality Education

No related works found for this paper.