Diagnostic reasoning prompts reveal the potential for large language model interpretability in medicine

Savage, Thomas; Nayak, Ashwin; Gallo, Robert; Rangan, Ekanath; Chen, Jonathan H.

doi:10.1038/s41746-024-01010-1

articlenpj Digital MedicineJan 24, 2024GOLD OA

Diagnostic reasoning prompts reveal the potential for large language model interpretability in medicine

TSThomas Savage ANAshwin Nayak RGRobert Gallo EREkanath Rangan JHJonathan H. Chen

Stanford Health Care · Stanford Medicine · +1 more institution

PubMed

Indexed incrossrefdoajpubmed

Abstract

One of the major barriers to using large language models (LLMs) in medicine is the perception they use uninterpretable methods to make clinical decisions that are inherently different from the cognitive processes of clinicians. In this manuscript we develop diagnostic reasoning prompts to study whether LLMs can imitate clinical reasoning while accurately forming a diagnosis. We find that GPT-4 can be prompted to mimic the common clinical reasoning processes of clinicians without sacrificing diagnostic accuracy. This is significant because an LLM that can imitate clinical reasoning to provide an interpretable rationale offers physicians a means to evaluate whether an LLMs response is likely correct and can be…

Citation impact

222

total citations

FWCI: 23.73
Percentile: 100%
References: 22

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Interpretability
Perception
Cognition
Personalized medicine
Psychology
Medicine
Cognitive psychology
Computer science

UN Sustainable Development Goals

Quality Education

No related works found for this paper.