Physician- and Large Language Model–Generated Hospital Discharge Summaries

Williams, Christopher Y. K.; Subramanian, Charumathi Raghu; Ali, Syed Salman; Apolinario, Michael; Askin, Elisabeth; Barish, Peter; Cheng, Monica; Deardorff, William James; Donthi, Nisha; Ganeshan, Smitha; Huang, Owen; Kantor, Molly A.; Lai, Andrew; Manchanda, Ashley; Moore, Kendra A.; Muniyappa, Anoop; Nair, Geethu; Patel, Prashant; Santhosh, Lekshmi; Schneider, Susan; Torres, Shawn; Yukawa, Michi; Hubbard, Colin C.; Rosner, Benjamin I.

doi:10.1001/jamainternmed.2025.0821

letterJAMA Internal MedicineMay 5, 2025GREEN OA

Physician- and Large Language Model–Generated Hospital Discharge Summaries

CYChristopher Y. K. Williams CRCharumathi Raghu Subramanian SSSyed Salman Ali MAMichael Apolinario EAElisabeth Askin

University of California, San Francisco · University of California System

PubMed

Indexed incrossrefpubmed

Abstract

Importance

High-quality discharge summaries are associated with improved patient outcomes, but contribute to clinical documentation burden. Large language models (LLMs) provide an opportunity to support physicians by drafting discharge summary narratives.

Objective

To determine whether LLM-generated discharge summary narratives are of comparable quality and safety to those of physicians. Design, Setting, and Participants: This cross-sectional study conducted at the University of California, San Francisco included 100 randomly selected inpatient hospital medicine encounters of 3 to 6 days' duration between 2019 and 2022. The analysis took place in July 2024. Exposure: A blinded evaluation of physician- and LLM-generated narratives was performed in duplicate by 22 attending physician reviewers. Main Outcomes and Measures: Narratives were reviewed for overall quality, reviewer preference, comprehensiveness, concision, coherence, and 3 error types (inaccuracies, omissions, and hallucinations). Each error individually, and each narrative overall, were assigned potential harmfulness scores ranging from 0 to 7 on an adapted Agency for Healthcare Research and Quality scale.

Citation impact

62

total citations

FWCI: 117.26
Percentile: 100%
References: 43

Citations per year

Authors

24

Topics & keywords

Topics

Keywords

Medicine
Narrative
Likert scale
Scale (ratio)
Family medicine
Health care
Documentation
Statistics

UN Sustainable Development Goals

Quality Education

No related works found for this paper.