Article · npj Digital Medicine · Jan 28, 2026 · Gold OA

Human–large language model collaboration in clinical medicine: a systematic review and meta-analysis

Chinese Academy of Medical Sciences & Peking Union Medical College · Peking Union Medical College Hospital · +4 more institutions

Indexed in: Crossref, DOAJ, PubMed

Abstract

Human-AI collaboration (H + AI) using large language models (LLMs) offers a promising approach to enhance clinical reasoning, documentation, and interpretation tasks. Following PRISMA 2020 (PROSPERO registration: CRD420251068272), we systematically compared H + AI with human-only (H) workflows, searching four databases through June 28, 2025. Ten peer-reviewed studies met eligibility criteria, with three preprints informing sensitivity analyses only. Diagnostic/interpretation accuracy (k = 2) showed a positive trend for H + AI (risk ratio [RR] 1.59), but the estimate was imprecise and not statistically significant (95% CI 0.08 to 32.74), with the 95% prediction interval (PI) crossing the null. Composite diagnostic/management…

Funding