Human–large language model collaboration in clinical medicine: a systematic review and meta-analysis
Chinese Academy of Medical Sciences & Peking Union Medical College · Peking Union Medical College Hospital · +4 more institutions
Abstract
Human-AI collaboration (H + AI) using large language models (LLMs) offers a promising approach to enhance clinical reasoning, documentation, and interpretation tasks. Following PRISMA 2020 (PROSPERO registration: CRD420251068272), we systematically compared H + AI with human-only (H) workflows, searching four databases through June 28, 2025. Ten peer-reviewed studies met eligibility criteria, with three preprints informing sensitivity analyses only. Diagnostic/interpretation accuracy (k = 2) showed a positive trend for H + AI (Risk Ratio [RR] 1.59), but was statistically imprecise and non-significant (95% CI 0.08 to 32.74), with 95% prediction intervals (PI) crossing the null. Composite diagnostic/management…
Citation impact
- FWCI
- 44.53
- Percentile
- 100%
- References
- 71
Authors
10- GWGuoyong Wang
Chinese Academy of Medical Sciences & Peking Union Medical College, Peking Union Medical College Hospital
- KZK Zhang
Children's Hospital of Fudan University
- JJJiyue Jiang
Chinese University of Hong Kong
- CWChaonan Wang
Chinese Academy of Medical Sciences & Peking Union Medical College, Peking Union Medical College Hospital
- HBHui Bi
Chinese Academy of Medical Sciences & Peking Union Medical College, Peking Union Medical College Hospital
Topics & keywords
- Documentation
- Quality (philosophy)
- Quality assessment
- Mean difference
- Core (optical fiber)
- MEDLINE
- Contrast (vision)
- Clinical trial