LLM-assisted systematic review of large language models in clinical medicine

Chen, Sully F.; Alyakin, Anton; Seas, Andreas; Yang, Eunice; Choi, Jinhyuk; Lee, Jin Vivian; Chen, Amelia L.; Warman, Pranav I; Bitolas, Rochelle; Steele, Robert; Alber, Daniel A.; Oermann, Eric K.

doi:10.1038/s41591-026-04229-5

Article · Nature Medicine · Mar 1, 2026 · Hybrid OA

Duke University · Washington University in St. Louis · +10 more institutions

Indexed in: Crossref, PubMed

Abstract

Clinical evaluations of large language models (LLMs) have rapidly expanded since 2022, yet their evidence base remains opaque. The overwhelming volume of studies creates challenges for manual curation and review. However, LLMs themselves offer the scalability and capability to evaluate the ever-growing evidence base. This LLM-assisted review identified 4,609 peer-reviewed studies in clinical medicine between January 2022 and September 2025, equating to roughly 3.2 papers per day. Only 1,048 studies used real-world patient data and of these only 19 were prospective randomized trials; most addressed simulated scenarios (n = 1,857) or exam-style tasks (n = 1,704). ChatGPT and related OpenAI models constitute…
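The counts quoted in the abstract can be turned into proportions with a few lines of arithmetic. The sketch below is illustrative only: it assumes the three study categories are mutually exclusive (their counts do sum to the reported total), and the variable names are hypothetical rather than taken from the paper.

```python
# Counts as reported in the abstract.
total_studies = 4609
by_category = {
    "real-world patient data": 1048,
    "simulated scenarios": 1857,
    "exam-style tasks": 1704,
}
prospective_rcts = 19  # subset of the real-world patient data studies

# Assumption: the three categories partition the corpus.
# The reported counts are consistent with that, since they sum to the total.
assert sum(by_category.values()) == total_studies

# Share of each category among all identified studies, in percent.
shares = {
    name: round(100 * count / total_studies, 1)
    for name, count in by_category.items()
}

# Prospective randomized trials as a share of real-world studies and of all studies.
rct_share_of_real_world = round(
    100 * prospective_rcts / by_category["real-world patient data"], 1
)
rct_share_of_all = round(100 * prospective_rcts / total_studies, 2)

for name, pct in shares.items():
    print(f"{name}: {pct}%")
print(f"prospective RCTs among real-world studies: {rct_share_of_real_world}%")
print(f"prospective RCTs among all studies: {rct_share_of_all}%")
```

Under that assumption, roughly 23% of the identified studies used real-world patient data, and prospective randomized trials make up under 2% of those.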

Citation impact

Total citations: 11

FWCI: 95.93
Percentile: 100%
References: 17

Authors: 12

Topics & keywords

Keywords

MEDLINE
Task (project management)
Clinical trial
Alternative medicine
Equating
Evidence-based medicine
Randomized controlled trial
Scale (ratio)

UN Sustainable Development Goals

Quality Education
