Large language models for conducting systematic reviews: on the rise, but not yet ready for use—a scoping review

Lieberum, Judith-Lisa; Toews, Markus; Metzendorf, Maria‐Inti; Heilmeyer, Felix; Siemens, Waldemar; Haverkamp, Christian; Böhringer, Daniel; Meerpohl, Joerg J; Eisele‐Metzger, Angelika

doi:10.1016/j.jclinepi.2025.111746

reviewJournal of Clinical EpidemiologyFeb 26, 2025HYBRID OA

Large language models for conducting systematic reviews: on the rise, but not yet ready for use—a scoping review

JLJudith-Lisa LieberumMTMarkus ToewsMMMaria‐Inti Metzendorf FHFelix Heilmeyer WSWaldemar Siemens

University Medical Center Freiburg · Heinrich Heine University Düsseldorf

PubMed

Indexed incrossrefpubmed

Abstract

Methods

We systematically searched MEDLINE, Web of Science, IEEEXplore, ACM Digital Library, Europe PMC (preprints), Google Scholar, and conducted an additional hand search (last search: February 26, 2024). We included scientific articles in English or German, published from April 2021 onwards, building upon the results of a mapping review that has not yet identified LLM applications to support SRs. Two reviewers independently screened studies for eligibility; after piloting, 1 reviewer extracted data, checked by another.

Results

Our database search yielded 8054 hits, and we identified 33 articles from our hand search. We finally included 37 articles on LLM support. LLM approaches covered 10 of 13 defined SR steps, most frequently literature search (n = 15, 41%), study selection (n = 14, 38%), and data extraction (n = 11, 30%). The mostly recurring LLM was Generative Pretrained Transformer (GPT) (n = 33, 89%). Validation studies were predominant (n = 21, 57%). In half of the studies, authors evaluated LLM use as promising (n = 20, 54%), one-quarter as neutral (n = 9, 24%) and one-fifth as nonpromising (n = 8, 22%).

Citation impact

79

total citations

FWCI: 82.11
Percentile: 100%
References: 45

Citations per year

Authors

9

JL
Judith-Lisa LieberumCorresponding
University Medical Center Freiburg
MT
Markus Toews
University Medical Center Freiburg
MM
Maria‐Inti Metzendorf
Heinrich Heine University Düsseldorf
FH
Felix Heilmeyer
University Medical Center Freiburg
WS
Waldemar Siemens
University Medical Center Freiburg

Topics & keywords

Topics

Keywords

Systematic review
Medicine
MEDLINE
Data science
Psychology
Computer science
Political science

No related works found for this paper.