Title and abstract screening for literature reviews using large language models: an exploratory study in the biomedical domain
University of St.Gallen · Kantonsspital St. Gallen · +5 more institutions
Abstract
Systematically screening published literature to determine the relevant publications to synthesize in a review is a time-consuming and difficult task. Large language models (LLMs) are an emerging technology with promising capabilities for the automation of language-related tasks that may be useful for such a purpose.
LLMs were used as part of an automated system to evaluate the relevance of publications to a certain topic based on defined criteria and based on the title and abstract of each publication. A Python script was created to generate structured prompts consisting of text strings for instruction, title, abstract, and relevant criteria to be provided to an LLM. The relevance of a publication was evaluated by the LLM on a Likert scale (low relevance to high relevance). By specifying a threshold, different classifiers for inclusion/exclusion of publications could then be defined. The approach was used with four different openly available LLMs on ten published data sets of biomedical literature reviews and on a newly human-created data set for a hypothetical new systematic literature review.
Citation impact
- FWCI
- 44.11
- Percentile
- 100%
- References
- 49
Authors
5- FDFabio DennstädtCorresponding
University of St.Gallen, Kantonsspital St. Gallen, University Hospital of Bern
- JZJohannes Zink
University of Würzburg
- PMPaul Martin Putora
University of Bern, Kantonsspital St. Gallen, University Hospital of Bern
- JHJanna Hastings
SIB Swiss Institute of Bioinformatics, University of Zurich, University of St.Gallen
- NČNikola Čihorić
University of Bern, University Hospital of Bern
Topics & keywords
- Medicine
- Task (project management)
- Domain (mathematical analysis)
- Automation
- Data science
- Computer science
- Systems engineering