A systematic review of large language model (LLM) evaluations in clinical medicine
Iran University of Medical Sciences · Shaheed Rajaei Cardiovascular Medical and Research Center
Abstract
Large Language Models (LLMs), advanced AI tools based on transformer architectures, demonstrate significant potential in clinical medicine by enhancing decision support, diagnostics, and medical education. However, their integration into clinical workflows requires rigorous evaluation to ensure reliability, safety, and ethical alignment.
This systematic review examines the evaluation parameters and methodologies applied to LLMs in clinical medicine, highlighting their capabilities, limitations, and application trends.
Citation impact
- FWCI
- 102.35
- Percentile
- 100%
- References
- 16
Authors
6- SSSina ShoolCorresponding
Iran University of Medical Sciences, Shaheed Rajaei Cardiovascular Medical and Research Center
- SASara Adimi
Iran University of Medical Sciences, Shaheed Rajaei Cardiovascular Medical and Research Center
- RSReza Saboori Amleshi
Iran University of Medical Sciences, Shaheed Rajaei Cardiovascular Medical and Research Center
- EBEhsan Bitaraf
Iran University of Medical Sciences, Shaheed Rajaei Cardiovascular Medical and Research Center
- RGReza Golpira
Iran University of Medical Sciences, Shaheed Rajaei Cardiovascular Medical and Research Center
Topics & keywords
- Health informatics
- Computer science
- Medicine
- Medical education
- Public health
- Nursing
- Quality Education