A systematic review of large language model (LLM) evaluations in clinical medicine

Iran University of Medical Sciences · Shaheed Rajaei Cardiovascular Medical and Research Center

PubMed
Indexed incrossrefdoajpubmed

Abstract

Background

Large Language Models (LLMs), advanced AI tools based on transformer architectures, demonstrate significant potential in clinical medicine by enhancing decision support, diagnostics, and medical education. However, their integration into clinical workflows requires rigorous evaluation to ensure reliability, safety, and ethical alignment.

Objective

This systematic review examines the evaluation parameters and methodologies applied to LLMs in clinical medicine, highlighting their capabilities, limitations, and application trends.

No related works found for this paper.