Evaluation and mitigation of the limitations of large language models in clinical decision-making
TUM Klinikum · Technical University of Munich · +4 more institutions
Abstract
Clinical decision-making is one of the most impactful parts of a physician's responsibilities and stands to benefit greatly from artificial intelligence solutions and large language models (LLMs) in particular. However, while LLMs have achieved excellent performance on medical licensing exams, these tests fail to assess many skills necessary for deployment in a realistic clinical decision-making environment, including gathering information, adhering to guidelines, and integrating into clinical workflows. Here we have created a curated dataset based on the Medical Information Mart for Intensive Care database spanning 2,400 real patient cases and four common abdominal pathologies as well as a framework to…
Citation impact
- FWCI
- 56.84
- Percentile
- 100%
- References
- 72
Authors
11Topics & keywords
- Clinical decision making
- Computer science
- Intensive care medicine
- Risk analysis (engineering)
- Medicine
- Management science
- Engineering
- Peace, Justice and strong institutions