Evaluating large language models as agents in the clinic
Hearst (United States) · University of California, Berkeley · +1 more institution
Abstract
Recent developments in large language models (LLMs) have unlocked opportunities for healthcare, from information synthesis to clinical decision support. These LLMs are not just capable of modeling language, but can also act as intelligent “agents” that interact with stakeholders in open-ended conversations and even influence clinical decision-making. Rather than relying on benchmarks that measure a model’s ability to process clinical data or answer standardized test questions, LLM agents can be modeled in high-fidelity simulations of clinical settings and should be assessed for their impact on clinical workflows. These evaluation frameworks, which we refer to as “Artificial Intelligence Structured Clinical…
Citation impact
- FWCI
- 12.94
- Percentile
- 100%
- References
- 28
Authors
6- NMNikita MehandruCorresponding
Hearst (United States), University of California, Berkeley
- BYBrenda Y. Miao
University of California, San Francisco
- EREduardo Rodriguez Almaraz
University of California, San Francisco
- MSMadhumita Sushil
University of California, San Francisco
- AJAtul J. Butte
University of California, San Francisco
Topics & keywords
- Workflow
- Process (computing)
- Computer science
- Fidelity
- Corporate governance
- Clinical decision support system
- Knowledge management
- Data science
- Quality Education