Vision–language foundation model for echocardiogram interpretation
Cedars-Sinai Medical Center · Cedars-Sinai Smidt Heart Institute · +3 more institutions
Abstract
The development of robust artificial intelligence models for echocardiography has been limited by the availability of annotated clinical data. Here, to address this challenge and improve the performance of cardiac imaging models, we developed EchoCLIP, a vision-language foundation model for echocardiography, that learns the relationship between cardiac ultrasound images and the interpretations of expert cardiologists across a wide range of patients and indications for imaging. After training on 1,032,975 cardiac ultrasound videos and corresponding expert text, EchoCLIP performs well on a diverse range of benchmarks for cardiac image interpretation, despite not having been explicitly trained for individual…
Citation impact
- FWCI
- 62.67
- Percentile
- 100%
- References
- 42
Authors
4- MCMatthew ChristensenCorresponding
Cedars-Sinai Medical Center, Cedars-Sinai Smidt Heart Institute
- MVMiloš Vukadinovic
Cedars-Sinai Medical Center, University of California, Los Angeles, Cedars-Sinai Smidt Heart Institute
- NYNeal Yuan
San Francisco VA Medical Center, University of California, San Francisco
- DODavid Ouyang
Cedars-Sinai Medical Center, Cedars-Sinai Smidt Heart Institute
Topics & keywords
- Context (archaeology)
- Ejection fraction
- Artificial intelligence
- Intracardiac injection
- Medicine
- Computer science
- Cardiac imaging
- Machine learning