articleNature CommunicationsJul 28, 2023GOLD OA

Knowledge-enhanced visual-language pre-training on chest radiology images

Shanghai Jiao Tong University · Shandong Jiaotong University · +2 more institutions

PubMed
Indexed incrossrefdoajpubmed

Abstract

While multi-modal foundation models pre-trained on large-scale data have been successful in natural language understanding and vision recognition, their use in medical domains is still limited due to the fine-grained nature of medical tasks and the high demand for domain knowledge. To address this challenge, we propose an approach called Knowledge-enhanced Auto Diagnosis (KAD) which leverages existing medical domain knowledge to guide vision-language pre-training using paired chest X-rays and radiology reports. We evaluate KAD on four external X-ray datasets and demonstrate that its zero-shot performance is not only comparable to that of fully supervised models but also superior to the average of three expert…

No related works found for this paper.

Funding