Knowledge-enhanced visual-language pre-training on chest radiology images
Shanghai Jiao Tong University · Shandong Jiaotong University · +2 more institutions
Abstract
While multi-modal foundation models pre-trained on large-scale data have been successful in natural language understanding and vision recognition, their use in medical domains is still limited due to the fine-grained nature of medical tasks and the high demand for domain knowledge. To address this challenge, we propose an approach called Knowledge-enhanced Auto Diagnosis (KAD) which leverages existing medical domain knowledge to guide vision-language pre-training using paired chest X-rays and radiology reports. We evaluate KAD on four external X-ray datasets and demonstrate that its zero-shot performance is not only comparable to that of fully supervised models but also superior to the average of three expert…
Citation impact
- FWCI
- 38.97
- Percentile
- 100%
- References
- 32
Authors
5- XZXiaoman ZhangCorresponding
Shanghai Jiao Tong University, Shandong Jiaotong University, Beijing Academy of Artificial Intelligence, Shanghai Artificial Intelligence Laboratory
- CWChaoyi Wu
Shanghai Jiao Tong University, Shandong Jiaotong University, Beijing Academy of Artificial Intelligence, Shanghai Artificial Intelligence Laboratory
- YZYa Zhang
Shanghai Jiao Tong University, Shandong Jiaotong University, Beijing Academy of Artificial Intelligence, Shanghai Artificial Intelligence Laboratory
- WXWeidi Xie
Shanghai Jiao Tong University, Shandong Jiaotong University, Beijing Academy of Artificial Intelligence
- YWYanfeng Wang
Shanghai Jiao Tong University, Beijing Academy of Artificial Intelligence, Shanghai Artificial Intelligence Laboratory
Topics & keywords
- Computer science
- Domain (mathematical analysis)
- Artificial intelligence
- Domain knowledge
- Annotation
- Machine learning
- Natural language processing
- Data science
- Quality Education