A multimodal vision foundation model for clinical dermatology

Yan, Siyuan; Yu, Zhen; Primiero, Clare; Vico‐Alonso, Cristina; Wang, Zhonghua; Yang, Litao; Tschandl, Philipp; Hu, Ming; Ju, Lie; Tan, Gin; Tang, Vincent; Ng, Aik Beng; Powell, David; Bonnington, C. Paul; See, Simon; Magnaterra, Elisabetta; Ferguson, Peter M.; Nguyen, Jennifer; Guitera, Pascale; Bañuls, José; Janda, Monika; Mar, Victoria; Kittler, Harald; Soyer, H. Peter; Ge, Zongyuan

doi:10.1038/s41591-025-03747-y

articleNature MedicineJun 6, 2025HYBRID OA

A multimodal vision foundation model for clinical dermatology

SYSiyuan Yan ZYZhen Yu CPClare Primiero CVCristina Vico‐Alonso ZWZhonghua Wang

Monash Health · Monash University · +9 more institutions

PubMed

Indexed incrossrefpubmed

Abstract

Diagnosing and treating skin diseases require advanced visual skills across domains and the ability to synthesize information from multiple imaging modalities. While current deep learning models excel at specific tasks such as skin cancer diagnosis from dermoscopic images, they struggle to meet the complex, multimodal requirements of clinical practice. Here we introduce PanDerm, a multimodal dermatology foundation model pretrained through self-supervised learning on over 2 million real-world skin disease images from 11 clinical institutions across 4 imaging modalities. We evaluated PanDerm on 28 diverse benchmarks, including skin cancer screening, risk stratification, differential diagnosis of common and rare…

Citation impact

59

total citations

FWCI: 46.96
Percentile: 100%
References: 84

Citations per year

Authors

25

Topics & keywords

Topics

Keywords

Skin cancer
Modalities
Modality (human–computer interaction)
Medicine
Medical imaging
Segmentation
Artificial intelligence
Medical physics

No related works found for this paper.