A multimodal vision foundation model for clinical dermatology
Monash Health · Monash University · +9 more institutions
Abstract
Diagnosing and treating skin diseases require advanced visual skills across domains and the ability to synthesize information from multiple imaging modalities. While current deep learning models excel at specific tasks such as skin cancer diagnosis from dermoscopic images, they struggle to meet the complex, multimodal requirements of clinical practice. Here we introduce PanDerm, a multimodal dermatology foundation model pretrained through self-supervised learning on over 2 million real-world skin disease images from 11 clinical institutions across 4 imaging modalities. We evaluated PanDerm on 28 diverse benchmarks, including skin cancer screening, risk stratification, differential diagnosis of common and rare…
Citation impact
- FWCI
- 46.96
- Percentile
- 100%
- References
- 84
Authors
25Topics & keywords
- Skin cancer
- Modalities
- Modality (human–computer interaction)
- Medicine
- Medical imaging
- Segmentation
- Artificial intelligence
- Medical physics