CLIP-Driven Universal Model for Organ Segmentation and Tumor Detection
City University of Hong Kong · Johns Hopkins University · +4 more institutions
Abstract
An increasing number of public datasets have shown a marked impact on automated organ segmentation and tumor detection. However, due to the small size and partially labeled problem of each dataset, as well as a limited investigation of diverse types of tumors, the resulting models are often limited to segmenting specific organs/tumors and ignore the semantics of anatomical structures, nor can they be extended to novel domains. To address these issues, we propose the CLIP-Driven Universal Model, which incorporates text embedding learned from Contrastive Language-Image Pre-training (CLIP) to segmentation models. This CLIPbased label encoding captures anatomical relationships, enabling the model to learn a…
Citation impact
- FWCI
- 63.91
- Percentile
- 100%
- References
- 113
Authors
10Topics & keywords
- Segmentation
- Computer science
- Embedding
- Artificial intelligence
- Feature (linguistics)
- Encoding (memory)
- Semantics (computer science)
- Image segmentation