Image Segmentation Using Text and Image Prompts

Lüddecke, Timo; Ecker, Alexander S.

doi:10.1109/cvpr52688.2022.00695

article2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)Jun 1, 2022GREEN OA

Image Segmentation Using Text and Image Prompts

TLTimo Lüddecke ASAlexander S. Ecker

Czech Academy of Sciences, Institute of Computer Science

Indexed incrossref

Abstract

Image segmentation is usually addressed by training a model for a fixed set of object classes. Incorporating additional classes or more complex queries later is expensive as it requires re-training the model on a dataset that encompasses these expressions. Here we propose a system that can generate image segmentations based on arbitrary prompts at test time. A prompt can be either a text or an image. This approach enables us to create a unified model (trained once) for three common segmentation tasks, which come with distinct challenges: referring expression segmentation, zero-shot segmentation and one-shot segmentation. We build upon the CLIP model as a backbone which we extend with a transformer-based…

Citation impact

463

total citations

FWCI: 24.43
Percentile: 100%
References: 89

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Computer science
Segmentation
Segmentation-based object categorization
Scale-space segmentation
Artificial intelligence
Image segmentation
Computer vision
Pattern recognition (psychology)

No related works found for this paper.