Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision Language Models
Shanghai University · Institute for Advanced Study
Abstract
Open-vocabulary camouflaged object segmentation (OVCOS) seeks to segment and classify camouflaged objects in arbitrary categories, presenting unique challenges due to visual ambiguity and unseen categories. Recent approaches typically adopt a two-stage paradigm: they first segment objects, and then classify the segmented regions using vision language models (VLMs). However, such methods (i) suffer from a domain gap caused by the mismatch between VLMs' full-image training and cropped-region inferencing, and (ii) depend on generic segmentation models optimized for well-delineated objects which are less effective for camouflaged objects. Without explicit guidance, generic segmentation models often overlook subtle…
Citation impact
- FWCI
- 108.09
- Percentile
- 100%
- References
- 0
Authors
7Topics & keywords
- Segmentation
- Leverage (statistics)
- Ambiguity
- Object (grammar)
- Context (archaeology)
- Market segmentation
- Scale-space segmentation
- Semantics (computer science)