Feature Shrinkage Pyramid for Camouflaged Object Detection with Transformers
University of Electronic Science and Technology of China · Changhong (China) · +4 more institutions
Abstract
Vision transformers have recently shown strong global context modeling capabilities in camouflaged object detection. However, they suffer from two major limitations: less effective locality modeling and insufficient feature aggregation in decoders, which are not conducive to camou-flaged object detection that explores subtle cues from indistinguishable backgrounds. To address these issues, in this paper, we propose a novel transformer-based Feature Shrinkage Pyramid Network (FSPNet), which aims to hierarchically decode locality-enhanced neighboring transformer features through progressive shrinking for camou-flaged object detection. Specifically, we propose a non-local token enhancement module (NL-TEM) that…
Citation impact
- FWCI
- 27.84
- Percentile
- 100%
- References
- 69
Authors
7Topics & keywords
- Computer science
- Locality
- Transformer
- Object detection
- Decoding methods
- Artificial intelligence
- Security token
- Shrinkage