Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers
Zhejiang University · Inception Institute of Artificial Intelligence
Abstract
Most polyp segmentation methods use convolutional neural networks (CNNs) as their backbone, leading to two key issues when exchanging information between the encoder and decoder: (1) taking into account the differences in contribution between different-level features, and (2) designing an effective mechanism for fusing these features. Unlike existing CNN-based methods, we adopt a transformer encoder, which learns more powerful and robust representations. In addition, considering the image acquisition influence and elusive properties of polyps, we introduce three standard modules, including a cascaded fusion module (CFM), a camouflage identification module (CIM), and a similarity aggregation module (SAM). Among…
Citation impact
- FWCI
- 35.26
- Percentile
- 100%
- References
- 75
Authors
6Topics & keywords
- Computer science
- Encoder
- Artificial intelligence
- Segmentation
- Transformer
- Convolutional neural network
- Pattern recognition (psychology)
- Computer vision