Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers

Dong, Bo; Wang, Wenhai; Fan, Deng-Ping; Li, Jinpeng; Fu, Huazhu; Shao, Ling

doi:10.26599/air.2023.9150015

articleCAAI Artificial Intelligence ResearchJun 30, 2023DIAMOND OA

Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers

BDBo Dong WWWenhai Wang DFDeng-Ping Fan JLJinpeng Li HFHuazhu Fu

Zhejiang University · Inception Institute of Artificial Intelligence

Indexed incrossrefdoaj

Abstract

Most polyp segmentation methods use convolutional neural networks (CNNs) as their backbone, leading to two key issues when exchanging information between the encoder and decoder: (1) taking into account the differences in contribution between different-level features, and (2) designing an effective mechanism for fusing these features. Unlike existing CNN-based methods, we adopt a transformer encoder, which learns more powerful and robust representations. In addition, considering the image acquisition influence and elusive properties of polyps, we introduce three standard modules, including a cascaded fusion module (CFM), a camouflage identification module (CIM), and a similarity aggregation module (SAM). Among…

Citation impact

320

total citations

FWCI: 35.26
Percentile: 100%
References: 75

Citations per year

Authors

6

Topics & keywords

Topics

Keywords

Computer science
Encoder
Artificial intelligence
Segmentation
Transformer
Convolutional neural network
Pattern recognition (psychology)
Computer vision

No related works found for this paper.