Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers

National Institute of Japanese Literature · Wuhan University · +2 more institutions

Indexed incrossref

Abstract

Weakly-supervised semantic segmentation (WSSS) with image-level labels is an important and challenging task. Due to the high training efficiency, end-to-end solutions for WSSS have received increasing attention from the community. However, current methods are mainly based on convolutional neural networks and fail to explore the global information properly, thus usually resulting in incomplete object regions. In this paper, to address the aforementioned problem, we introduce Transformers, which naturally integrate global information, to generate more integral initial pseudo labels for end-to-end WSSS. Motivated by the inherent consistency between the self-attention in Transformers and the semantic affinity, we…

Citation impact

259
total citations
FWCI
14.11
Percentile
100%
References
77
Citations per year

Authors

4

Topics & keywords

Keywords
  • Computer science
  • Pascal (unit)
  • Segmentation
  • End-to-end principle
  • Transformer
  • Artificial intelligence
  • Convolutional neural network
  • Pattern recognition (psychology)
No related works found for this paper.

Funding