Rethinking Semantic Segmentation: A Prototype View
ETH Zurich · University of Technology Sydney
Abstract
Prevalent semantic segmentation solutions, despite their different network designs (FCN based or attention based) and mask decoding strategies (parametric softmax based or pixel-query based), can be placed in one category, by considering the softmax weights or query vectors as learnable class prototypes. In light of this prototype view, this study uncovers several limitations of such parametric segmentation regime, and proposes a nonparametric alternative based on non-learnable prototypes. Instead of prior methods learning a single weight/query vector for each class in a fully parametric manner, our model represents each class as a set of non-learnable prototypes, relying solely on the mean fea-tures of…
Citation impact
- FWCI
- 35.93
- Percentile
- 100%
- References
- 176
Authors
4Topics & keywords
- Softmax function
- Computer science
- Segmentation
- Pixel
- Nonparametric statistics
- Artificial intelligence
- Parametric statistics
- Pattern recognition (psychology)