Expectation-Maximization Attention Networks for Semantic Segmentation
Peking University · Shandong University of Science and Technology · +1 more institution
Abstract
Self-attention mechanism has been widely used for various tasks. It is designed to compute the representation of each position by a weighted sum of the features at all positions. Thus, it can capture long-range relations for computer vision tasks. However, it is computationally consuming. Since the attention maps are computed w.r.t all other positions. In this paper, we formulate the attention mechanism into an expectation-maximization manner and iteratively estimate a much more compact set of bases upon which the attention maps are computed. By a weighted summation upon these bases, the resulting representation is low-rank and deprecates noisy information from the input. The proposed Expectation-Maximization…
Citation impact
- FWCI
- 32.05
- Percentile
- 100%
- References
- 53
Authors
6Topics & keywords
- Pascal (unit)
- Computer science
- Maximization
- Artificial intelligence
- Computation
- Segmentation
- Normalization (sociology)
- Theoretical computer science