ResNeSt: Split-Attention Networks
University of California, Davis · Canadian Parks and Wilderness Society · +2 more institutions
Abstract
The ability to learn richer network representations generally boosts the performance of deep learning models. To improve representation-learning in convolutional neural networks, we present a multi-branch architecture, which applies channel-wise attention across different network branches to leverage the complementary strengths of both feature-map attention and multi-path representation. Our proposed Split-Attention module provides a simple and modular computation block that can serve as a drop-in replacement for the popular residual block, while producing more diverse representations via cross-feature interactions. Adding a Split-Attention module into the architecture design space of RegNet-Y and FBNetV2…
Citation impact
- FWCI
- 54.70
- Percentile
- 100%
- References
- 124
Authors
12Topics & keywords
- Computer science
- Leverage (statistics)
- Residual
- Modular design
- Feature learning
- Artificial intelligence
- Convolutional neural network
- Block (permutation group theory)