ResNeSt: Split-Attention Networks

Zhang, Hang; Wu, Chongruo; Zhang, Zhongyue; Zhu, Yi; Lin, Haibin; Zhang, Zhi; Sun, Yue; He, Tong; Mueller, Jonas; Manmatha, R.; Li, Mu; Smola, Alexander J.

doi:10.1109/cvprw56347.2022.00309

article2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)Jun 1, 2022Closed access

ResNeSt: Split-Attention Networks

HZHang Zhang CWChongruo Wu ZZZhongyue Zhang YZYi Zhu HLHaibin Lin

University of California, Davis · Canadian Parks and Wilderness Society · +2 more institutions

Indexed incrossref

Abstract

The ability to learn richer network representations generally boosts the performance of deep learning models. To improve representation-learning in convolutional neural networks, we present a multi-branch architecture, which applies channel-wise attention across different network branches to leverage the complementary strengths of both feature-map attention and multi-path representation. Our proposed Split-Attention module provides a simple and modular computation block that can serve as a drop-in replacement for the popular residual block, while producing more diverse representations via cross-feature interactions. Adding a Split-Attention module into the architecture design space of RegNet-Y and FBNetV2…

Citation impact

1,274

total citations

FWCI: 54.70
Percentile: 100%
References: 124

Citations per year

Authors

12

Topics & keywords

Topics

Keywords

Computer science
Leverage (statistics)
Residual
Modular design
Feature learning
Artificial intelligence
Convolutional neural network
Block (permutation group theory)

No related works found for this paper.