InceptionNeXt: When Inception Meets ConvNeXt

Yu, Weihao; Zhou, Pan; Yan, Shuicheng; Wang, Xinchao

doi:10.1109/cvpr52733.2024.00542

articleJun 16, 2024Closed access

InceptionNeXt: When Inception Meets ConvNeXt

WYWeihao Yu PZPan Zhou SYShuicheng Yan XWXinchao Wang

National University of Singapore · Singapore Management University · +1 more institution

Indexed incrossref

Abstract

Inspired by the long-range modeling ability of ViTs, large-kernel convolutions are widely studied and adopted recently to enlarge the receptive field and improve model performance, like the remarkable work ConvNeXt which employs 7 × 7 depthwise convolution. Although such depth-wise operator only consumes a few FLOPs, it largely harms the model efficiency on powerful computing devices due to the high memory access costs. For example, ConvNeXt-T has similar FLOPs with ResNet-50 but only achieves ~60% throughputs when trained on A100 GPUs with full precision. Although reducing the kernel size of ConvNeXt can improve speed, it results in significant performance degradation, which poses a challenging problem: How…

Citation impact

333

total citations

FWCI: 301.67
Percentile: 100%
References: 100

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Computer science

No related works found for this paper.

Funding

NR
National Research Foundation Singapore
Award: A0005947-21-00