InceptionNeXt: When Inception Meets ConvNeXt
National University of Singapore · Singapore Management University · +1 more institution
Abstract
Inspired by the long-range modeling ability of ViTs, large-kernel convolutions are widely studied and adopted recently to enlarge the receptive field and improve model performance, like the remarkable work ConvNeXt which employs 7 × 7 depthwise convolution. Although such depth-wise operator only consumes a few FLOPs, it largely harms the model efficiency on powerful computing devices due to the high memory access costs. For example, ConvNeXt-T has similar FLOPs with ResNet-50 but only achieves ~60% throughputs when trained on A100 GPUs with full precision. Although reducing the kernel size of ConvNeXt can improve speed, it results in significant performance degradation, which poses a challenging problem: How…
Citation impact
- FWCI
- 301.67
- Percentile
- 100%
- References
- 100
Authors
4Topics & keywords
- Computer science