Aggregated Residual Transformations for Deep Neural Networks
UC San Diego Health System · Meta (Israel)
Abstract
We present a simple, highly modularized network architecture for image classification. Our network is constructed by repeating a building block that aggregates a set of transformations with the same topology. Our simple design results in a homogeneous, multi-branch architecture that has only a few hyper-parameters to set. This strategy exposes a new dimension, which we call cardinality (the size of the set of transformations), as an essential factor in addition to the dimensions of depth and width. On the ImageNet-1K dataset, we empirically show that even under the restricted condition of maintaining complexity, increasing cardinality is able to improve classification accuracy. Moreover, increasing cardinality…
Citation impact
- FWCI
- 631.39
- Percentile
- 100%
- References
- 65
Authors
5Topics & keywords
- Cardinality (data modeling)
- Set (abstract data type)
- Computer science
- Dimension (graph theory)
- Simple (philosophy)
- Block (permutation group theory)
- Artificial neural network
- Residual