Residual Networks Behave Like Ensembles of Relatively Shallow Networks
Indexed inarxivdatacite
Abstract
In this work we propose a novel interpretation of residual networks showing that they can be seen as a collection of many paths of differing length. Moreover, residual networks seem to enable very deep networks by leveraging only the short paths during training. To support this observation, we rewrite residual networks as an explicit collection of paths. Unlike traditional models, paths through residual networks vary in length. Further, a lesion study reveals that these paths show ensemble-like behavior in the sense that they do not strongly depend on each other. Finally, and most surprising, most paths are shorter than one might expect, and only the short paths are needed during training, as longer paths do…
Citation impact
601
total citations
- FWCI
- —
- Percentile
- —
- References
- 17
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Residual
- Computer science
- Interpretation (philosophy)
- Artificial intelligence
- Key (lock)
- Algorithm
No related works found for this paper.