preprintarXiv (Cornell University)May 20, 2016GREEN OA

Residual Networks Behave Like Ensembles of Relatively Shallow Networks

Cornell University

Indexed inarxivdatacite

Abstract

In this work we propose a novel interpretation of residual networks showing that they can be seen as a collection of many paths of differing length. Moreover, residual networks seem to enable very deep networks by leveraging only the short paths during training. To support this observation, we rewrite residual networks as an explicit collection of paths. Unlike traditional models, paths through residual networks vary in length. Further, a lesion study reveals that these paths show ensemble-like behavior in the sense that they do not strongly depend on each other. Finally, and most surprising, most paths are shorter than one might expect, and only the short paths are needed during training, as longer paths do…

Citation impact

601
total citations
FWCI
Percentile
References
17
Citations per year

Authors

3

Topics & keywords

Keywords
  • Residual
  • Computer science
  • Interpretation (philosophy)
  • Artificial intelligence
  • Key (lock)
  • Algorithm
No related works found for this paper.