Regularization of Neural Networks using DropConnect
Courant Institute of Mathematical Sciences · Université Toulouse III - Paul Sabatier · +4 more institutions
Abstract
We introduce DropConnect, a generalization of Dropout (Hinton et al., 2012), for regular-izing large fully-connected layers within neu-ral networks. When training with Dropout, a randomly selected subset of activations are set to zero within each layer. DropCon-nect instead sets a randomly selected sub-set of weights within the network to zero. Each unit thus receives input from a ran-dom subset of units in the previous layer. We derive a bound on the generalization per-formance of both Dropout and DropCon-nect. We then evaluate DropConnect on a range of datasets, comparing to Dropout, and show state-of-the-art results on several image recognition benchmarks by aggregating mul-tiple DropConnect-trained models.…
Citation impact
- FWCI
- 108.90
- Percentile
- 100%
- References
- 13
Authors
5- LWLi WanCorresponding
Courant Institute of Mathematical Sciences
- MDMatthew D. Zeiler
Courant Institute of Mathematical Sciences
- SZSixin Zhang
Université Toulouse III - Paul Sabatier, Université Toulouse-I-Capitole, Institut de Recherche en Informatique de Toulouse, Université Toulouse - Jean Jaurès, Institut Polytechnique de Bordeaux
- YLYann Lecun
Courant Institute of Mathematical Sciences
- RFRob Fergus
Courant Institute of Mathematical Sciences
Topics & keywords
- Dropout (neural networks)
- Regularization (linguistics)
- Artificial neural network
- Computer science
- Generalization
- Artificial intelligence
- Set (abstract data type)
- Layer (electronics)