Exploring Strategies for Training Deep Neural Networks

Larochelle, Hugo; Bengio, Yoshua; Louradour, Jérôme; Lamblin, Pascal

doi:10.1145/1577069.1577070

articleDec 1, 2009Closed access

Exploring Strategies for Training Deep Neural Networks

HLHugo Larochelle YBYoshua Bengio JLJérôme Louradour PLPascal Lamblin

Abstract

Deep multi-layer neural networks have many levels of non-linearities allowing them to compactly represent highly non-linear and highly-varying functions. However, until recently it was not clear how to train such deep networks, since gradient-based optimization starting from random initialization often appears to get stuck in poor solutions. Hinton et al. recently proposed a greedy layer-wise unsupervised learning procedure relying on the training algorithm of restricted Boltzmann machines (RBM) to initialize the parameters of a deep belief network (DBN), a generative model with many layers of hidden causal variables. This was followed by the proposal of another greedy layer-wise procedure, relying on the…

Citation impact

1,117

total citations

FWCI: 28.87
Percentile: 100%
References: 60

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Computer science
Artificial intelligence
Initialization
Artificial neural network
Leverage (statistics)
Deep belief network
Deep learning
Machine learning

No related works found for this paper.