articleDec 1, 2009Closed access

Exploring Strategies for Training Deep Neural Networks

Abstract

Deep multi-layer neural networks have many levels of non-linearities allowing them to compactly represent highly non-linear and highly-varying functions. However, until recently it was not clear how to train such deep networks, since gradient-based optimization starting from random initialization often appears to get stuck in poor solutions. Hinton et al. recently proposed a greedy layer-wise unsupervised learning procedure relying on the training algorithm of restricted Boltzmann machines (RBM) to initialize the parameters of a deep belief network (DBN), a generative model with many layers of hidden causal variables. This was followed by the proposal of another greedy layer-wise procedure, relying on the…

Citation impact

1,117
total citations
FWCI
28.87
Percentile
100%
References
60
Citations per year

Authors

4

Topics & keywords

Keywords
  • Computer science
  • Artificial intelligence
  • Initialization
  • Artificial neural network
  • Leverage (statistics)
  • Deep belief network
  • Deep learning
  • Machine learning
No related works found for this paper.