articleDec 1, 2009Closed access
Exploring Strategies for Training Deep Neural Networks
Abstract
Deep multi-layer neural networks have many levels of non-linearities allowing them to compactly represent highly non-linear and highly-varying functions. However, until recently it was not clear how to train such deep networks, since gradient-based optimization starting from random initialization often appears to get stuck in poor solutions. Hinton et al. recently proposed a greedy layer-wise unsupervised learning procedure relying on the training algorithm of restricted Boltzmann machines (RBM) to initialize the parameters of a deep belief network (DBN), a generative model with many layers of hidden causal variables. This was followed by the proposal of another greedy layer-wise procedure, relying on the…
Citation impact
1,117
total citations
- FWCI
- 28.87
- Percentile
- 100%
- References
- 60
Citations per year
Authors
4Topics & keywords
Topics
Keywords
- Computer science
- Artificial intelligence
- Initialization
- Artificial neural network
- Leverage (statistics)
- Deep belief network
- Deep learning
- Machine learning
No related works found for this paper.