Greedy Layer-Wise Training of Deep Networks
Indexed incrossref
Abstract
Complexity theory of circuits strongly suggests that deep architectures can be much more efficient (sometimes exponentially) than shallow architectures, in terms of computational elements required to represent some functions. Deep multi-layer neural networks have many levels of non-linearities allowing them to compactly represent highly non-linear and highly-varying functions. However, until recently it was not clear how to train such deep networks, since gradient-based optimization starting from random initialization appears to often get stuck in poor solutions. Hinton et al. recently introduced a greedy layer-wise unsupervised learning algorithm for Deep Belief Networks (DBN), a generative model with many…
Citation impact
4,696
total citations
- FWCI
- 41.38
- Percentile
- 100%
- References
- 16
Citations per year
Authors
4Topics & keywords
Topics
Keywords
- Training (meteorology)
- Layer (electronics)
- Computer science
- Artificial intelligence
- Geography
- Materials science
- Composite material
- Meteorology
No related works found for this paper.