Greedy Layer-Wise Training of Deep Networks

Bengio, Yoshua; Lamblin, Pascal; Popovici, Dan; Larochelle, Hugo

doi:10.7551/mitpress/7503.003.0024

Book chapter; The MIT Press eBooks; Sep 7, 2007; Closed access

Université de Montréal

Indexed in Crossref

Abstract

Complexity theory of circuits strongly suggests that deep architectures can be much more efficient (sometimes exponentially) than shallow architectures, in terms of computational elements required to represent some functions. Deep multi-layer neural networks have many levels of non-linearities allowing them to compactly represent highly non-linear and highly-varying functions. However, until recently it was not clear how to train such deep networks, since gradient-based optimization starting from random initialization appears to often get stuck in poor solutions. Hinton et al. recently introduced a greedy layer-wise unsupervised learning algorithm for Deep Belief Networks (DBN), a generative model with many…
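The greedy layer-wise idea mentioned in the abstract can be illustrated with a minimal sketch: train one layer at a time without labels, then feed that layer's hidden activations to the next layer as its training data. The sketch below assumes binary restricted Boltzmann machines trained with one step of contrastive divergence (CD-1), using mean-field values in the negative phase. The function names (`train_rbm`, `greedy_pretrain`), learning rate, epoch count, and toy data are all hypothetical choices made for illustration; they are not taken from the chapter.

```python
import math
import random


def sigmoid(x):
    # Clamp the argument so math.exp cannot overflow.
    x = max(-50.0, min(50.0, x))
    return 1.0 / (1.0 + math.exp(-x))


def hidden_probs(v, W, c):
    # P(h_j = 1 | v) for every hidden unit j.
    return [sigmoid(c[j] + sum(v[i] * W[i][j] for i in range(len(v))))
            for j in range(len(c))]


def visible_probs(h, W, b):
    # P(v_i = 1 | h) for every visible unit i.
    return [sigmoid(b[i] + sum(h[j] * W[i][j] for j in range(len(h))))
            for i in range(len(b))]


def sample(probs, rng):
    # Draw binary states from independent Bernoulli probabilities.
    return [1.0 if rng.random() < p else 0.0 for p in probs]


def train_rbm(data, n_hidden, rng, epochs=50, lr=0.1):
    # Train a single RBM with CD-1 (assumed update rule, see lead-in).
    n_visible = len(data[0])
    W = [[rng.gauss(0.0, 0.01) for _ in range(n_hidden)]
         for _ in range(n_visible)]
    b = [0.0] * n_visible  # visible biases
    c = [0.0] * n_hidden   # hidden biases
    for _ in range(epochs):
        for v0 in data:
            ph0 = hidden_probs(v0, W, c)   # positive phase
            h0 = sample(ph0, rng)
            pv1 = visible_probs(h0, W, b)  # one-step reconstruction
            ph1 = hidden_probs(pv1, W, c)  # negative phase
            for i in range(n_visible):
                for j in range(n_hidden):
                    W[i][j] += lr * (v0[i] * ph0[j] - pv1[i] * ph1[j])
                b[i] += lr * (v0[i] - pv1[i])
            for j in range(n_hidden):
                c[j] += lr * (ph0[j] - ph1[j])
    return W, b, c


def greedy_pretrain(data, layer_sizes, seed=0):
    # Train layers bottom-up; each layer sees the previous layer's output.
    rng = random.Random(seed)
    layers = []
    inputs = data
    for n_hidden in layer_sizes:
        W, b, c = train_rbm(inputs, n_hidden, rng)
        layers.append((W, b, c))
        # Lower layers stay fixed; their activations become the new inputs.
        inputs = [hidden_probs(v, W, c) for v in inputs]
    return layers, inputs


if __name__ == "__main__":
    toy = [[1.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 1.0]] * 4
    stack, top = greedy_pretrain(toy, layer_sizes=[3, 2])
    print(len(stack), "layers trained; top-level code size:", len(top[0]))
```

In this sketch the weights returned by `greedy_pretrain` would serve only as an initialization; a supervised fine-tuning pass over the whole stack is a separate step and is not shown.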

Citation impact

Total citations: 4,696

FWCI: 41.38
Percentile: 100%
References: 16

Topics & keywords

Keywords

Training (meteorology)
Layer (electronics)
Computer science
Artificial intelligence
Geography
Materials science
Composite material
Meteorology
