A Fast Learning Algorithm for Deep Belief Nets
University of Toronto · National University of Singapore
Abstract
We show how to use "complementary priors" to eliminate the explaining-away effects that make inference difficult in densely connected belief nets that have many hidden layers. Using complementary priors, we derive a fast, greedy algorithm that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory. The fast, greedy algorithm is used to initialize a slower learning procedure that fine-tunes the weights using a contrastive version of the wake-sleep algorithm. After fine-tuning, a network with three hidden layers forms a very good generative model of the joint distribution of handwritten digit images and their labels. This generative model…
Citation impact
- FWCI
- 81.10
- Percentile
- 100%
- References
- 31
Authors
3Topics & keywords
- Computer science
- Associative property
- Prior probability
- Discriminative model
- Generative model
- Content-addressable memory
- Artificial intelligence
- Inference
- Reduced inequalities