articleApr 15, 2009Closed access
Deep Boltzmann machines
Abstract
We present a new learning algorithm for Boltz-mann machines that contain many layers of hid-den variables. Data-dependent expectations are estimated using a variational approximation that tends to focus on a single mode, and data-independent expectations are approximated us-ing persistent Markov chains. The use of two quite different techniques for estimating the two types of expectation that enter into the gradient of the log-likelihood makes it practical to learn Boltzmann machines with multiple hidden lay-ers and millions of parameters. The learning can be made more efficient by using a layer-by-layer “pre-training ” phase that allows variational in-ference to be initialized with a single bottom-up pass. We…
Citation impact
1,774
total citations
- FWCI
- 38.39
- Percentile
- 100%
- References
- 21
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Boltzmann machine
- MNIST database
- Restricted Boltzmann machine
- Focus (optics)
- Computer science
- Inference
- Artificial intelligence
- Boltzmann constant
No related works found for this paper.