articleApr 15, 2009Closed access

Deep Boltzmann machines

University of Toronto

Abstract

We present a new learning algorithm for Boltz-mann machines that contain many layers of hid-den variables. Data-dependent expectations are estimated using a variational approximation that tends to focus on a single mode, and data-independent expectations are approximated us-ing persistent Markov chains. The use of two quite different techniques for estimating the two types of expectation that enter into the gradient of the log-likelihood makes it practical to learn Boltzmann machines with multiple hidden lay-ers and millions of parameters. The learning can be made more efficient by using a layer-by-layer “pre-training ” phase that allows variational in-ference to be initialized with a single bottom-up pass. We…

Citation impact

1,774
total citations
FWCI
38.39
Percentile
100%
References
21
Citations per year

Authors

2

Topics & keywords

Keywords
  • Boltzmann machine
  • MNIST database
  • Restricted Boltzmann machine
  • Focus (optics)
  • Computer science
  • Inference
  • Artificial intelligence
  • Boltzmann constant
No related works found for this paper.