Deep Boltzmann machines

Salakhutdinov, Ruslan; Hinton, Geoffrey E.

articleApr 15, 2009Closed access

Deep Boltzmann machines

RSRuslan Salakhutdinov GEGeoffrey E. Hinton

Abstract

We present a new learning algorithm for Boltz-mann machines that contain many layers of hid-den variables. Data-dependent expectations are estimated using a variational approximation that tends to focus on a single mode, and data-independent expectations are approximated us-ing persistent Markov chains. The use of two quite different techniques for estimating the two types of expectation that enter into the gradient of the log-likelihood makes it practical to learn Boltzmann machines with multiple hidden lay-ers and millions of parameters. The learning can be made more efficient by using a layer-by-layer “pre-training ” phase that allows variational in-ference to be initialized with a single bottom-up pass. We…

Citation impact

1,774

total citations

FWCI: 38.39
Percentile: 100%
References: 21

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Boltzmann machine
MNIST database
Restricted Boltzmann machine
Focus (optics)
Computer science
Inference
Artificial intelligence
Boltzmann constant

No related works found for this paper.