Bayesian Learning via Stochastic Gradient Langevin Dynamics

Welling, Max; Teh, Yee Whye

articleJan 1, 2011Closed access

Bayesian Learning via Stochastic Gradient Langevin Dynamics

University of California, Irvine · Oxford Centre for Computational Neuroscience · +2 more institutions

Abstract

In this paper we propose a new framework for learning from large scale datasets based on iterative learning from small mini-batches. By adding the right amount of noise to a standard stochastic gradient optimization algorithm we show that the iterates will converge to samples from the true posterior distribution as we anneal the stepsize. This seamless transition between optimization and Bayesian posterior sampling provides an inbuilt protection against overfitting. We also propose a practical method for Monte Carlo estimates of posterior statistics which monitors a "sampling threshold" and collects samples after it has been surpassed. We apply the method to three models: a mixture of Gaussians, logistic…

Citation impact

1,497

total citations

FWCI: 28.06
Percentile: 100%
References: 13

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Overfitting
Posterior probability
Computer science
Iterated function
Bayesian probability
Monte Carlo method
Gibbs sampling
Artificial intelligence

UN Sustainable Development Goals

Life below water

No related works found for this paper.