Abstract
We introduce a new, efficient, principled and backpropagation-compatible algorithm for learning a probability distribution on the weights of a neural network, called Bayes by Backprop. It regularises the weights by minimising a compression cost, known as the variational free energy or the expected lower bound on the marginal likelihood. We show that this principled kind of regularisation yields comparable performance to dropout on MNIST classification. We then demonstrate how the learnt uncertainty in the weights can be used to improve generalisation in non-linear regression problems, and how this weight uncertainty can be used to drive the exploration-exploitation trade-off in reinforcement learning.
Citation impact
- FWCI
- 18.15
- Percentile
- 100%
- References
- 40
Authors
4- CBCharles BlundellCorresponding
Google (United States), Google DeepMind (United Kingdom)
- JCJulien Cornebise
Google (United States), Google DeepMind (United Kingdom)
- KKKoray Kavukcuoglu
Google (United States), Google DeepMind (United Kingdom)
- DWDaan Wierstra
Google (United States), Google DeepMind (United Kingdom)
Topics & keywords
- MNIST database
- Dropout (neural networks)
- Backpropagation
- Artificial neural network
- Computer science
- Artificial intelligence
- Reinforcement learning
- Upper and lower bounds
- Affordable and clean energy