The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables

University of Toronto · Google (United States) · +1 more institution

Abstract

The reparameterization trick enables optimizing large scale stochastic computation graphs via gradient descent. The essence of the trick is to refactor each stochastic node into a differentiable function of its parameters and a random variable with fixed distribution. After refactoring, the gradients of the loss propagated by the chain rule through the graph are low variance unbiased estimators of the gradients of the expected loss. While many continuous random variables have such reparameterizations, discrete random variables lack useful reparameterizations due to the discontinuous nature of discrete states. In this work we introduce CONCRETE random variables—CONtinuous relaxations of disCRETE random…

Citation impact

622
total citations
FWCI
Percentile
References
36
Citations per year

Authors

3

Topics & keywords

Keywords
  • Random variable
  • Random graph
  • Computation
  • Applied mathematics
  • Discrete-time stochastic process
  • Estimator
  • Mathematics
  • Differentiable function
No related works found for this paper.