preprintarXiv (Cornell University)Nov 3, 2016GREEN OA

Categorical Reparameterization with Gumbel-Softmax

Indexed inarxivdatacite

Abstract

Categorical variables are a natural choice for representing discrete structure in the world. However, stochastic neural networks rarely use categorical latent variables due to the inability to backpropagate through samples. In this work, we present an efficient gradient estimator that replaces the non-differentiable sample from a categorical distribution with a differentiable sample from a novel Gumbel-Softmax distribution. This distribution has the essential property that it can be smoothly annealed into a categorical distribution. We show that our Gumbel-Softmax estimator outperforms state-of-the-art gradient estimators on structured output prediction and unsupervised generative modeling tasks with…

Citation impact

3,230
total citations
FWCI
Percentile
References
0
Citations per year

Authors

3

Topics & keywords

Keywords
  • Categorical variable
  • Gumbel distribution
  • Softmax function
  • Estimator
  • Latent variable
  • Artificial intelligence
  • Categorical distribution
  • Mathematics
No related works found for this paper.