Categorical Reparameterization with Gumbel-Softmax

11/03/2016
by   Eric Jang, et al.
Google
University of Cambridge
Stanford University
0

Categorical variables are a natural choice for representing discrete structure in the world. However, stochastic neural networks rarely use categorical latent variables due to the inability to backpropagate through samples. In this work, we present an efficient gradient estimator that replaces the non-differentiable sample from a categorical distribution with a differentiable sample from a novel Gumbel-Softmax distribution. This distribution has the essential property that it can be smoothly annealed into a categorical distribution. We show that our Gumbel-Softmax estimator outperforms state-of-the-art gradient estimators on structured output prediction and unsupervised generative modeling tasks with categorical latent variables, and enables large speedups on semi-supervised classification.

READ FULL TEXT

page 1

page 2

page 3

page 4

07/03/2020

Efficient Marginalization of Discrete and Structured Latent Variables via Sparsity

Training neural network models with discrete (categorical or structured)...
05/09/2022

ReCAB-VAE: Gumbel-Softmax Variational Inference Based on Analytic Divergence

The Gumbel-softmax distribution, or Concrete distribution, is often used...
09/08/2022

Stochastic gradient descent with gradient estimator for categorical features

Categorical data are present in key areas such as health or supply chain...
06/27/2019

Efficient algorithms for modifying and sampling from a categorical distribution

Probabilistic programming languages and other machine learning applicati...
07/08/2020

StructureBoost: Efficient Gradient Boosting for Structured Categorical Variables

Gradient boosting methods based on Structured Categorical Decision Trees...
04/17/2023

Bridging Discrete and Backpropagation: Straight-Through and Beyond

Backpropagation, the cornerstone of deep learning, is limited to computi...
06/09/2020

A t-distribution based operator for enhancing out of distribution robustness of neural network classifiers

Neural Network (NN) classifiers can assign extreme probabilities to samp...

Code Repositories

gumbel-softmax

categorical variational autoencoder using the Gumbel-Softmax estimator


view repo

Please sign up or login with your details

Forgot password? Click here to reset