Rao-Blackwellizing the Straight-Through Gumbel-Softmax Gradient Estimator

10/09/2020
by   Max B. Paulus, et al.
8

Gradient estimation in models with discrete latent variables is a challenging problem, because the simplest unbiased estimators tend to have high variance. To counteract this, modern estimators either introduce bias, rely on multiple function evaluations, or use learned, input-dependent baselines. Thus, there is a need for estimators that require minimal tuning, are computationally cheap, and have low mean squared error. In this paper, we show that the variance of the straight-through variant of the popular Gumbel-Softmax estimator can be reduced through Rao-Blackwellization without increasing the number of function evaluations. This provably reduces the mean squared error. We empirically demonstrate that this leads to variance reduction, faster convergence, and generally improved performance in two unsupervised latent variable models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/16/2018

Optimal mean squared error bandwidth for spectral variance estimators in MCMC simulations

This paper proposes optimal mean squared error bandwidths for a family o...
research
06/15/2020

Gradient Estimation with Stochastic Softmax Tricks

The Gumbel-Max trick is the basis of many relaxed gradient estimators. T...
research
06/10/2019

Stretching the Effectiveness of MLE from Accuracy to Bias for Pairwise Comparisons

A number of applications (e.g., AI bot tournaments, sports, peer grading...
research
08/12/2018

A Fourier View of REINFORCE

We show a connection between the Fourier spectrum of Boolean functions a...
research
10/24/2021

Learning to Estimate Without Bias

We consider the use of deep learning for parameter estimation. We propos...
research
12/10/2019

Variable selection for transportability

Transportability provides a principled framework to address the problem ...
research
12/07/2018

Multitaper estimation on arbitrary domains

Multitaper estimators have enjoyed significant success in providing spec...

Please sign up or login with your details

Forgot password? Click here to reset