DBsurf: A Discrepancy Based Method for Discrete Stochastic Gradient Estimation

09/07/2023
by   Pau Mulet Arabi, et al.
0

Computing gradients of an expectation with respect to the distributional parameters of a discrete distribution is a problem arising in many fields of science and engineering. Typically, this problem is tackled using Reinforce, which frames the problem of gradient estimation as a Monte Carlo simulation. Unfortunately, the Reinforce estimator is especially sensitive to discrepancies between the true probability distribution and the drawn samples, a common issue in low sampling regimes that results in inaccurate gradient estimates. In this paper, we introduce DBsurf, a reinforce-based estimator for discrete distributions that uses a novel sampling procedure to reduce the discrepancy between the samples and the actual distribution. To assess the performance of our estimator, we subject it to a diverse set of tasks. Among existing estimators, DBsurf attains the lowest variance in a least squares problem commonly used in the literature for benchmarking. Furthermore, DBsurf achieves the best results for training variational auto-encoders (VAE) across different datasets and sampling setups. Finally, we apply DBsurf to build a simple and efficient Neural Architecture Search (NAS) algorithm with state-of-the-art performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/19/2022

Gradient Estimation with Discrete Stein Operators

Gradient estimation – approximating the gradient of an expectation with ...
research
06/10/2020

Multi-index Antithetic Stochastic Gradient Algorithm

Stochastic Gradient Algorithms (SGAs) are ubiquitous in computational st...
research
02/14/2020

Estimating Gradients for Discrete Random Variables by Sampling without Replacement

We derive an unbiased estimator for expectations over discrete random va...
research
10/04/2022

SIMPLE: A Gradient Estimator for k-Subset Sampling

k-subset sampling is ubiquitous in machine learning, enabling regulariza...
research
09/11/2022

Adaptive Perturbation-Based Gradient Estimation for Discrete Latent Variable Models

The integration of discrete algorithmic components in deep learning arch...
research
12/09/2021

Times Square sampling: an adaptive algorithm for free energy estimation

Estimating free energy differences, an important problem in computationa...
research
05/28/2021

ARMS: Antithetic-REINFORCE-Multi-Sample Gradient for Binary Variables

Estimating the gradients for binary variables is a task that arises freq...

Please sign up or login with your details

Forgot password? Click here to reset