Automatic Differentiation Variational Inference with Mixtures

03/03/2020
by Warren R. Morningstar, et al.

Automatic Differentiation Variational Inference (ADVI) is a useful tool for efficiently learning probabilistic models in machine learning. Generally, the approximate posteriors learned by ADVI are forced to be unimodal in order to facilitate use of the reparameterization trick. In this paper, we show how stratified sampling may be used to enable mixture distributions as the approximate posterior, and derive a new lower bound on the evidence analogous to the importance weighted autoencoder (IWAE). We show that this “SIWAE” is a tighter bound than both IWAE and the traditional ELBO, both of which are special instances of this bound. We verify empirically that the traditional ELBO objective disfavors multimodal posterior distributions and may therefore fail to fully capture structure in the latent space. Our experiments show that the SIWAE objective allows the encoder to learn more complex distributions that are frequently multimodal, resulting in higher accuracy and better calibration in the presence of incomplete, limited, or corrupted data.
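The abstract does not spell out the bound, so the following is only a minimal sketch of one stratified, importance-weighted estimator consistent with its description. It assumes the bound takes the form log (1/T) Σ_t Σ_k α_k p(x, z_kt) / q(z_kt | x), where z_kt is a reparameterized draw from mixture component k and q is the full mixture. The toy model (standard normal prior, unit-variance Gaussian likelihood), the function names, and the one-dimensional latent are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def log_normal(x, mean, std):
    """Log density of a univariate normal, broadcasting over inputs."""
    return -0.5 * np.log(2.0 * np.pi) - np.log(std) - 0.5 * ((x - mean) / std) ** 2


def logsumexp(a, axis=None):
    """Numerically stable log(sum(exp(a))) along `axis` (all axes if None)."""
    m = np.max(a, axis=axis, keepdims=True)
    s = m + np.log(np.sum(np.exp(a - m), axis=axis, keepdims=True))
    return np.squeeze(s, axis=axis)


def siwae_estimate(x, alpha, means, stds, T, rng):
    """Single stochastic estimate of the assumed stratified bound.

    Assumed toy model: z ~ N(0, 1), x | z ~ N(z, 1).
    Approximate posterior: q(z | x) = sum_k alpha[k] * N(z; means[k], stds[k]).
    """
    alpha, means, stds = (np.asarray(v, dtype=float) for v in (alpha, means, stds))
    K = means.shape[0]
    log_alpha = np.log(alpha)

    # Stratified sampling: T reparameterized draws from EVERY component,
    # so no discrete component index is ever sampled.
    eps = rng.standard_normal((T, K))
    z = means + stds * eps  # shape (T, K); column k comes from component k

    # Density of each draw under the FULL mixture, not just its own component.
    log_q_comp = log_normal(z[..., None], means, stds)  # shape (T, K, K)
    log_q = logsumexp(log_alpha + log_q_comp, axis=-1)  # shape (T, K)

    # Joint density of the assumed toy model.
    log_joint = log_normal(z, 0.0, 1.0) + log_normal(x, z, 1.0)

    # Weight each stratum by its mixture weight, then average over the T draws.
    log_w = log_alpha + log_joint - log_q
    return float(logsumexp(log_w) - np.log(T))
```

A simple sanity check under these assumptions: for this conjugate toy model the exact posterior is N(x/2, 1/2), so if every component equals it, each importance weight equals p(x) and `siwae_estimate` should return log N(x; 0, 2) up to floating-point error, for any mixture weights and any T. With K = 1 the expression reduces to an IWAE-style estimate, and with K = 1 and T = 1 to a single-sample ELBO estimate.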


research
05/08/2019

Importance Weighted Hierarchical Variational Inference

Variational Inference is a powerful tool in the Bayesian modeling toolki...
research
03/27/2017

Sticking the Landing: Simple, Lower-Variance Gradient Estimators for Variational Inference

We propose a simple and general variant of the standard reparameterized ...
research
09/06/2018

Improving Explorability in Variational Inference with Annealed Variational Objectives

Despite the advances in the representational capacity of approximate dis...
research
06/11/2021

Posterior Temperature Optimization in Variational Inference

Cold posteriors have been reported to perform better in practice in the ...
research
06/22/2022

Doubly Reparameterized Importance Weighted Structure Learning for Scene Graph Generation

As a structured prediction task, scene graph generation, given an input ...
research
05/14/2022

Importance Weighted Structure Learning for Scene Graph Generation

Scene graph generation is a structured prediction task aiming to explici...
research
01/14/2019

Remarks on stochastic automatic adjoint differentiation and financial models calibration

In this work, we discuss the Automatic Adjoint Differentiation (AAD) for...
