Unbiased Learning of Deep Generative Models with Structured Discrete Representations

06/14/2023
by   Harry Bendekgey, et al.
0

By composing graphical models with deep learning architectures, we learn generative models with the strengths of both frameworks. The structured variational autoencoder (SVAE) inherits structure and interpretability from graphical models, and flexible likelihoods for high-dimensional data from deep learning, but poses substantial optimization challenges. We propose novel algorithms for learning SVAEs, and are the first to demonstrate the SVAE's ability to handle multimodal uncertainty when data is missing by incorporating discrete latent variables. Our memory-efficient implicit differentiation scheme makes the SVAE tractable to learn via gradient descent, while demonstrating robustness to incomplete optimization. To more rapidly learn accurate graphical model parameters, we derive a method for computing natural gradients without manual derivations, which avoids biases found in prior work. These optimization innovations enable the first comparisons of the SVAE to state-of-the-art time series models, where the SVAE performs competitively while learning interpretable and structured discrete data representations.

READ FULL TEXT

page 2

page 8

page 16

research
11/22/2016

Inducing Interpretable Representations with Variational Autoencoders

We develop a framework for incorporating structured graphical models in ...
research
05/28/2018

Flexible and accurate inference and learning for deep generative models

We introduce a new approach to learning in hierarchical latent-variable ...
research
04/09/2013

High-dimensional Mixed Graphical Models

While graphical models for continuous data (Gaussian graphical models) a...
research
09/12/2022

Amortised Inference in Structured Generative Models with Explaining Away

A key goal of unsupervised learning is to go beyond density estimation a...
research
05/14/2019

NGO-GM: Natural Gradient Optimization for Graphical Models

This paper deals with estimating model parameters in graphical models. W...
research
03/14/2019

Deep Switch Networks for Generating Discrete Data and Language

Multilayer switch networks are proposed as artificial generators of high...
research
10/20/2022

Graphically Structured Diffusion Models

We introduce a framework for automatically defining and learning deep ge...

Please sign up or login with your details

Forgot password? Click here to reset