Causal Graphs Underlying Generative Models: Path to Learning with Limited Data

07/14/2022
by   Samuel C. Hoffman, et al.
4

Training generative models that capture rich semantics of the data and interpreting the latent representations encoded by such models are very important problems in unsupervised learning. In this work, we provide a simple algorithm that relies on perturbation experiments on latent codes of a pre-trained generative autoencoder to uncover a causal graph that is implied by the generative model. We leverage pre-trained attribute classifiers and perform perturbation experiments to check for influence of a given latent variable on a subset of attributes. Given this, we show that one can fit an effective causal graph that models a structural equation model between latent codes taken as exogenous variables and attributes taken as observed variables. One interesting aspect is that a single latent variable controls multiple overlapping subsets of attributes unlike conventional approach that tries to impose full independence. Using a pre-trained RNN-based generative autoencoder trained on a dataset of peptide sequences, we demonstrate that the learnt causal graph from our algorithm between various attributes and latent codes can be used to predict a specific property for sequences which are unseen. We compare prediction models trained on either all available attributes or only the ones in the Markov blanket and empirically show that in both the unsupervised and supervised regimes, typically, using the predictor that relies on Markov blanket attributes generalizes better for out-of-distribution sequences.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/15/2020

Exponential Tilting of Generative Models: Improving Sample Quality by Training and Sampling from Latent Energy

In this paper, we present a general method that can improve the sample q...
research
09/18/2023

Learning to Generate Lumped Hydrological Models

In a lumped hydrological model structure, the hydrological function of a...
research
07/26/2020

Data-efficient visuomotor policy training using reinforcement learning and generative models

We present a data-efficient framework for solving deep visuomotor sequen...
research
04/18/2022

Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models

We present a data-efficient framework for solving sequential decision-ma...
research
02/27/2017

Boosted Generative Models

We propose a new approach for using unsupervised boosting to create an e...
research
09/18/2021

PluGeN: Multi-Label Conditional Generation From Pre-Trained Models

Modern generative models achieve excellent quality in a variety of tasks...
research
05/15/2023

Causal Analysis for Robust Interpretability of Neural Networks

Interpreting the inner function of neural networks is crucial for the tr...

Please sign up or login with your details

Forgot password? Click here to reset