Contrastive Mixture of Posteriors for Counterfactual Inference, Data Integration and Fairness

06/15/2021
by   Adam Foster, et al.
0

Learning meaningful representations of data that can address challenges such as batch effect correction, data integration and counterfactual inference is a central problem in many domains including computational biology. Adopting a Conditional VAE framework, we identify the mathematical principle that unites these challenges: learning a representation that is marginally independent of a condition variable. We therefore propose the Contrastive Mixture of Posteriors (CoMP) method that uses a novel misalignment penalty to enforce this independence. This penalty is defined in terms of mixtures of the variational posteriors themselves, unlike prior work which uses external discrepancy measures such as MMD to ensure independence in latent space. We show that CoMP has attractive theoretical properties compared to previous approaches, especially when there is complex global structure in latent space. We further demonstrate state of the art performance on a number of real-world problems, including the challenging tasks of aligning human tumour samples with cancer cell-lines and performing counterfactual inference on single-cell RNA sequencing data. Incidentally, we find parallels with the fair representation learning literature, and demonstrate CoMP has competitive performance in learning fair yet expressive latent representations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/19/2022

Disentangled Representation with Causal Constraints for Counterfactual Fairness

Much research has been devoted to the problem of learning fair represent...
research
06/17/2022

Learning Fair Representation via Distributional Contrastive Disentanglement

Learning fair representation is crucial for achieving fairness or debias...
research
03/15/2023

DualFair: Fair Representation Learning at Both Group and Individual Levels via Contrastive Self-supervision

Algorithmic fairness has become an important machine learning problem, e...
research
09/15/2022

Fair Inference for Discrete Latent Variable Models

It is now well understood that machine learning models, trained on data ...
research
05/12/2016

Learning Representations for Counterfactual Inference

Observational studies are rising in importance due to the widespread acc...
research
05/22/2018

Information Constraints on Auto-Encoding Variational Bayes

Parameterizing the approximate posterior of a generative model with neur...
research
11/03/2015

The Variational Fair Autoencoder

We investigate the problem of learning representations that are invarian...

Please sign up or login with your details

Forgot password? Click here to reset