Learning Invariant Representations with Missing Data

12/01/2021
by   Mark Goldstein, et al.
0

Spurious correlations allow flexible models to predict well during training but poorly on related test populations. Recent work has shown that models that satisfy particular independencies involving correlation-inducing nuisance variables have guarantees on their test performance. Enforcing such independencies requires nuisances to be observed during training. However, nuisances, such as demographics or image background labels, are often missing. Enforcing independence on just the observed data does not imply independence on the entire population. Here we derive mmd estimators used for invariance objectives under missing nuisances. On simulations and clinical data, optimizing through these estimates achieves test performance similar to using estimators that make use of the full data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/15/2016

Recoverability of Joint Distribution from Missing Data

A probabilistic query may not be estimable from observed data corrupted ...
research
07/11/2018

Causal discovery in the presence of missing data

Missing data are ubiquitous in many domains such as healthcare. Dependin...
research
10/19/2021

Riemannian classification of EEG signals with missing values

This paper proposes two strategies to handle missing data for the classi...
research
06/10/2023

Sufficient Identification Conditions and Semiparametric Estimation under Missing Not at Random Mechanisms

Conducting valid statistical analyses is challenging in the presence of ...
research
01/22/2021

Revisiting Identifying Assumptions for Population Size Estimation

The problem of estimating the size of a population based on a subset of ...
research
04/03/2021

Training Deep Normalizing Flow Models in Highly Incomplete Data Scenarios with Prior Regularization

Deep generative frameworks including GANs and normalizing flow models ha...
research
02/28/2019

Deductive semiparametric estimation in Double-Sampling Designs with application to PEPFAR

Robust estimators in missing data problems often use semiparametric esti...

Please sign up or login with your details

Forgot password? Click here to reset