Generalization and Invariances in the Presence of Unobserved Confounding

07/21/2020
by   Alexis Bellot, et al.
8

The ability to extrapolate, or generalize, from observed to new related environments is central to any form of reliable machine learning, yet most methods fail when moving beyond i.i.d data. In some cases, the reason lies in a misappreciation of the causal structure that governs the observed data. But, in others, it is unobserved data, such as hidden confounders, that drive changes in observed distributions and distort observed correlations. In this paper, we argue that generalization must be defined with respect to a broader class of distribution shifts, irrespective of their origin (arising from changes in observed, unobserved or target variables). We propose a new learning principle from which we may expect an explicit notion of generalization to certain new environments, even in the presence of hidden confounding. This principle leads us to formulate a general objective that may be paired with any gradient-based learning algorithm; algorithms that have a causal interpretation in some cases and enjoy notions of predictive stability in others. We demonstrate the empirical performance of our approach on healthcare data from different modalities, including image and speech data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/28/2021

Deconfounded Score Method: Scoring DAGs with Dense Unobserved Confounding

Unobserved confounding is one of the greatest challenges for causal disc...
research
07/29/2022

Treatment Effect Estimation with Unobserved and Heterogeneous Confounding Variables

The estimation of the treatment effect is often biased in the presence o...
research
05/27/2022

Combining observational datasets from multiple environments to detect hidden confounding

A common assumption in causal inference from observational data is the a...
research
04/16/2023

Out-of-Variable Generalization

The ability of an agent to perform well in new and unseen environments i...
research
12/12/2020

On Proximal Causal Learning with Many Hidden Confounders

We generalize the proximal g-formula of Miao, Geng, and Tchetgen Tchetge...
research
09/22/2021

Causal Discovery in High-Dimensional Point Process Networks with Hidden Nodes

Thanks to technological advances leading to near-continuous time observa...
research
02/03/2022

Exploiting Independent Instruments: Identification and Distribution Generalization

Instrumental variable models allow us to identify a causal function betw...

Please sign up or login with your details

Forgot password? Click here to reset