Stable Prediction via Leveraging Seed Variable

06/09/2020
by   Kun Kuang, et al.
28

In this paper, we focus on the problem of stable prediction across unknown test data, where the test distribution is agnostic and might be totally different from the training one. In such a case, previous machine learning methods might exploit subtly spurious correlations in training data induced by non-causal variables for prediction. Those spurious correlations are changeable across data, leading to instability of prediction across data. By assuming the relationships between causal variables and response variable are invariant across data, to address this problem, we propose a conditional independence test based algorithm to separate those causal variables with a seed variable as priori, and adopt them for stable prediction. By assuming the independence between causal and non-causal variables, we show, both theoretically and with empirical experiments, that our algorithm can precisely separate causal and non-causal variables for stable prediction across test data. Extensive experiments on both synthetic and real-world datasets demonstrate that our algorithm outperforms state-of-the-art methods for stable prediction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/08/2020

Balance-Subsampled Stable Prediction

In machine learning, it is commonly assumed that training and test data ...
research
06/16/2018

Stable Prediction across Unknown Environments

In many important machine learning applications, the training distributi...
research
12/02/2022

Stable Learning via Sparse Variable Independence

The problem of covariate-shift generalization has attracted intensive re...
research
11/29/2022

Towards Dynamic Causal Discovery with Rare Events: A Nonparametric Conditional Independence Test

Causal phenomena associated with rare events occur across a wide range o...
research
03/08/2021

Size-Invariant Graph Representations for Graph Classification Extrapolations

In general, graph representation learning methods assume that the test a...
research
10/07/2020

Exploiting non-i.i.d. data towards more robust machine learning algorithms

In the field of machine learning there is a growing interest towards mor...

Please sign up or login with your details

Forgot password? Click here to reset