Returning The Favour: When Regression Benefits From Probabilistic Causal Knowledge

01/26/2023
by Shahine Bouabid, et al.

A directed acyclic graph (DAG) provides valuable prior knowledge that is often discarded in regression tasks in machine learning. We show that the independences arising from the presence of collider structures in DAGs provide meaningful inductive biases, which constrain the regression hypothesis space and improve predictive performance. We introduce collider regression, a framework to incorporate probabilistic causal knowledge from a collider in a regression problem. When the hypothesis space is a reproducing kernel Hilbert space, we prove a strictly positive generalisation benefit under mild assumptions and provide closed-form estimators of the empirical risk minimiser. Experiments on synthetic and climate model data demonstrate performance gains of the proposed methodology.
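
The independence that a collider implies can be checked numerically. The sketch below is not the collider regression method from the paper. It is a minimal simulation, assuming a hypothetical linear-Gaussian collider A → C ← B, in which the names `a`, `b`, `c` and the function `collider_correlations` are illustrative choices. It shows that the two parents are marginally uncorrelated, yet become strongly correlated once the collider is accounted for.

```python
import numpy as np


def collider_correlations(n=100_000, noise_std=0.5, seed=0):
    """Simulate a collider A -> C <- B and return two correlations.

    Assumed (hypothetical) model: A and B are independent standard
    normals and C = A + B + noise. Returns the marginal correlation of
    A and B, and their partial correlation after linearly regressing
    out C.
    """
    rng = np.random.default_rng(seed)
    a = rng.standard_normal(n)
    b = rng.standard_normal(n)
    c = a + b + noise_std * rng.standard_normal(n)

    # Marginal dependence between the two parents of the collider.
    marginal = np.corrcoef(a, b)[0, 1]

    # Residualise A and B on C (least squares with an intercept).
    design = np.column_stack([np.ones(n), c])
    res_a = a - design @ np.linalg.lstsq(design, a, rcond=None)[0]
    res_b = b - design @ np.linalg.lstsq(design, b, rcond=None)[0]

    # Dependence between the parents once the collider is controlled for.
    partial = np.corrcoef(res_a, res_b)[0, 1]
    return marginal, partial


if __name__ == "__main__":
    marginal, partial = collider_correlations()
    print(f"marginal corr(A, B):    {marginal:+.3f}")
    print(f"partial corr(A, B | C): {partial:+.3f}")
```

Under these assumptions the marginal correlation should be close to zero and the partial correlation clearly negative. That marginal independence is the kind of structural fact a regression model could, in principle, be constrained to respect.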


research
09/09/2020

Consistency and Regression with Laplacian regularization in Reproducing Kernel Hilbert Space

This note explains a way to look at reproducing kernel Hilbert space for...
research
06/27/2012

Incorporating Causal Prior Knowledge as Path-Constraints in Bayesian Networks and Maximal Ancestral Graphs

We consider the incorporation of causal knowledge about the presence or ...
research
06/01/2019

Kernel Instrumental Variable Regression

Instrumental variable regression is a strategy for learning causal relat...
research
06/27/2018

Distribution regression model with a Reproducing Kernel Hilbert Space approach

In this paper, we introduce a new distribution regression model for prob...
research
04/19/2021

Robust Uncertainty Bounds in Reproducing Kernel Hilbert Spaces: A Convex Optimization Approach

Let a labeled dataset be given with scattered samples and consider the h...
research
02/27/2021

Incorporating Causal Graphical Prior Knowledge into Predictive Modeling via Simple Data Augmentation

Causal graphs (CGs) are compact representations of the knowledge of the ...
research
01/28/2019

An analytic formulation for positive-unlabeled learning via weighted integral probability metric

We consider the problem of learning a binary classifier from only positi...
