Markov Boundary Discovery with Ridge Regularized Linear Models

09/14/2015
by Eric V. Strobl, et al.

Ridge regularized linear models (RRLMs), such as ridge regression and the SVM, are a popular group of methods used together with coefficient hypothesis testing to discover explanatory variables that have a significant multivariate association with a response. However, many investigators are reluctant to interpret the selected variables causally because the capabilities of RRLMs in causal inference are incompletely understood. Under reasonable assumptions, we show that a modified form of RRLMs can get very close to identifying a subset of the Markov boundary by providing a worst-case bound on the space of possible solutions. The results hold for any convex loss, even when the underlying functional relationship is nonlinear and the solution is not unique. Our approach combines ideas from Markov boundary theory and sufficient dimension reduction theory. Experimental results show that the modified RRLMs are competitive with state-of-the-art algorithms in discovering part of the Markov boundary from gene expression data.
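As a rough illustration of the baseline workflow the abstract describes (an ordinary RRLM combined with coefficient hypothesis testing), the sketch below fits closed-form ridge regression and scores each coefficient with a simple permutation test. It is not the modified RRLM proposed in the paper, and the test is a generic one chosen for brevity. The function names, the penalty `lam`, the number of permutations, and the synthetic data are all assumptions made for this example.

```python
import numpy as np

def ridge_fit(X, y, lam=1.0):
    """Closed-form ridge regression on centered data (no intercept term)."""
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    A = Xc.T @ Xc + lam * np.eye(Xc.shape[1])
    return np.linalg.solve(A, Xc.T @ yc)

def permutation_pvalues(X, y, lam=1.0, n_perm=200, seed=0):
    """Per-coefficient p-values from permuting the response.

    Illustrative assumption: shuffling y breaks any association with X,
    so the refitted |coefficients| approximate a null distribution.
    """
    rng = np.random.default_rng(seed)
    observed = np.abs(ridge_fit(X, y, lam))
    exceed = np.zeros_like(observed)
    for _ in range(n_perm):
        null_coef = np.abs(ridge_fit(X, rng.permutation(y), lam))
        exceed += null_coef >= observed
    # Add-one correction keeps p-values strictly positive.
    return (1.0 + exceed) / (1.0 + n_perm)

# Synthetic data (hypothetical): only the first variable drives the response.
data_rng = np.random.default_rng(1)
X = data_rng.standard_normal((200, 5))
y = 2.0 * X[:, 0] + data_rng.standard_normal(200)

pvals = permutation_pvalues(X, y)
selected = np.flatnonzero(pvals < 0.05)  # variables flagged as associated
print(pvals, selected)
```

Variables flagged this way have a significant multivariate association with the response under the ridge fit. Whether they belong to the Markov boundary is the question the paper addresses, and this sketch makes no such claim.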

