Learning to Increase the Power of Conditional Randomization Tests

07/03/2022
by Shalev Shaer, et al.

The model-X conditional randomization test is a generic framework for conditional independence testing, unlocking new possibilities to discover features that are conditionally associated with a response of interest while controlling type-I error rates. An appealing advantage of this test is that it can work with any machine learning model to design powerful test statistics. Accordingly, the common practice in the model-X literature is to form a test statistic from machine learning models trained to maximize predictive accuracy, in the hope of attaining a test with good power. However, the ideal goal is to drive the model, during training, to maximize the power of the test, not merely its predictive accuracy. In this paper, we bridge this gap by introducing, for the first time, model-fitting schemes that are designed to explicitly improve the power of model-X tests. This is done by introducing a new cost function that aims at maximizing the test statistic used to measure violations of conditional independence. Using synthetic and real data sets, we demonstrate that combining our proposed loss function with various base predictive models (lasso, elastic net, and deep neural networks) consistently increases the number of correct discoveries, while keeping type-I error rates under control.
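
For readers unfamiliar with the conditional randomization test itself, the following is a minimal sketch of the generic procedure the abstract builds on, not the paper's proposed training scheme or loss function. It rests on illustrative assumptions: the features are independent standard normals, so the conditional distribution of feature j given the others is simply N(0, 1), and the test statistic is the absolute least-squares coefficient. The names `crt_pvalue`, `abs_ols_coef`, and `sample_independent_gaussian` are hypothetical helpers introduced here.

```python
import numpy as np


def crt_pvalue(X, y, j, sample_xj, statistic, n_draws=200, rng=None):
    """Conditional randomization test p-value for feature j.

    sample_xj(X, j, rng) must draw a fresh copy of column j from its
    (assumed known) conditional distribution given the other columns.
    statistic(X, y, j) returns a scalar; larger means more evidence
    against conditional independence.
    """
    rng = np.random.default_rng(rng)
    t_obs = statistic(X, y, j)

    X_tilde = X.copy()  # work on a copy so the caller's data is untouched
    n_exceed = 0
    for _ in range(n_draws):
        # Replace column j with a dummy copy that is independent of y
        # given the remaining features, then recompute the statistic.
        X_tilde[:, j] = sample_xj(X, j, rng)
        if statistic(X_tilde, y, j) >= t_obs:
            n_exceed += 1

    # The +1 terms count the observed statistic among the draws.
    return (1 + n_exceed) / (1 + n_draws)


def abs_ols_coef(X, y, j):
    """Illustrative statistic: |least-squares coefficient| of feature j."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return abs(coef[j])


def sample_independent_gaussian(X, j, rng):
    """Assumption: features are i.i.d. N(0, 1), so X_j | X_-j is N(0, 1)."""
    return rng.standard_normal(X.shape[0])


# Toy usage: only feature 0 influences the response.
data_rng = np.random.default_rng(0)
X = data_rng.standard_normal((300, 5))
y = 2.0 * X[:, 0] + data_rng.standard_normal(300)

p_signal = crt_pvalue(X, y, 0, sample_independent_gaussian, abs_ols_coef, rng=1)
p_null = crt_pvalue(X, y, 1, sample_independent_gaussian, abs_ols_coef, rng=2)
print(f"p-value, feature 0 (non-null): {p_signal:.4f}")
print(f"p-value, feature 1 (null):     {p_null:.4f}")
```

In this sketch the statistic is an arbitrary callable, which is where any fitted machine learning model could be plugged in. The abstract's point is that the way such a model is trained affects the power of the resulting test.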

research
08/18/2022

DIET: Conditional independence testing with marginal dependence measures of residual information

Conditional randomization tests (CRTs) assess whether a variable x is pr...
research
06/06/2020

Fast and Powerful Conditional Randomization Testing via Distillation

In relating a response variable Y to covariates (Z,X), a key question is...
research
07/31/2019

Conditional independence testing: a predictive perspective

Conditional independence testing is a key problem required by many machi...
research
11/08/2021

The Weighted Generalised Covariance Measure

We introduce a new test for conditional independence which is based on w...
research
07/14/2022

From Shapley back to Pearson: Hypothesis Testing via the Shapley Value

Machine learning models, in particular artificial neural networks, are i...
research
07/05/2023

Conditional independence testing under model misspecification

Conditional independence (CI) testing is fundamental and challenging in ...
research
05/17/2022

Individualized conditional independence testing under model-X with heterogeneous samples and interactions

Model-X knockoffs and the conditional randomization test are methods tha...
