ECKO: Ensemble of Clustered Knockoffs for multivariate inference on fMRI data

03/12/2019
by Tuan-Binh Nguyen, et al.

Continuous improvement in medical imaging techniques allows the acquisition of higher-resolution images. When these images are used in a predictive setting, a greater number of explanatory variables are potentially related to the dependent variable (the response). Meanwhile, the number of acquisitions per experiment remains limited. In such a high-dimension, small-sample-size setting, it is desirable to find the explanatory variables that are truly related to the response while controlling the rate of false discoveries. To achieve this goal, novel multivariate inference procedures, such as knockoff inference, have recently been proposed. However, they require the feature covariance to be well-defined, which is impossible in high-dimensional settings. In this paper, we propose a new algorithm, called Ensemble of Clustered Knockoffs (ECKO), that selects explanatory variables while controlling the false discovery rate (FDR), up to a prescribed spatial tolerance. The core idea is that knockoff-based inference can be applied to groups (clusters) of voxels, which drastically reduces the dimension of the problem; an ensembling step then removes the dependence on a fixed clustering and stabilizes the results. We benchmark this algorithm and other FDR-controlling methods on brain imaging datasets and observe empirical gains in sensitivity, while the false discovery rate is controlled at the nominal level.
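
To make the core idea concrete, here is a minimal Python sketch of two ingredients the abstract mentions: reducing voxels to cluster averages, and selecting clusters with an FDR-controlling knockoff threshold. It is an illustration under stated assumptions, not the authors' implementation. The function names (`cluster_average`, `knockoff_threshold`, `select_clusters`) are hypothetical, the threshold used is the standard knockoff+ rule of Barber and Candès (2015), and the knockoff statistics `W` are assumed to have been computed elsewhere. Knockoff generation and the ensembling step over several clusterings are not described in enough detail in the abstract and are deliberately left out.

```python
import numpy as np


def cluster_average(X, labels):
    """Replace each cluster of columns of X by its mean column.

    X has shape (n_samples, n_features); labels[j] is the cluster index of
    feature j. Assumption: labels take every value in 0..K-1 at least once.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    n_features = X.shape[1]
    n_clusters = labels.max() + 1
    # One-hot assignment matrix, with columns rescaled so that X @ A
    # gives per-cluster means rather than per-cluster sums.
    A = np.zeros((n_features, n_clusters))
    A[np.arange(n_features), labels] = 1.0
    A /= A.sum(axis=0, keepdims=True)
    return X @ A


def knockoff_threshold(W, q=0.1):
    """Knockoff+ threshold: the smallest t > 0 among the |W_j| such that
    (1 + #{j: W_j <= -t}) / max(1, #{j: W_j >= t}) <= q.
    Returns np.inf when no candidate satisfies the bound.
    """
    W = np.asarray(W, dtype=float)
    candidates = np.sort(np.abs(W[W != 0]))
    for t in candidates:
        fdp_estimate = (1 + np.sum(W <= -t)) / max(1, np.sum(W >= t))
        if fdp_estimate <= q:
            return t
    return np.inf


def select_clusters(W, q=0.1):
    """Indices of clusters whose statistic reaches the knockoff+ threshold."""
    W = np.asarray(W, dtype=float)
    return np.flatnonzero(W >= knockoff_threshold(W, q))
```

In this sketch, a large positive `W[k]` is taken to mean that cluster `k` looks more informative than its knockoff copy. Working on cluster averages means that a selected cluster localizes a signal only up to the spatial extent of that cluster, which is one way to read the "prescribed spatial tolerance" mentioned above.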

research
09/27/2022

False Discovery Rate Adjustments for Average Significance Level Controlling Tests

Multiple testing adjustments, such as the Benjamini and Hochberg (1995) ...
research
06/15/2018

Statistical Inference with Ensemble of Clustered Desparsified Lasso

Medical imaging involves high-dimensional data, yet their acquisition is...
research
02/10/2021

Bayesian Knockoff Filter Using Gibbs Sampler

In many fields, researchers are interested in discovering features with ...
research
11/21/2019

Controlling False Discovery Rate Using Gaussian Mirrors

Simultaneously finding multiple influential variables and controlling th...
research
05/02/2021

Directional FDR Control for Sub-Gaussian Sparse GLMs

High-dimensional sparse generalized linear models (GLMs) have emerged in...
research
04/28/2022

Controlling the False Discovery Rate via knockoffs: is the +1 needed?

Barber and Candès (2015) control of the FDR in feature selection relies ...
research
06/04/2021

Spatially relaxed inference on high-dimensional linear models

We consider the inference problem for high-dimensional linear models, wh...
