Model-Free, Monotone Invariant and Computationally Efficient Feature Screening with Data-adaptive Threshold

07/27/2022
by   Linsui Deng, et al.
0

Feature screening for ultrahigh-dimension, in general, proceeds with two essential steps. The first step is measuring and ranking the marginal dependence between response and covariates, and the second is determining the threshold. We develop a new screening procedure, called SIT-BY procedure, that possesses appealing statistical properties in both steps. By employing sliced independence estimates in the measuring and ranking stage, our proposed procedure requires no model assumptions, remains invariant to monotone transformation, and achieves almost linear computation complexity. Inspired by false discovery rate (FDR) control procedures, we offer a data-adaptive threshold benefit from the asymptotic normality of test statistics. Under moderate conditions, we demonstrate that our procedure can asymptotically control the FDR while maintaining the sure screening property. We investigate the finite sample performance of our proposed procedure via extensive simulations and an application to genome-wide dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/17/2023

Grouped feature screening for ultrahigh-dimensional classification via Gini distance correlation

Gini distance correlation (GDC) was recently proposed to measure the dep...
research
02/27/2020

False Discovery Rate Control Under General Dependence By Symmetrized Data Aggregation

We develop a new class of distribution–free multiple testing rules for f...
research
11/03/2019

Optimal two-stage testing of multiple mediators

Mediation analysis in high-dimensional settings often involves identifyi...
research
08/31/2022

Two-stage Hypothesis Tests for Variable Interactions with FDR Control

In many scenarios such as genome-wide association studies where dependen...
research
08/19/2019

Model-free Feature Screening and FDR Control with Knockoff Features

This paper proposes a model-free and data-adaptive feature screening met...
research
12/04/2018

On the sure screening property of the iterative sure independence screening algorithm

The iterative version of the sure independence screening algorithm (ISIS...
research
12/27/2022

Weak Signal Inclusion Under Dependence and Applications in Genome-wide Association Study

Motivated by the inquiries of weak signals in underpowered genome-wide a...

Please sign up or login with your details

Forgot password? Click here to reset