A note of feature screening via rank-based coefficient of correlation

08/10/2020
by   Li-Pang Chen, et al.
0

Feature screening is useful and popular to detect informative predictors for ultrahigh-dimensional data before developing proceeding statistical analysis or constructing statistical models. While a large body of feature screening procedures has been developed, most of them are restricted on examining either continuous or discrete responses. Moreover, even though many model-free feature screening methods have been proposed, additional assumptions are imposed in those methods to ensure their theoretical results. To address those difficulties and provide simple implementation, in this paper we extend the rank-based coefficient of correlation proposed by Chatterjee (2020) to develop feature screening procedure. We show that this new screening criterion is able to deal with continuous and discrete responses. Theoretically, sure screening property is established to justify the proposed method. Simulation studies demonstrate that the predictors with nonlinear and oscillatory trajectory are successfully detected regardless of the distribution of the response.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/07/2021

Distribution-free and Model-free Multivariate Feature Screening via Multivariate Rank Distance Correlation

Feature screening approaches are effective in selecting active features ...
research
08/13/2022

A sequential stepwise screening procedure for sparse recovery in high-dimensional multiresponse models with complex group structures

Multiresponse data with complex group structures in both responses and p...
research
02/07/2021

RaSE: A Variable Screening Framework via Random Subspace Ensembles

Variable screening methods have been shown to be effective in dimension ...
research
06/04/2022

Feature screening for multi-response linear models by empirical likelihood

This paper proposes a new feature screening method for the multi-respons...
research
11/12/2021

Epistasis Detection Via the Joint Cumulant

Selecting influential nonlinear interactive features from ultrahigh dime...
research
05/17/2018

Covariance-Insured Screening

Modern bio-technologies have produced a vast amount of high-throughput d...
research
09/21/2021

A Model-free Variable Screening Method Based on Leverage Score

With rapid advances in information technology, massive datasets are coll...

Please sign up or login with your details

Forgot password? Click here to reset