Marginal and Interactive Feature Screening of Ultra-high Dimensional Feature Spaces with Multivariate Response

11/16/2019
by   Randall Reese, et al.
0

When the number of features exponentially outnumbers the number of samples, feature screening plays a pivotal role in reducing the dimension of the feature space and developing models based on such data. While most extant feature screening approaches are only applicable to data having univariate response, we propose a new method (GenCorr) that admits a multivariate response. Such an approach allows us to more appropriately model multiple responses as a single unit, rather than as unrelated entities, which avails more robust analyses in relation to complex traits embedded in the covariance structure of multiple responses. The GenCorr framework allows for the screening of both marginal as well as interactive features. It is demonstrated that GenCorr possesses the desirable property of strong sure screening. In the marginal case, we examine the superior numerical performance of GenCorr in comparison to two current methods for multivariate marginal screening via an assortment of empirical simulations. We also present several simulations inspecting GenCorr's performance in multivariate interaction screening. A culminating real data analysis demonstrates the performance of our method on GWAS data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/10/2018

Strong Sure Screening of Ultra-high Dimensional Categorical Data

Feature screening for ultra high dimensional feature spaces plays a crit...
research
01/23/2018

Strong Sure Screening of Ultra-high Dimensional Data with Interaction Effects

Ultrahigh dimensional data sets are becoming increasingly prevalent in a...
research
06/04/2022

Feature screening for multi-response linear models by empirical likelihood

This paper proposes a new feature screening method for the multi-respons...
research
02/07/2021

RaSE: A Variable Screening Framework via Random Subspace Ensembles

Variable screening methods have been shown to be effective in dimension ...
research
08/19/2019

Model-free Feature Screening and FDR Control with Knockoff Features

This paper proposes a model-free and data-adaptive feature screening met...
research
06/21/2022

BiometricBlender: Ultra-high dimensional, multi-class synthetic data generator to imitate biometric feature space

The lack of freely available (real-life or synthetic) high or ultra-high...
research
04/21/2022

Ultra-marginal Feature Importance

Scientists frequently prioritize learning from data rather than training...

Please sign up or login with your details

Forgot password? Click here to reset