Covariance-Insured Screening

05/17/2018
by   Kevin He, et al.
0

Modern bio-technologies have produced a vast amount of high-throughput data with the number of predictors far greater than the sample size. In order to identify more novel biomarkers and understand biological mechanisms, it is vital to detect signals weakly associated with outcomes among ultrahigh-dimensional predictors. However, existing screening methods, which typically ignore correlation information, are likely to miss these weak signals. By incorporating the inter-feature dependence, we propose a covariance-insured screening methodology to identify predictors that are jointly informative but only marginally weakly associated with outcomes. The validity of the method is examined via extensive simulations and real data studies for selecting potential genetic factors related to the onset of cancer.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/14/2021

Structured Bayesian variable selection for multiple related response variables and high-dimensional predictors

It is becoming increasingly common to study the complex association betw...
research
02/07/2021

RaSE: A Variable Screening Framework via Random Subspace Ensembles

Variable screening methods have been shown to be effective in dimension ...
research
08/10/2020

A note of feature screening via rank-based coefficient of correlation

Feature screening is useful and popular to detect informative predictors...
research
01/10/2018

Strong Sure Screening of Ultra-high Dimensional Categorical Data

Feature screening for ultra high dimensional feature spaces plays a crit...
research
09/21/2021

A Model-free Variable Screening Method Based on Leverage Score

With rapid advances in information technology, massive datasets are coll...
research
12/27/2022

Weak Signal Inclusion Under Dependence and Applications in Genome-wide Association Study

Motivated by the inquiries of weak signals in underpowered genome-wide a...
research
07/13/2022

Fitting Semiparametric Cumulative Probability Models for Big Data

Cumulative probability models (CPMs) are a robust alternative to linear ...

Please sign up or login with your details

Forgot password? Click here to reset