Predictive Correlation Screening: Application to Two-stage Predictor Design in High Dimension

03/10/2013
by   Hamed Firouzi, et al.
0

We introduce a new approach to variable selection, called Predictive Correlation Screening, for predictor design. Predictive Correlation Screening (PCS) implements false positive control on the selected variables, is well suited to small sample sizes, and is scalable to high dimensions. We establish asymptotic bounds for Familywise Error Rate (FWER), and resultant mean square error of a linear predictor on the selected variables. We apply Predictive Correlation Screening to the following two-stage predictor design problem. An experimenter wants to learn a multivariate predictor of gene expressions based on successive biological samples assayed on mRNA arrays. She assays the whole genome on a few samples and from these assays she selects a small number of variables using Predictive Correlation Screening. To reduce assay cost, she subsequently assays only the selected variables on the remaining samples, to learn the predictor coefficients. We show superiority of Predictive Correlation Screening relative to LASSO and correlation learning (sometimes popularly referred to in the literature as marginal regression or simple thresholding) in terms of performance and computational complexity.

READ FULL TEXT
research
02/22/2015

Two-stage Sampling, Prediction and Adaptive Regression via Correlation Screening (SPARCS)

This paper proposes a general adaptive procedure for budget-limited pred...
research
02/06/2011

Large Scale Correlation Screening

This paper treats the problem of screening for variables with high corre...
research
12/30/2017

An ISIS screening approach involving threshold/partition for variable selection in linear regression

In linear regression, one can select a predictor if the absolute sample ...
research
04/20/2021

Screening methods for linear errors-in-variables models in high dimensions

Microarray studies, in order to identify genes associated with an outcom...
research
09/23/2022

Sure Screening for Transelliptical Graphical Models

We propose a sure screening approach for recovering the structure of a t...
research
11/05/2021

Compressed spectral screening for large-scale differential correlation analysis with application in selecting Glioblastoma gene modules

Differential co-expression analysis has been widely applied by scientist...
research
11/02/2022

Inferring independent sets of Gaussian variables after thresholding correlations

We consider testing whether a set of Gaussian variables, selected from t...

Please sign up or login with your details

Forgot password? Click here to reset