Variable screening with multiple studies

10/11/2017
by   Tianzhou Ma, et al.
0

Advancement in technology has generated abundant high-dimensional data that allows integration of multiple relevant studies. Due to their huge computational advantage, variable screening methods based on marginal correlation have become promising alternatives to the popular regularization methods for variable selection. However, all these screening methods are limited to single study so far. In this paper, we consider a general framework for variable screening with multiple related studies, and further propose a novel two-step screening procedure using a self-normalized estimator for high-dimensional regression analysis in this framework. Compared to the one-step procedure and rank-based sure independence screening (SIS) procedure, our procedure greatly reduces false negative errors while keeping a low false positive rate. Theoretically, we show that our procedure possesses the sure screening property with weaker assumptions on signal strengths and allows the number of features to grow at an exponential rate of the sample size. In addition, we relax the commonly used normality assumption and allow sub-Gaussian distributions. Simulations and a real transcriptomic application illustrate the advantage of our method as compared to the rank-based SIS method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/05/2015

High-dimensional Ordinary Least-squares Projection for Screening Variables

Variable selection is a challenging issue in statistical applications wh...
research
02/27/2018

Sufficient variable screening via directional regression with censored response

We in this paper propose a directional regression based approach for ult...
research
12/26/2022

Robust distance correlation for variable screening

High-dimensional data are commonly seen in modern statistical applicatio...
research
04/30/2020

A robust variable screening procedure for ultra-high dimensional data

Variable selection in ultra-high dimensional regression problems has bec...
research
11/28/2017

Nonparametric Independence Screening via Favored Smoothing Bandwidth

We propose a flexible nonparametric regression method for ultrahigh-dime...
research
03/09/2019

Distributed Feature Screening via Componentwise Debiasing

Feature screening is a powerful tool in the analysis of high dimensional...
research
08/24/2021

A Generalized Knockoff Procedure for FDR Control in Structural Change Detection

Controlling false discovery rate (FDR) is crucial for variable selection...

Please sign up or login with your details

Forgot password? Click here to reset