Bayesian iterative screening in ultra-high dimensional settings

07/21/2021
by Run Wang, et al.

Variable selection in ultra-high dimensional linear regression is often preceded by a screening step that substantially reduces the dimension. Here a Bayesian variable screening method (BITS) is developed. BITS can incorporate prior knowledge, if any, on effect sizes and on the number of true variables. It iteratively includes the candidate variable with the highest posterior probability, accounting for the variables already selected. It is implemented with a fast Cholesky update algorithm and is shown to have the screening consistency property. Although BITS is built on a model with Gaussian errors, screening consistency is proved to hold under more general tail conditions. The notion of posterior screening consistency allows the resulting model to serve as a good starting point for further Bayesian variable selection methods. A new screening-consistent stopping rule based on posterior probability is also developed. Simulation studies and real data examples demonstrate the scalability and fine screening performance of the method.
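The iterative step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation, and the abstract does not state the model details, so everything below is an assumption: a ridge-type Gaussian prior on the coefficients with precision `lam`, the error variance integrated out, a fixed number of steps `k` in place of the posterior-probability stopping rule, and plain linear solves in place of the fast Cholesky update. The function name `forward_posterior_screen` is hypothetical.

```python
import numpy as np


def forward_posterior_screen(X, y, k, lam=1.0):
    """Greedy screening sketch (assumed model, not the paper's exact one).

    At each step, add the variable that maximizes the log marginal posterior
    of the enlarged model, up to terms that are constant across candidates.
    Assumes beta ~ N(0, sigma^2 / lam * I) with sigma^2 integrated out.
    """
    n, p = X.shape
    X = X - X.mean(axis=0)          # center columns
    y = y - y.mean()                # center response
    col_ss = np.einsum("ij,ij->j", X, X) + lam   # x_j'x_j + lam
    selected = []
    resid = y.copy()                # ridge residual of the current model
    rss = float(y @ y)              # y'y - y'X_s (X_s'X_s + lam I)^{-1} X_s'y

    for _ in range(k):
        if selected:
            Xs = X[:, selected]
            A = Xs.T @ Xs + lam * np.eye(len(selected))
            V = Xs.T @ X                        # cross-products with all columns
            W = np.linalg.solve(A, V)
            # Schur complement: factor by which det(A) grows if j is added
            schur = col_ss - np.einsum("ij,ij->j", V, W)
        else:
            schur = col_ss.copy()

        # Ridge RSS of the model enlarged by each candidate j
        new_rss = rss - (X.T @ resid) ** 2 / schur
        score = -0.5 * np.log(schur) - 0.5 * (n - 1) * np.log(new_rss)
        score[selected] = -np.inf               # never re-select a variable

        j = int(np.argmax(score))
        selected.append(j)

        # Refit the current model (a Cholesky update would avoid this re-solve)
        Xs = X[:, selected]
        A = Xs.T @ Xs + lam * np.eye(len(selected))
        coef = np.linalg.solve(A, Xs.T @ y)
        resid = y - Xs @ coef
        rss = float(y @ resid)

    return selected
```

Calling `forward_posterior_screen(X, y, k=10)` on an `n x p` design matrix returns the indices of the ten screened variables in the order they were added. Because all candidates at a given step yield models of the same size, any prior on model size cancels out of the comparison in this sketch; it would matter only for a stopping rule.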


research
05/17/2011

Independent screening for single-index hazard rate models with ultra-high dimensional features

In data sets with many more features than observations, independent scre...
research
02/24/2015

On the consistency theory of high dimensional variable screening

Variable screening is a fast dimension reduction technique for assisting...
research
06/13/2020

Model Based Screening Embedded Bayesian Variable Selection for Ultra-high Dimensional Settings

We develop a Bayesian variable selection method, called SVEN, based on a...
research
12/17/2010

Ultra-high Dimensional Multiple Output Learning With Simultaneous Orthogonal Matching Pursuit: A Sure Screening Approach

We propose a novel application of the Simultaneous Orthogonal Matching P...
research
09/06/2021

Screening the Discrepancy Function of a Computer Model

Screening traditionally refers to the problem of detecting active inputs...
research
12/04/2012

Better subset regression

To find efficient screening methods for high dimensional linear regressi...
research
06/15/2023

Conditional variable screening for ultra-high dimensional longitudinal data with time interactions

In recent years we have been able to gather large amounts of genomic dat...
