Adaptive Bayesian SLOPE – High-dimensional Model Selection with Missing Values

09/14/2019
by   Wei Jiang, et al.
0

The selection of variables with high-dimensional and missing data is a major challenge and very few methods are available to solve this problem. Here we propose a new method – adaptive Bayesian SLOPE – which is an extension of the SLOPE method of sorted l_1 regularization within a Bayesian framework and which allows to simultaneously estimate the parameters and select variables for large data despite missing values. The method follows the idea of the Spike and Slab LASSO, but replaces the Laplace mixture prior with the frequentist motivated "SLOPE" prior, which targets control of the False Discovery Rate. The regression parameters and the noise variance are estimated using stochastic approximation EM algorithm, which allows to incorporate missing values as well as latent model parameters, like the signal magnitude and its sparsity. Extensive simulations highlight the good behavior in terms of power, FDR and estimation bias under a wide range of simulation scenarios. Finally, we consider an application of severely traumatized patients from Paris hospitals to predict the level of platelet, and demonstrate, beyond the advantage of selecting relevant variables, which is crucial for interpretation, excellent predictive capabilities. The methodology is implemented in the R package ABSLOPE, which incorporates C++ code to improve the efficiency of the proposed method.

READ FULL TEXT
research
05/11/2018

Stochastic Approximation EM for Logistic Regression with Missing Values

Logistic regression is a common classification method in supervised lear...
research
11/01/2018

HMLasso: Lasso for High Dimensional and Highly Missing Data

Sparse regression such as Lasso has achieved great success in dealing wi...
research
11/06/2021

In Nonparametric and High-Dimensional Models, Bayesian Ignorability is an Informative Prior

In problems with large amounts of missing data one must model two distin...
research
04/23/2021

Regularized Nonlinear Regression for Simultaneously Selecting and Estimating Key Model Parameters

In system identification, estimating parameters of a model using limited...
research
09/05/2023

Adaptive Bayesian Predictive Inference

Bayesian predictive inference provides a coherent description of entire ...
research
01/23/2020

The Reciprocal Bayesian LASSO

A reciprocal LASSO (rLASSO) regularization employs a decreasing penalty ...
research
04/04/2018

Variable selection using pseudo-variables

Penalized regression has become a standard tool for model building acros...

Please sign up or login with your details

Forgot password? Click here to reset