BIVAS: A scalable Bayesian method for bi-level variable selection with applications

03/28/2018
by   Mingxuan Cai, et al.
0

In this paper, we consider a Bayesian bi-level variable selection problem in high-dimensional regressions. In many practical situations, it is natural to assign group membership to each predictor. Examples include that genetic variants can be grouped at the gene level and a covariate from different tasks naturally forms a group. Thus, it is of interest to select important groups as well as important members from those groups. The existing Markov Chain Monte Carlo (MCMC) methods are often computationally intensive and not scalable to large data sets. To address this problem, we consider variational inference for bi-level variable selection (BIVAS). In contrast to the commonly used mean-field approximation, we propose a hierarchical factorization to approximate the posterior distribution, by utilizing the structure of bi-level variable selection. Moreover, we develop a computationally efficient and fully parallelizable algorithm based on this variational approximation. We further extend the developed method to model data sets from multi-task learning. The comprehensive numerical results from both simulation studies and real data analysis demonstrate the advantages of BIVAS for variable selection, parameter estimation and computational efficiency over existing methods. The method is implemented in R package `bivas' available at https://github.com/mxcai/bivas.

READ FULL TEXT

page 15

page 17

research
06/28/2022

Bayesian Multi-task Variable Selection with an Application to Differential DAG Analysis

In this paper, we study the Bayesian multi-task variable selection probl...
research
11/26/2019

A High-dimensional M-estimator Framework for Bi-level Variable Selection

In high-dimensional data analysis, bi-level sparsity is often assumed wh...
research
09/19/2017

varbvs: Fast Variable Selection for Large-scale Regression

We introduce varbvs, a suite of functions written in R and MATLAB for re...
research
03/22/2023

Scalable Bayesian bi-level variable selection in generalized linear models

Motivated by a real-world application in cardiology, we develop an algor...
research
10/12/2018

Fast approximate inference for variable selection in Dirichlet process mixtures, with an application to pan-cancer proteomics

The Dirichlet Process (DP) mixture model has become a popular choice for...
research
11/08/2018

Variational Bayesian hierarchical regression for data analysis

Collected data, which is used for analysis or prediction tasks, often ha...
research
02/04/2018

Simultaneous Selection of Multiple Important Single Nucleotide Polymorphisms in Familial Genome Wide Association Studies Data

We propose a resampling-based fast variable selection technique for sele...

Please sign up or login with your details

Forgot password? Click here to reset