BayesSUR: An R package for high-dimensional multivariate Bayesian variable and covariance selection in linear regression

04/28/2021
by Zhi Zhao, et al.

In molecular biology, advances in high-throughput technologies have made it possible to study complex multivariate phenotypes and their simultaneous associations with high-dimensional genomic and other omics data. This problem can be addressed with high-dimensional multi-response regression, where the response variables are potentially highly correlated. To this end, we recently introduced several multivariate Bayesian variable and covariance selection models, e.g., Bayesian estimation methods for sparse seemingly unrelated regression (SUR) for variable and covariance selection. Several variable selection priors have been implemented in this context, in particular the hotspot detection prior for latent variable inclusion indicators, which results in sparse variable selection for associations between predictors and multiple phenotypes. As an alternative, we also propose a Markov random field (MRF) prior, which incorporates prior knowledge about the dependence structure of the inclusion indicators. Inference for Bayesian SUR models by Markov chain Monte Carlo methods is made computationally feasible by a factorisation of the covariance matrix amongst the response variables. In this paper we present BayesSUR, an R package that allows the user to easily specify and run a range of different Bayesian SUR models, which have been implemented in C++ for computational efficiency. The package allows the models to be specified in a modular way, where the user chooses the priors for variable selection and for covariance selection separately. We demonstrate the performance of sparse SUR models with the hotspot prior and the spike-and-slab MRF prior on synthetic and real data sets representing eQTL or mQTL studies and in vitro anti-cancer drug screening studies, as examples of typical applications.
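
To make the setting described above concrete, here is a minimal simulation sketch of a sparse multi-response regression with correlated residuals and latent inclusion indicators. It is not the BayesSUR API and performs no inference: the function name `simulate_sparse_sur`, its parameters, the equicorrelated residual covariance, and the choice of Python with NumPy are all assumptions made for illustration. "Hotspot" here simply means a predictor whose indicators are switched on for every response.

```python
import numpy as np


def simulate_sparse_sur(n=50, p=20, s=4, hotspot_rows=(0, 5), rho=0.6, seed=0):
    """Simulate Y = X @ beta + U with sparse beta and correlated residuals.

    Hypothetical helper for illustration only; names and defaults are
    assumptions, not part of the BayesSUR package.
    """
    rng = np.random.default_rng(seed)

    # Predictors: n samples by p features.
    X = rng.standard_normal((n, p))

    # Latent inclusion indicators (p x s): 1 means predictor j is
    # associated with response k. Hotspot predictors affect all responses.
    gamma = np.zeros((p, s), dtype=int)
    gamma[list(hotspot_rows), :] = 1

    # Spike-and-slab style coefficients: exactly zero where gamma == 0,
    # drawn from a normal "slab" where gamma == 1.
    beta = gamma * rng.normal(0.0, 1.0, size=(p, s))

    # Residual covariance across responses (equicorrelated, an assumption).
    C = rho * np.ones((s, s)) + (1.0 - rho) * np.eye(s)

    # Each row of U is one sample's correlated residual vector.
    U = rng.multivariate_normal(np.zeros(s), C, size=n)

    Y = X @ beta + U
    return X, Y, beta, gamma, C


if __name__ == "__main__":
    X, Y, beta, gamma, C = simulate_sparse_sur()
    print("X:", X.shape, "Y:", Y.shape)
    print("active associations:", int(gamma.sum()), "of", gamma.size)
```

In a variable and covariance selection model of the kind the abstract describes, the indicator matrix and the residual covariance structure are the unknowns to be inferred from `X` and `Y`; the sketch only generates data with that structure.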
