A global-local approach for detecting hotspots in multiple-response regression

11/08/2018
by   Hélène Ruffieux, et al.
0

We tackle modelling and inference for variable selection in regression problems with many predictors and many responses. We focus on detecting hotspots, i.e., predictors associated with several responses. Such a task is critical in statistical genetics, as hotspot genetic variants shape the architecture of the genome by controlling the expression of many genes and may initiate decisive functional mechanisms underlying disease endpoints. Existing hierarchical regression approaches designed to model hotspots suffer from two limitations: their discrimination of hotspots is sensitive to the choice of top-level scale parameters for the propensity of predictors to be hotspots, and they do not scale to large predictor and response vectors, e.g., of dimensions 10^3-10^5 in genetic applications. We address these shortcomings by introducing a flexible hierarchical regression framework that is tailored to the detection of hotspots and scalable to the above dimensions. Our proposal implements a fully Bayesian model for hotspots based on the horseshoe shrinkage prior. Its global-local formulation shrinks noise globally and hence accommodates the highly sparse nature of genetic analyses, while being robust to individual signals, thus leaving the effects of hotspots unshrunk. Inference is carried out using a fast variational algorithm coupled with a novel simulated annealing procedure that allows efficient exploration of multimodal distributions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/15/2021

An Approach of Bayesian Variable Selection for Ultrahigh Dimensional Multivariate Regression

In many practices, scientists are particularly interested in detecting w...
research
04/11/2021

Parallel integrative learning for large-scale multi-response regression with incomplete outcomes

Multi-task learning is increasingly used to investigate the association ...
research
01/04/2011

Sparse Partitioning: Nonlinear regression with binary or tertiary predictors, with application to association studies

This paper presents Sparse Partitioning, a Bayesian method for identifyi...
research
07/29/2021

CARlasso: An R package for the estimation of sparse microbial networks with predictors

Microbiome data analyses require statistical tools that can simultaneous...
research
03/12/2023

Bayesian Size-and-Shape regression modelling

Building on Dryden et al. (2021), this note presents the Bayesian estima...
research
09/29/2021

Deep neural networks with controlled variable selection for the identification of putative causal genetic variants

Deep neural networks (DNN) have been used successfully in many scientifi...
research
12/15/2020

Bayesian Conditional Auto-Regressive LASSO Models to Learn Sparse Networks with Predictors

Microbiome data generated by next generation sequencing continue to flou...

Please sign up or login with your details

Forgot password? Click here to reset