For high-dimensional hierarchical models, consider exchangeability of effects across covariates instead of across datasets

07/13/2021
by Brian L Trippe, et al.

Hierarchical Bayesian methods enable information sharing across multiple related regression problems. While standard practice is to model regression parameters (effects) as (1) exchangeable across datasets and (2) correlated to differing degrees across covariates, we show that this approach exhibits poor statistical performance when the number of covariates exceeds the number of datasets. For instance, in statistical genetics, we might regress dozens of traits (defining datasets) for thousands of individuals (responses) on up to millions of genetic variants (covariates). When an analyst has more covariates than datasets, we argue that it is often more natural to instead model effects as (1) exchangeable across covariates and (2) correlated to differing degrees across datasets. To this end, we propose a hierarchical model expressing our alternative perspective. We devise an empirical Bayes estimator for learning the degree of correlation between datasets. We develop theory that demonstrates that our method outperforms the classic approach when the number of covariates dominates the number of datasets, and corroborate this result empirically on several high-dimensional multiple regression and classification problems.
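The contrast between the two modeling choices can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the authors' model or their empirical Bayes estimator: it assumes a zero-mean Gaussian prior on effects, observes the effects directly (no regression noise), and uses a plain second-moment estimate. The names `Q`, `D`, `Sigma`, `beta`, and `Sigma_hat` are hypothetical and chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: Q datasets, D covariates, with D much larger than Q.
Q, D = 5, 500

# Assumption: each covariate's effect vector across the Q datasets is an
# i.i.d. draw from N(0, Sigma), where Sigma (Q x Q) encodes how strongly
# the datasets are correlated with one another.
A = rng.normal(size=(Q, Q))
Sigma = A @ A.T / Q + 0.5 * np.eye(Q)
beta = rng.multivariate_normal(np.zeros(Q), Sigma, size=D)  # shape (D, Q)

# Exchangeable across covariates: D samples inform a Q x Q covariance.
Sigma_hat = beta.T @ beta / D
rel_err = np.linalg.norm(Sigma_hat - Sigma) / np.linalg.norm(Sigma)

# Exchangeable across datasets: only Q samples would inform a D x D
# covariance. The moment estimate beta @ beta.T / Q has the same rank as
# beta, so it can have rank at most Q out of D.
rank_of_d_by_d_estimate = np.linalg.matrix_rank(beta)

print(f"relative error of Q x Q estimate: {rel_err:.3f}")
print(f"rank of D x D estimate: {rank_of_d_by_d_estimate} of {D}")
```

In this toy setting the Q x Q matrix is estimated from many samples, while a D x D matrix estimated from Q samples is severely rank deficient, which is one informal way to see why the covariate-exchangeable view may be better suited to problems with more covariates than datasets.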


