Bayesian predictive modeling of multi-source multi-way data

08/05/2022
by   Jonathan Kim, et al.
0

We develop a Bayesian approach to predict a continuous or binary outcome from data that are collected from multiple sources with a multi-way (i.e.. multidimensional tensor) structure. As a motivating example we consider molecular data from multiple 'omics sources, each measured over multiple developmental time points, as predictors of early-life iron deficiency (ID) in a rhesus monkey model. We use a linear model with a low-rank structure on the coefficients to capture multi-way dependence and model the variance of the coefficients separately across each source to infer their relative contributions. Conjugate priors facilitate an efficient Gibbs sampling algorithm for posterior inference, assuming a continuous outcome with normal errors or a binary outcome with a probit link. Simulations demonstrate that our model performs as expected in terms of misclassification rates and correlation of estimated coefficients with true coefficients, with large gains in performance by incorporating multi-way structure and modest gains when accounting for differing signal sizes across the different sources. Moreover, it provides robust classification of ID monkeys for our motivating application. Software in the form of R code is available at https://github.com/BiostatsKim/BayesMSMW .

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/04/2017

Tensor-on-tensor regression

We propose a framework for the linear prediction of a multi-way array (i...
research
01/31/2019

Bayesian nonparametric multiway regression for clustered binomial data

We introduce a Bayesian nonparametric regression model for data with mul...
research
06/26/2016

Discriminating sample groups with multi-way data

High-dimensional linear classifiers, such as the support vector machine ...
research
02/26/2021

sJIVE: Supervised Joint and Individual Variation Explained

Analyzing multi-source data, which are multiple views of data on the sam...
research
05/11/2018

TensOrMachine: Probabilistic Boolean Tensor Decomposition

Boolean tensor decomposition approximates data of multi-way binary relat...
research
11/29/2022

Bayesian Simultaneous Factorization and Prediction Using Multi-Omic Data

Understanding of the pathophysiology of obstructive lung disease (OLD) i...
research
03/28/2023

Large-scale Training Data Search for Object Re-identification

We consider a scenario where we have access to the target domain, but ca...

Please sign up or login with your details

Forgot password? Click here to reset