Predicting paleoclimate from compositional data using multivariate Gaussian process inverse prediction

03/12/2019
by John R. Tipton, et al.

Multivariate compositional count data arise in many applications including ecology, microbiology, genetics, and paleoclimate. A frequent question in the analysis of such data is which values of one or more covariates give rise to the observed composition. Learning the relationship between covariates and the compositional counts allows for inverse prediction, that is, prediction of unobserved covariates given realizations of the response variable. Gaussian processes provide a flexible framework for modeling functional responses with respect to a covariate without assuming a functional form, and many scientific disciplines use Gaussian process approximations to improve prediction and make inference on latent processes and parameters. Because inverse prediction is mathematically and computationally challenging, predicting unobserved covariates often requires fitting models that differ from the hypothesized generative model. We present a novel computational framework that allows for efficient inverse prediction using a Gaussian process approximation to generative models. Our framework enables scientific learning about how the latent processes co-vary with respect to covariates while simultaneously providing predictions of missing covariates. The proposed framework is capable of efficiently exploring the high-dimensional, multi-modal latent spaces that arise in the inverse problem. To demonstrate flexibility, we apply our method in a generalized linear model framework to predict latent climate states given multivariate count data. Based on cross-validation, our model has predictive skill competitive with current methods while simultaneously providing formal statistical inference on the underlying community dynamics of the biological system, which was not previously available.
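
To make the idea of inverse prediction concrete, the following is a minimal sketch and not the framework proposed in the paper. It assumes simulated data for three hypothetical taxa, a single scalar covariate, a plain Gaussian process regression (posterior mean only) on smoothed log proportions as a stand-in forward model, and a flat prior over a grid of covariate values. All function names, kernel settings, and the simulation itself are illustrative assumptions.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=1.0, variance=1.0):
    """Squared-exponential covariance between 1-D input arrays a and b."""
    d = a[:, None] - b[None, :]
    return variance * np.exp(-0.5 * (d / length_scale) ** 2)


def gp_posterior_mean(x_train, y_train, x_grid, noise=0.1, length_scale=1.0):
    """Zero-mean GP regression posterior mean for each column of y_train."""
    K = rbf_kernel(x_train, x_train, length_scale) + noise * np.eye(len(x_train))
    K_star = rbf_kernel(x_grid, x_train, length_scale)
    return K_star @ np.linalg.solve(K, y_train)


def softmax(z):
    """Row-wise softmax mapping latent values to compositions."""
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def inverse_predict(counts_new, probs_grid):
    """Grid posterior over the covariate under a flat prior.

    Uses the multinomial log-likelihood; the multinomial coefficient does
    not depend on the covariate, so it is dropped.
    """
    log_lik = counts_new @ np.log(probs_grid).T
    post = np.exp(log_lik - log_lik.max())
    return post / post.sum()


def true_probs(x):
    """Assumed generative model: three taxa with optima at -2, 0, and 2."""
    x = np.atleast_1d(x)
    z = np.stack([-(x + 2.0) ** 2, -(x ** 2), -(x - 2.0) ** 2], axis=1)
    return softmax(z)


# Simulated training data: covariate values and compositional counts.
rng = np.random.default_rng(0)
x_train = rng.uniform(-3.0, 3.0, size=200)
counts_train = np.stack([rng.multinomial(100, p) for p in true_probs(x_train)])

# Forward step: smooth log proportions as functions of the covariate.
smoothed = counts_train + 0.5  # pseudo-count avoids log(0)
log_prop = np.log(smoothed / smoothed.sum(axis=1, keepdims=True))
x_grid = np.linspace(-3.0, 3.0, 241)
probs_grid = softmax(gp_posterior_mean(x_train, log_prop, x_grid))

# Inverse step: infer the covariate behind a new count observation.
x_true = 1.0
counts_new = rng.multinomial(100, true_probs(x_true)[0])
posterior = inverse_predict(counts_new, probs_grid)
x_hat = float(x_grid @ posterior)  # posterior mean of the covariate
```

The grid search is only workable here because the covariate is one-dimensional. The abstract's point is that realistic inverse problems involve high-dimensional, multi-modal latent spaces where such brute-force evaluation is not feasible.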

