Spatial Multivariate Trees for Big Data Bayesian Regression

12/02/2020
by Michele Peruzzi, et al.

High-resolution geospatial data are challenging because standard geostatistical models based on Gaussian processes do not scale to large data sizes. While progress has been made towards methods that can be computed more efficiently, considerably less attention has been devoted to big data methods that can describe complex relationships between several outcomes recorded at high resolutions by different sensors. Our Bayesian multivariate regression models based on spatial multivariate trees (SpamTrees) achieve scalability via conditional independence assumptions on latent random effects following a treed directed acyclic graph. Information-theoretic arguments and considerations of computational efficiency guide the construction of the tree and the related efficient sampling algorithms in imbalanced multivariate settings. In addition to simulated data examples, we illustrate SpamTrees using a large climate data set that combines satellite data with land-based station data. Source code is available at https://github.com/mkln/spamtree.
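The conditional independence idea in the abstract can be illustrated with a small sketch. It is not the SpamTrees implementation and does not use the spamtree package. It assumes a univariate latent effect, an exponential covariance, and a tree in which each node conditions only on its single parent. All names (`exp_cov`, `mvn_logpdf`, `tree_logdensity`) and the parameters `phi` and `sigma2` are hypothetical and chosen for illustration.

```python
import numpy as np

def exp_cov(a, b, phi=1.0, sigma2=1.0):
    """Exponential covariance between two sets of locations (assumed kernel)."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    return sigma2 * np.exp(-phi * d)

def mvn_logpdf(x, mean, cov):
    """Log density of a multivariate normal, via a Cholesky factor."""
    L = np.linalg.cholesky(cov)
    z = np.linalg.solve(L, x - mean)
    return -0.5 * (z @ z) - np.log(np.diag(L)).sum() - 0.5 * len(x) * np.log(2 * np.pi)

def tree_logdensity(w, coords, parent):
    """Sum of log p(w_node | w_parent) over the nodes of a tree.

    w, coords: dicts mapping node name -> array of effects / locations.
    parent: dict mapping node name -> parent node name, or None for the root.
    Each factor only involves the locations of one node and its parent,
    so no covariance matrix over all locations is ever formed.
    """
    total = 0.0
    for node, p in parent.items():
        c = coords[node]
        if p is None:
            mean = np.zeros(len(c))
            cov = exp_cov(c, c)
        else:
            cp = coords[p]
            C_pc = exp_cov(cp, c)
            # H = C_{node,parent} C_{parent,parent}^{-1}
            H = np.linalg.solve(exp_cov(cp, cp), C_pc).T
            mean = H @ w[p]
            cov = exp_cov(c, c) - H @ C_pc
        total += mvn_logpdf(w[node], mean, cov)
    return total

# Toy tree: one root with two children, random locations in the unit square.
rng = np.random.default_rng(0)
coords = {k: rng.uniform(size=(n, 2)) for k, n in [("root", 3), ("a", 4), ("b", 4)]}
parent = {"root": None, "a": "root", "b": "root"}
w = {k: rng.normal(size=len(v)) for k, v in coords.items()}
lp = tree_logdensity(w, coords, parent)
```

In this sketch the children "a" and "b" are conditionally independent given the root, so the cost is driven by the size of each node and its parent rather than by the total number of locations. For a tree with a single branch the factorization reduces to the chain rule and matches the full Gaussian process density.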


research
03/25/2020

Highly Scalable Bayesian Geostatistical Modeling via Meshed Gaussian Processes on Partitioned Domains

We introduce a class of scalable Bayesian hierarchical models for the an...
research
04/05/2022

GP-BART: a novel Bayesian additive regression trees approach using Gaussian processes

The Bayesian additive regression trees (BART) model is an ensemble metho...
research
01/25/2022

Spatial meshing for general Bayesian multivariate models

Quantifying spatial and/or temporal associations in multivariate geoloca...
research
10/08/2022

Spatial predictions on physically constrained domains: Applications to Arctic sea salinity data

In this paper, we predict sea surface salinity (SSS) in the Arctic Ocean...
research
05/15/2019

IPC: A Benchmark Data Set for Learning with Graph-Structured Data

Benchmark data sets are an indispensable ingredient of the evaluation of...
research
01/05/2021

Bayesian hierarchical modeling and analysis for physical activity trajectories using actigraph data

Rapid developments in streaming data technologies are continuing to gene...
research
03/26/2022

Influential Observations in Bayesian Regression Tree Models

BCART (Bayesian Classification and Regression Trees) and BART (Bayesian ...
