Grid-Parametrize-Split (GriPS) for Improved Scalable Inference in Spatial Big Data Analysis

by   Michele Peruzzi, et al.

Rapid advancements in spatial technologies including Geographic Information Systems (GIS) and remote sensing have generated massive amounts of spatially referenced data in a variety of scientific and data-driven industrial applications. These advancements have led to a substantial, and still expanding, literature on the modeling and analysis of spatially oriented big data. In particular, Bayesian inferences for high-dimensional spatial processes are being sought in a variety of remote-sensing applications including, but not limited to, modeling next generation Light Detection and Ranging (LiDAR) systems and other remotely sensed data. Massively scalable spatial processes, in particular Gaussian processes (GPs), are being explored extensively for the increasingly encountered big data settings. Recent developments include GPs constructed from sparse Directed Acyclic Graphs (DAGs) with a limited number of neighbors (parents) to characterize dependence across the spatial domain. The DAG can be used to devise fast algorithms for posterior sampling of the latent process, but these may exhibit pathological behavior in estimating covariance parameters. While these issues are mitigated by considering marginalized samplers that exploit the underlying sparse precision matrix, these algorithms are slower, less flexible, and oblivious of structure in the data. The current article introduces the Grid-Parametrize-Split (GriPS) approach for conducting Bayesian inference in spatially oriented big data settings by a combination of careful model construction and algorithm design to effectuate substantial improvements in MCMC convergence. We demonstrate the effectiveness of our proposed methods through simulation experiments and subsequently undertake the modeling of LiDAR outcomes and production of their predictive maps using G-LiHT and other remotely sensed variables.



There are no comments yet.


page 22

page 25

page 33


Spatial Factor Modeling: A Bayesian Matrix-Normal Approach for Misaligned Data

Multivariate spatially-oriented data sets are prevalent in the environme...

Bayesian hierarchical modeling and analysis for physical activity trajectories using actigraph data

Rapid developments in streaming data technologies are continuing to gene...

Big Learning with Bayesian Methods

Explosive growth in data and availability of cheap computing resources h...

Learning and Recognizing Archeological Features from LiDAR Data

We present a remote sensing pipeline that processes LiDAR (Light Detecti...

Spatial meshing for general Bayesian multivariate models

Quantifying spatial and/or temporal associations in multivariate geoloca...

Modeling Massive Spatial Datasets Using a Conjugate Bayesian Linear Regression Framework

Geographic Information Systems (GIS) and related technologies have gener...

Highly Scalable Bayesian Geostatistical Modeling via Meshed Gaussian Processes on Partitioned Domains

We introduce a class of scalable Bayesian hierarchical models for the an...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.