A Class of Models for Large Zero-inflated Spatial Data

04/05/2023
by   Ben Seiyon Lee, et al.
0

Spatially correlated data with an excess of zeros, usually referred to as zero-inflated spatial data, arise in many disciplines. Examples include count data, for instance, abundance (or lack thereof) of animal species and disease counts, as well as semi-continuous data like observed precipitation. Spatial two-part models are a flexible class of models for such data. Fitting two-part models can be computationally expensive for large data due to high-dimensional dependent latent variables, costly matrix operations, and slow mixing Markov chains. We describe a flexible, computationally efficient approach for modeling large zero-inflated spatial data using the projection-based intrinsic conditional autoregression (PICAR) framework. We study our approach, which we call PICAR-Z, through extensive simulation studies and two environmental data sets. Our results suggest that PICAR-Z provides accurate predictions while remaining computationally efficient. An important goal of our work is to allow researchers who are not experts in computation to easily build computationally efficient extensions to zero-inflated spatial models; this also allows for a more thorough exploration of modeling choices in two-part models than was previously possible. We show that PICAR-Z is easy to implement and extend in popular probabilistic programming languages such as nimble and stan.

READ FULL TEXT
research
12/05/2019

PICAR: An Efficient Extendable Approach for Fitting Hierarchical Spatial Models

Hierarchical spatial models are very flexible and popular for a vast arr...
research
03/23/2020

High-dimensional multivariate Geostatistics: A Bayesian Matrix-Normal Approach

Joint modeling of spatially-oriented dependent variables are commonplace...
research
11/13/2022

Flexible Basis Representations for Modeling High-Dimensional Hierarchical Spatial Data

Nonstationary and non-Gaussian spatial data are prevalent across many fi...
research
10/22/2019

Reduced-dimensional Monte Carlo Maximum Likelihood for Latent Gaussian Random Field Models

Monte Carlo maximum likelihood (MCML) provides an elegant approach to fi...
research
01/28/2019

A Spatially Discrete Approximation to Log-Gaussian Cox Processes for Modelling Aggregated Disease Count Data

In this paper, we develop a computationally efficient discrete approxima...
research
04/29/2022

Distributed Inference for Spatial Extremes Modeling in High Dimensions

Extreme environmental events frequently exhibit spatial and temporal dep...
research
10/17/2020

Fast Spatial Autocorrelation

Physical or geographic location proves to be an important feature in man...

Please sign up or login with your details

Forgot password? Click here to reset