Random forests for binary geospatial data

02/27/2023
by Arkajyoti Saha, et al.

Binary geospatial data is commonly analyzed with generalized linear mixed models, specified with a linear fixed covariate effect and a Gaussian Process (GP)-distributed spatial random effect, both related to the response via a link function. The assumption of linear covariate effects is severely restrictive. Random Forests (RF) are increasingly being used for non-linear modeling of spatial data, but current extensions of RF for binary spatial data depart from the mixed model setup, relinquishing inference on the fixed effects and other advantages of using GP. We propose RF-GP, which uses Random Forests to estimate the non-linear covariate effect and Gaussian Processes to model the spatial random effects directly within the generalized mixed model framework. We observe and exploit the equivalence of the Gini impurity measure and the least squares loss to propose an extension of RF for binary data that accounts for spatial dependence. We then propose a novel link inversion algorithm that leverages the properties of GP to estimate the covariate effects and offer spatial predictions. RF-GP outperforms existing RF methods for estimation and prediction in both simulated and real-world data. We establish consistency of RF-GP for a general class of β-mixing binary processes that includes common choices like spatial Matérn GP and autoregressive processes.
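The equivalence between the Gini impurity measure and the least squares loss mentioned in the abstract can be checked numerically in the ordinary, non-spatial case. For binary labels in a node of size n with mean p, the Gini impurity is 2p(1-p) and the sum of squared errors around the node mean is np(1-p), so the size-weighted Gini score of a split is exactly twice its least squares loss. The sketch below is an illustration only: the data are synthetic, the helper names (`gini`, `sse`, `weighted_gini`, `split_sse`) are hypothetical, and it does not reproduce the spatially dependent extension that the paper proposes.

```python
import numpy as np

def gini(y):
    """Gini impurity 1 - p^2 - (1-p)^2 = 2p(1-p) for binary labels in {0, 1}."""
    p = np.mean(y)
    return 2.0 * p * (1.0 - p)

def sse(y):
    """Least squares loss: sum of squared deviations from the node mean."""
    return float(np.sum((y - np.mean(y)) ** 2))

def weighted_gini(y_left, y_right):
    """Size-weighted Gini impurity of the two child nodes of a split."""
    return len(y_left) * gini(y_left) + len(y_right) * gini(y_right)

def split_sse(y_left, y_right):
    """Total least squares loss of the two child nodes of a split."""
    return sse(y_left) + sse(y_right)

# Synthetic data (an assumption for illustration): one covariate, binary response
rng = np.random.default_rng(0)
x = rng.uniform(size=200)
y = (rng.uniform(size=200) < x).astype(float)

# Score a grid of candidate split thresholds under both criteria
thresholds = np.linspace(0.1, 0.9, 17)
gini_scores = np.array([weighted_gini(y[x <= t], y[x > t]) for t in thresholds])
sse_scores = np.array([split_sse(y[x <= t], y[x > t]) for t in thresholds])

# The two criteria differ only by a constant factor of 2,
# so they rank candidate splits identically.
print(np.allclose(gini_scores, 2.0 * sse_scores))
```

Because the two scores are proportional, a tree grown on binary labels with the Gini criterion chooses the same splits as a regression tree grown with squared error, which is what makes a least squares formulation a natural starting point for handling dependence.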


