Comparison of new computational methods for geostatistical modelling of malaria

05/03/2023
by Spencer Wong, et al.

Background: Geostatistical analysis of health data is increasingly used to model spatial variation in malaria prevalence, burden, and other metrics. Traditional inference methods for geostatistical modelling are notoriously computationally intensive, motivating the development of newer, approximate methods. The appeal of faster methods grows as the size of the region and the number of spatial locations being modelled increase.

Methods: We present an applied comparison of four proposed 'fast' geostatistical modelling methods and the software provided to implement them: Integrated Nested Laplace Approximation (INLA), tree boosting with Gaussian processes and mixed effect models (GPBoost), Fixed Rank Kriging (FRK), and Spatial Random Forests (SpRF). We illustrate the four methods by estimating malaria prevalence on two different spatial scales, country and continent. We compare the performance of the four methods on these data in terms of accuracy, computation time, and ease of implementation.

Results: Two of these methods, SpRF and GPBoost, do not scale well as the data size increases, and so are likely to be infeasible for larger-scale analysis problems. The two remaining methods, INLA and FRK, do scale well computationally; however, the resulting model fits are very sensitive to the user's modelling assumptions and parameter choices.

Conclusions: INLA and FRK both enable scalable geostatistical modelling of malaria prevalence data. However, care must be taken with either method to assess the fit of the model to the data and the plausibility of its predictions, in order to select appropriate model assumptions and approximation parameters.


