A combined statistical and machine learning approach for spatial prediction of extreme wildfire frequencies and sizes

12/30/2021
by   Daniela Cisneros, et al.
0

Motivated by the Extreme Value Analysis 2021 (EVA 2021) data challenge we propose a method based on statistics and machine learning for the spatial prediction of extreme wildfire frequencies and sizes. This method is tailored to handle large datasets, including missing observations. Our approach relies on a four-stage high-dimensional bivariate sparse spatial model for zero-inflated data, which is developed using stochastic partial differential equations(SPDE). In Stage 1, the observations are categorized in zero/nonzero categories and are modeled using a two-layered hierarchical Bayesian sparse spatial model to estimate the probabilities of these two categories. In Stage 2, before modeling the positive observations using spatially-varying coefficients, smoothed parameter surfaces are obtained from empirical estimates using fixed rank kriging. This approximate Bayesian method inference was employed to avoid the high computational burden of large spatial data modeling using spatially-varying coefficients. In Stage 3, the standardized log-transformed positive observations from the second stage are further modeled using a sparse bivariate spatial Gaussian process. The Gaussian distribution assumption for wildfire counts developed in the third stage is computationally effective but erroneous. Thus in Stage 4, the predicted values are rectified using Random Forests. The posterior inference is drawn for Stages 1 and 3 using Markov chain Monte Carlo (MCMC) sampling. A cross-validation scheme is then created for the artificially generated gaps, and the EVA 2021 prediction scores of the proposed model are compared to those obtained using certain natural competitors.

READ FULL TEXT

page 7

page 10

page 13

page 24

page 26

page 28

page 29

page 31

research
10/13/2021

Fast Approximate Inference for Spatial Extreme Value Models

The generalized extreme value (GEV) distribution is a popular model for ...
research
12/31/2018

A semiparametric Bayesian model for spatiotemporal extremes

In this paper, we consider a Dirichlet process mixture of spatial skew-t...
research
11/08/2022

Domain-decomposed Bayesian inversion based on local Karhunen-Loève expansions

In many Bayesian inverse problems the goal is to recover a spatially var...
research
05/20/2022

Joint modeling of landslide counts and sizes using spatial marked point processes with sub-asymptotic mark distributions

To accurately quantify landslide hazard in a region of Turkey, we develo...
research
10/23/2020

Geostatistical models for zero-inflated data and extreme values

Understanding the spatial distribution of animals, during all their life...
research
02/19/2019

Penalized basis models for very large spatial datasets

Many modern spatial models express the stochastic variation component as...
research
04/01/2020

Bayesian space-time gap filling for inference on extreme hot-spots: an application to Red Sea surface temperatures

We develop a method for probabilistic prediction of extreme value hot-sp...

Please sign up or login with your details

Forgot password? Click here to reset