A Statistical Learning View of Simple Kriging

02/15/2022
by   Emilia Siviero, et al.
1

In the Big Data era, with the ubiquity of geolocation sensors in particular, massive datasets exhibiting a possibly complex spatial dependence structure are becoming increasingly available. In this context, the standard probabilistic theory of statistical learning does not apply directly and guarantees of the generalization capacity of predictive rules learned from such data are left to establish. We analyze here the simple Kriging task, the flagship problem in Geostatistics: the values of a square integrable random field X={X_s}_s∈ S, S⊂ℝ^2, with unknown covariance structure are to be predicted with minimum quadratic risk, based upon observing a single realization of the spatial process at a finite number of locations s_1, …, s_n in S. Despite the connection of this minimization problem with kernel ridge regression, establishing the generalization capacity of empirical risk minimizers is far from straightforward, due to the non i.i.d. nature of the spatial data X_s_1, …, X_s_n involved. In this article, nonasymptotic bounds of order O_ℙ(1/n) are proved for the excess risk of a plug-in predictive rule mimicking the true minimizer in the case of isotropic stationary Gaussian processes observed at locations forming a regular grid. These theoretical results, as well as the role played by the technical conditions required to establish them, are illustrated by various numerical experiments and hopefully pave the way for further developments in statistical learning based on spatial data.

READ FULL TEXT

page 10

page 27

page 28

page 29

research
06/05/2019

Empirical Risk Minimization under Random Censorship: Theory and Practice

We consider the classic supervised learning problem, where a continuous ...
research
02/08/2022

Basis-Function Models in Spatial Statistics

Spatial statistics is concerned with the analysis of data that have spat...
research
06/28/2019

Statistical Learning from Biased Training Samples

With the deluge of digitized information in the Big Data era, massive da...
research
01/22/2013

The connection between Bayesian estimation of a Gaussian random field and RKHS

Reconstruction of a function from noisy data is often formulated as a re...
research
12/16/2019

Realization of spatial sparseness by deep ReLU nets with massive data

The great success of deep learning poses urgent challenges for understan...
research
03/06/2023

On Regression in Extreme Regions

In the classic regression problem, the value of a real-valued random var...
research
06/26/2020

Nearest Neighbour Based Estimates of Gradients: Sharp Nonasymptotic Bounds and Applications

Motivated by a wide variety of applications, ranging from stochastic opt...

Please sign up or login with your details

Forgot password? Click here to reset