Geostatistical Learning: Challenges and Opportunities

02/17/2021
by   Júlio Hoffimann, et al.
0

Statistical learning theory provides the foundation to applied machine learning, and its various successful applications in computer vision, natural language processing and other scientific domains. The theory, however, does not take into account the unique challenges of performing statistical learning in geospatial settings. For instance, it is well known that model errors cannot be assumed to be independent and identically distributed in geospatial (a.k.a. regionalized) variables due to spatial correlation; and trends caused by geophysical processes lead to covariate shifts between the domain where the model was trained and the domain where it will be applied, which in turn harm the use of classical learning methodologies that rely on random samples of the data. In this work, we introduce the geostatistical (transfer) learning problem, and illustrate the challenges of learning from geospatial data by assessing widely-used methods for estimating generalization error of learning models, under covariate shift and spatial correlation. Experiments with synthetic Gaussian process data as well as with real data from geophysical surveys in New Zealand indicate that none of the methods are adequate for model selection in a geospatial context. We provide general guidelines regarding the choice of these methods in practice while new methods are being actively researched.

READ FULL TEXT

page 23

page 24

research
04/17/2022

NICO++: Towards Better Benchmarking for Domain Generalization

Despite the remarkable performance that modern deep neural networks have...
research
10/14/2020

A Distribution-Free Test of Covariate Shift Using Conformal Prediction

Covariate shift is a common and important assumption in transfer learnin...
research
03/02/2022

Estimating Conditional Average Treatment Effects with Missing Treatment Information

Estimating conditional average treatment effects (CATE) is challenging, ...
research
07/17/2023

Revisiting the Robustness of the Minimum Error Entropy Criterion: A Transfer Learning Case Study

Coping with distributional shifts is an important part of transfer learn...
research
12/16/2018

PAC Learning Guarantees Under Covariate Shift

We consider the Domain Adaptation problem, also known as the covariate s...
research
08/07/2018

Importance of the Mathematical Foundations of Machine Learning Methods for Scientific and Engineering Applications

There has been a lot of recent interest in adopting machine learning met...
research
09/30/2022

New Metric Formulas that Include Measurement Errors in Machine Learning for Natural Sciences

The application of machine learning to physics problems is widely found ...

Please sign up or login with your details

Forgot password? Click here to reset