Large-scale local surrogate modeling of stochastic simulation experiments

09/11/2021
by D. Austin Cole, et al.

Gaussian process (GP) regression in large-data contexts, which often arises in surrogate modeling of stochastic simulation experiments, is challenged by runtimes that grow cubically in the number of observations. Coping with input-dependent noise in that setting is doubly challenging. Recent advances target reduced computational complexity through local approximation (e.g., LAGP) or otherwise induced sparsity. Yet these do not economically accommodate replication, a common design feature when attempting to separate signal from noise. Replication can offer both statistical and computational efficiencies, motivating several extensions to the local surrogate modeling toolkit. Introducing a nugget into a local kernel structure is just the first step. We argue that a new inducing point formulation (LIGP), already preferred over LAGP on the speed-vs-accuracy frontier, conveys additional advantages when replicates are involved. Woodbury identities allow local kernel structure to be expressed in terms of unique design locations only, increasing the amount of data (i.e., the neighborhood size) that may be leveraged without additional flops. We demonstrate that this upgraded LIGP provides more accurate prediction and uncertainty quantification than several modern alternatives. Illustrations are provided on benchmark data, on real-world simulation experiments in epidemic management and ocean oxygen concentration, and in an options pricing control framework.
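The claim that kernel structure can be expressed in terms of unique design locations only can be checked numerically on a toy problem. The sketch below is not the LIGP method itself: it uses no inducing points and no local neighborhoods, and it assumes a constant nugget `g`, a squared-exponential kernel with unit scale, and one-dimensional inputs, none of which come from the abstract. All function and variable names are hypothetical. It compares GP predictions computed from all N replicated observations against predictions computed from the n unique inputs, their replicate averages, and the nugget divided by the replicate counts.

```python
import numpy as np

def sq_exp_kernel(a, b, lengthscale=0.2):
    """Squared-exponential kernel with unit scale (an assumed choice)."""
    d2 = (a[:, None] - b[None, :]) ** 2
    return np.exp(-d2 / lengthscale)

def predict_full(x_all, y_all, x_new, g):
    """GP prediction using every observation: an N x N solve."""
    K = sq_exp_kernel(x_all, x_all) + g * np.eye(len(x_all))
    k = sq_exp_kernel(x_all, x_new)
    mean = k.T @ np.linalg.solve(K, y_all)
    var = 1.0 + g - np.sum(k * np.linalg.solve(K, k), axis=0)
    return mean, var

def predict_unique(x_all, y_all, x_new, g):
    """Same prediction from unique inputs only: an n x n solve."""
    x_uniq, inverse, counts = np.unique(
        x_all, return_inverse=True, return_counts=True
    )
    inverse = np.ravel(inverse)
    # Average the replicated responses at each unique input.
    y_bar = np.bincount(inverse, weights=y_all) / counts
    # The nugget shrinks by the replicate count at each unique input.
    K = sq_exp_kernel(x_uniq, x_uniq) + g * np.diag(1.0 / counts)
    k = sq_exp_kernel(x_uniq, x_new)
    mean = k.T @ np.linalg.solve(K, y_bar)
    var = 1.0 + g - np.sum(k * np.linalg.solve(K, k), axis=0)
    return mean, var

# Toy replicated design: 8 unique inputs, 2 to 5 replicates each.
rng = np.random.default_rng(0)
x_sites = np.linspace(0.0, 1.0, 8)
reps = rng.integers(2, 6, size=len(x_sites))
x_all = np.repeat(x_sites, reps)
y_all = np.sin(2 * np.pi * x_all) + 0.3 * rng.standard_normal(len(x_all))
x_new = np.linspace(0.0, 1.0, 5)
g = 0.1  # assumed constant noise-to-signal ratio

mean_full, var_full = predict_full(x_all, y_all, x_new, g)
mean_uniq, var_uniq = predict_unique(x_all, y_all, x_new, g)

print("N =", len(x_all), "n =", len(x_sites))
print("max |mean difference|:", np.max(np.abs(mean_full - mean_uniq)))
print("max |variance difference|:", np.max(np.abs(var_full - var_uniq)))
```

The two routes should agree up to floating-point error, while the second only ever factorizes an n x n matrix. With input-dependent noise, the scalar `g` would presumably become a per-location noise term, which this sketch does not attempt.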

research
10/09/2017

Replication or exploration? Sequential design for stochastic simulation experiments

In this paper we investigate the merits of replication, and provide meth...
research
08/28/2020

Locally induced Gaussian processes for large-scale simulation experiments

Gaussian processes (GPs) serve as flexible surrogates for complex surfac...
research
07/20/2022

Machine learning and geospatial methods for large-scale mining data

The canonical technique for nonlinear modeling of spatial and other poin...
research
12/01/2017

Emulating satellite drag from large simulation experiments

Obtaining accurate estimates of satellite drag coefficients in low Earth...
research
01/30/2020

Towards a Kernel based Physical Interpretation of Model Uncertainty

This paper introduces a new information theoretic framework that provide...
research
05/07/2021

Local approximate Gaussian process regression for data-driven constitutive laws: Development and comparison with neural networks

Hierarchical computational methods for multiscale mechanics such as the ...
research
09/05/2016

GTApprox: surrogate modeling for industrial design

We describe GTApprox - a new tool for medium-scale surrogate modeling in...