Distance-distributed design for Gaussian process surrogates

12/06/2018
by   Boya Zhang, et al.
0

A common challenge in computer experiments and related fields is to efficiently explore the input space using a small number of samples, i.e., the experimental design problem. Much of the recent focus in the computer experiment literature, where modeling is often via Gaussian process (GP) surrogates, has been on space-filling designs, via maximin distance, Latin hypercube, etc. However, it is easy to demonstrate empirically that such designs disappoint when the model hyperparameterization is unknown, and must be estimated from data observed at the chosen design sites. This is true even when the performance metric is prediction-based, or when the target of interest is inherently or eventually sequential in nature, such as in blackbox (Bayesian) optimization. Here we expose such inefficiencies, showing that in many cases purely random design is superior to higher-powered alternatives. We then propose a family of new schemes by reverse engineering the qualities of the random designs which give the best estimates of GP lengthscales. Specifically, we study the distribution of pairwise distances between design elements, and develop a numerical scheme to optimize those distances for a given sample size and dimension. We illustrate how our distance-based designs, and their hybrids with more conventional space-filling schemes, outperform in both static (one-shot design) and sequential settings.

READ FULL TEXT

page 10

page 12

research
05/12/2023

Sequential Designs for Filling Output Spaces

Space-filling designs are commonly used in computer experiments to fill ...
research
09/05/2018

Controlled Random Search Improves Sample Mining and Hyper-Parameter Optimization

A common challenge in machine learning and related fields is the need to...
research
10/09/2017

Replication or exploration? Sequential design for stochastic simulation experiments

In this paper we investigate the merits of replication, and provide meth...
research
05/20/2023

Sequential Latin Hypercube Design for Two-layer Computer Simulators

The two-layer computer simulators are commonly used to mimic multi-physi...
research
12/16/2017

A Spectral Approach for the Design of Experiments: Design, Analysis and Algorithms

This paper proposes a new approach to construct high quality space-filli...
research
01/24/2022

Design Strategies and Approximation Methods for High-Performance Computing Variability Management

Performance variability management is an active research area in high-pe...
research
04/06/2018

Optimal Design Emulators: A Point Process Approach

Design of experiments is a fundamental topic in applied statistics with ...

Please sign up or login with your details

Forgot password? Click here to reset