Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples

07/17/2019
by Themistoklis P. Sapsis, et al.

For many important problems the quantity of interest (or output) is an unknown function of the parameter space (or input), which is a random vector with known statistics. Since the dependence of the output on this random vector is unknown, the challenge is to identify its statistics using the minimum number of function evaluations. This problem can be seen in the context of active learning or optimal experimental design. We employ Bayesian regression to represent the model uncertainty that arises from the small, finite number of input-output pairs. In this context we evaluate existing methods for optimal sample selection, such as model error minimization and mutual information maximization. We show that the criteria commonly employed in the literature do not take into account the output values of the existing input-output pairs. To overcome this deficiency we introduce a new criterion that explicitly takes into account the output values of the existing samples and adaptively selects inputs from regions or dimensions of the parameter space that contribute significantly to the output. The new method allows for application to a large number of input variables, paving the way for optimal experimental design in very high dimensions.
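The observation that standard selection criteria ignore the observed outputs can be illustrated with a minimal sketch. It assumes a Bayesian linear model with a zero-mean Gaussian prior on the weights and known noise variance, and it uses maximum predictive variance as a stand-in for a model-error-minimization criterion. The function names, the prior and noise values, and the synthetic data are all hypothetical choices made for illustration; this is not the criterion proposed in the paper.

```python
import numpy as np

def posterior(X, y, noise_var=0.1, prior_var=1.0):
    """Posterior mean and covariance of the weights of a Bayesian linear model.

    Assumes y = X @ w + noise, with w ~ N(0, prior_var * I) and
    noise ~ N(0, noise_var). These modelling choices are illustrative.
    """
    d = X.shape[1]
    precision = np.eye(d) / prior_var + X.T @ X / noise_var
    cov = np.linalg.inv(precision)      # depends on the inputs X only
    mean = cov @ X.T @ y / noise_var    # depends on the outputs y
    return mean, cov

def next_input_by_variance(X, y, candidates, noise_var=0.1, prior_var=1.0):
    """Index of the candidate input with the largest predictive variance."""
    _, cov = posterior(X, y, noise_var, prior_var)
    # Predictive variance c^T cov c for every candidate row c.
    pred_var = np.einsum("ij,jk,ik->i", candidates, cov, candidates)
    return int(np.argmax(pred_var))

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 2))             # existing inputs
candidates = rng.normal(size=(50, 2))   # pool of possible next inputs

# Two very different sets of observed outputs at the same inputs.
y_a = X @ np.array([1.0, 0.0])
y_b = X @ np.array([0.0, 5.0])

mean_a, _ = posterior(X, y_a)
mean_b, _ = posterior(X, y_b)
idx_a = next_input_by_variance(X, y_a, candidates)
idx_b = next_input_by_variance(X, y_b, candidates)
```

Under these assumptions the posterior covariance never involves `y`, so `idx_a` and `idx_b` coincide even though the two posterior means differ. An output-weighted criterion of the kind the abstract describes would instead let the observed outputs influence which candidate is selected.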


research
08/08/2018

Active Learning for Regression Using Greedy Sampling

Regression problems are pervasive in real-world applications. Generally ...
research
04/02/2020

An Upgrading Algorithm with Optimal Power Law

Consider a channel W along with a given input distribution P_X. In certa...
research
04/19/2018

A sequential sampling strategy for extreme event statistics in nonlinear dynamical systems

We develop a method for the evaluation of extreme event statistics assoc...
research
09/05/2019

LSMI-Sinkhorn: Semi-supervised Squared-Loss Mutual Information Estimation with Optimal Transport

Estimating mutual information is an important machine learning and stati...
research
02/27/2022

Bayesian Active Learning for Discrete Latent Variable Models

Active learning seeks to reduce the number of samples required to estima...
research
02/22/2021

Sequential Bayesian experimental design for estimation of extreme-event probability in stochastic dynamical systems

We consider a dynamical system with two sources of uncertainties: (1) pa...
research
04/17/2017

Fast multi-output relevance vector regression

This paper aims to decrease the time complexity of multi-output relevanc...
