Recursive Two-Step Lookahead Expected Payoff for Time-Dependent Bayesian Optimization

06/14/2020
by   S. Ashwin Renganathan, et al.
0

We propose a novel Bayesian method to solve the maximization of a time-dependent expensive-to-evaluate oracle. We are interested in the decision that maximizes the oracle at a finite time horizon, when relatively few noisy evaluations can be performed before the horizon. Our recursive, two-step lookahead expected payoff (r2LEY) acquisition function makes nonmyopic decisions at every stage by maximizing the estimated expected value of the oracle at the horizon. r2LEY circumvents the evaluation of the expensive multistep (more than two steps) lookahead acquisition function by recursively optimizing a two-step lookahead acquisition function at every stage; unbiased estimators of this latter function and its gradient are utilized for efficient optimization. r2LEY is shown to exhibit natural exploration properties far from the time horizon, enabling accurate emulation of the oracle, which is exploited in the final decision made at the horizon. To demonstrate the utility of r2LEY, we compare it with time-dependent extensions of popular myopic acquisition functions via both synthetic and real-world datasets.

READ FULL TEXT

page 6

page 8

page 14

research
05/20/2021

Lookahead Acquisition Functions for Finite-Horizon Time-Dependent Bayesian Optimization and Application to Quantum Optimal Control

We propose a novel Bayesian method to solve the maximization of a time-d...
research
03/28/2023

qEUBO: A Decision-Theoretic Acquisition Function for Preferential Bayesian Optimization

Preferential Bayesian optimization (PBO) is a framework for optimizing a...
research
05/25/2018

Maximizing acquisition functions for Bayesian optimization

Bayesian optimization is a sample-efficient approach to global optimizat...
research
08/01/2019

No-PASt-BO: Normalized Portfolio Allocation Strategy for Bayesian Optimization

Bayesian Optimization (BO) is a framework for black-box optimization tha...
research
04/26/2021

One-parameter family of acquisition functions for efficient global optimization

Bayesian optimization (BO) with Gaussian processes is a powerful methodo...
research
05/29/2018

Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning

In this paper, we propose to combine imitation and reinforcement learnin...
research
04/08/2022

Decision-Dependent Risk Minimization in Geometrically Decaying Dynamic Environments

This paper studies the problem of expected loss minimization given a dat...

Please sign up or login with your details

Forgot password? Click here to reset