Lookahead Bayesian Optimization via Rollout: Guarantees and Sequential Rolling Horizons

11/04/2019
by   Xubo Yue, et al.
0

Lookahead, also known as non-myopic, Bayesian optimization (BO) aims to find optimal sampling policies through solving a dynamic programming (DP) formulation that maximizes a long-term reward over a rolling horizon. Though promising, lookahead BO faces the risk of error propagation through its increased dependence on a possibly mis-specified model. In this work we focus on the rollout approximation for solving the intractable DP. We first prove the improving nature of rollout in tackling lookahead BO. We then provide both a theoretical and practical guideline to decide on the rolling horizon stagewise. This guideline is built on quantifying the negative effect of a mis-specified model. To illustrate our idea, we provide case studies on both single and multi-information source BO. Empirical results show the advantageous properties of our method over several myopic and non-myopic BO algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/10/2019

Efficient nonmyopic Bayesian optimization and quadrature

Finite-horizon sequential decision problems arise naturally in many mach...
research
07/13/2022

Non-Myopic Multifidelity Bayesian Optimization

Bayesian optimization is a popular framework for the optimization of bla...
research
02/19/2021

Learning to Stop with Surprisingly Few Samples

We consider a discounted infinite horizon optimal stopping problem. If t...
research
12/15/2022

Rollout Algorithms and Approximate Dynamic Programming for Bayesian Optimization and Sequential Estimation

We provide a unifying approximate dynamic programming framework that app...
research
12/28/2021

Robustness and risk management via distributional dynamic programming

In dynamic programming (DP) and reinforcement learning (RL), an agent le...
research
05/20/2021

Lookahead Acquisition Functions for Finite-Horizon Time-Dependent Bayesian Optimization and Application to Quantum Optimal Control

We propose a novel Bayesian method to solve the maximization of a time-d...

Please sign up or login with your details

Forgot password? Click here to reset