Efficient nonmyopic Bayesian optimization and quadrature

09/10/2019
by   Shali Jiang, et al.
0

Finite-horizon sequential decision problems arise naturally in many machine learning contexts; examples include Bayesian optimization and Bayesian quadrature. Computing the optimal policy for such problems requires solving Bellman equations, which are generally intractable. Most existing work resorts to myopic approximations by limiting the horizon to only a single time-step, which can perform poorly in balancing exploration and exploitation. We propose a general framework for efficient, nonmyopic approximation of the optimal policy by drawing a connection between the optimal adaptive policy and its non-adaptive counterpart. Our proposal is to compute an optimal batch of points, then select a single point from within this batch to evaluate. We realize this idea for both Bayesian optimization and Bayesian quadrature and demonstrate that our proposed method significantly outperforms common myopic alternatives on a variety of tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/14/2022

Continuous-in-time Limit for Bayesian Bandits

This paper revisits the bandit problem in the Bayesian setting. The Baye...
research
11/04/2019

Lookahead Bayesian Optimization via Rollout: Guarantees and Sequential Rolling Horizons

Lookahead, also known as non-myopic, Bayesian optimization (BO) aims to ...
research
02/25/2012

Hybrid Batch Bayesian Optimization

Bayesian Optimization aims at optimizing an unknown non-convex/concave f...
research
11/21/2018

Efficient nonmyopic active search with applications in drug and materials discovery

Active search is a learning paradigm for actively identifying as many me...
research
11/21/2015

Gaussian Process Planning with Lipschitz Continuous Reward Functions: Towards Unifying Bayesian Optimization, Active Learning, and Beyond

This paper presents a novel nonmyopic adaptive Gaussian process planning...
research
04/28/2016

Sequential Bayesian optimal experimental design via approximate dynamic programming

The design of multiple experiments is commonly undertaken via suboptimal...
research
05/20/2021

Lookahead Acquisition Functions for Finite-Horizon Time-Dependent Bayesian Optimization and Application to Quantum Optimal Control

We propose a novel Bayesian method to solve the maximization of a time-d...

Please sign up or login with your details

Forgot password? Click here to reset