Zeroth Order Non-convex optimization with Dueling-Choice Bandits

11/03/2019
by   Yichong Xu, et al.
0

We consider a novel setting of zeroth order non-convex optimization, where in addition to querying the function value at a given point, we can also duel two points and get the point with the larger function value. We refer to this setting as optimization with dueling-choice bandits since both direct queries and duels are available for optimization. We give the COMP-GP-UCB algorithm based on GP-UCB (Srinivas et al., 2009), where instead of directly querying the point with the maximum Upper Confidence Bound (UCB), we perform a constrained optimization and use comparisons to filter out suboptimal points. COMP-GP-UCB comes with theoretical guarantee of O(Φ/√(T)) on simple regret where T is the number of direct queries and Φ is an improved information gain corresponding to a comparison based constraint set that restricts the search space for the optimum. In contrast, in the direct query only setting, Φ depends on the entire domain. Finally, we present experimental results to show the efficacy of our algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/20/2021

Optimal Order Simple Regret for Gaussian Process Bandits

Consider the sequential optimization of a continuous, possibly non-conve...
research
06/16/2021

Ada-BKB: Scalable Gaussian Process Optimization on Continuous Domain by Adaptive Discretization

Gaussian process optimization is a successful class of algorithms (e.g. ...
research
06/09/2020

Scalable Thompson Sampling using Sparse Gaussian Process Models

Thompson Sampling (TS) with Gaussian Process (GP) models is a powerful t...
research
03/09/2012

Regret Bounds for Deterministic Gaussian Process Bandits

This paper analyses the problem of Gaussian process (GP) bandits with de...
research
06/27/2012

Exponential Regret Bounds for Gaussian Process Bandits with Deterministic Observations

This paper analyzes the problem of Gaussian process (GP) bandits with de...
research
10/24/2020

Private Outsourced Bayesian Optimization

This paper presents the private-outsourced-Gaussian process-upper confid...
research
12/14/2021

Imaginary Zeroth-Order Optimization

Zeroth-order optimization methods are developed to overcome the practica...

Please sign up or login with your details

Forgot password? Click here to reset