Optimal Stochastic Nonconvex Optimization with Bandit Feedback

03/30/2021
by   Puning Zhao, et al.
0

In this paper, we analyze the continuous armed bandit problems for nonconvex cost functions under certain smoothness and sublevel set assumptions. We first derive an upper bound on the expected cumulative regret of a simple bin splitting method. We then propose an adaptive bin splitting method, which can significantly improve the performance. Furthermore, a minimax lower bound is derived, which shows that our new adaptive method achieves locally minimax optimal expected cumulative regret.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/23/2021

Regret Lower Bound and Optimal Algorithm for High-Dimensional Contextual Linear Bandit

In this paper, we consider the multi-armed bandit problem with high-dime...
research
06/08/2015

Regret Lower Bound and Optimal Algorithm in Dueling Bandit Problem

We study the K-armed dueling bandit problem, a variation of the standard...
research
06/03/2021

Bandit Phase Retrieval

We study a bandit version of phase retrieval where the learner chooses a...
research
04/26/2023

Adaptation to Misspecified Kernel Regularity in Kernelised Bandits

In continuum-armed bandit problems where the underlying function resides...
research
07/05/2023

Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization

Simple regret minimization is a critical problem in learning optimal tre...
research
03/24/2023

Efficient Lipschitzian Global Optimization of Hölder Continuous Multivariate Functions

This study presents an effective global optimization technique designed ...
research
05/16/2019

Adaptive Sensor Placement for Continuous Spaces

We consider the problem of adaptively placing sensors along an interval ...

Please sign up or login with your details

Forgot password? Click here to reset