Projection-Free Bandit Convex Optimization

05/18/2018
by   Lin Chen, et al.
0

In this paper, we propose the first computationally efficient projection-free algorithm for the bandit convex optimization (BCO). We show that our algorithm achieves a sublinear regret of O(nT^4/5) (where T is the horizon and n is the dimension) for any bounded convex functions with uniformly bounded gradients. We also evaluate the performance of our algorithm against prior art on both synthetic and real data sets for portfolio selection and multiclass classification problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/08/2019

Improved Regret Bounds for Projection-free Bandit Convex Optimization

We revisit the challenge of designing online algorithms for the bandit c...
research
11/21/2017

Regret Analysis for Continuous Dueling Bandit

The dueling bandit is a learning framework wherein the feedback informat...
research
09/11/2012

On the Complexity of Bandit and Derivative-Free Stochastic Convex Optimization

The problem of stochastic convex optimization with bandit feedback (in t...
research
02/01/2023

Bandit Convex Optimisation Revisited: FTRL Achieves Õ(t^1/2) Regret

We show that a kernel estimator using multiple function evaluations can ...
research
07/11/2016

Kernel-based methods for bandit convex optimization

We consider the adversarial convex bandit problem and we build the first...
research
11/13/2020

Convex Optimization with an Interpolation-based Projection and its Application to Deep Learning

Convex optimizers have known many applications as differentiable layers ...
research
05/24/2023

Optimal Rates for Bandit Nonstochastic Control

Linear Quadratic Regulator (LQR) and Linear Quadratic Gaussian (LQG) con...

Please sign up or login with your details

Forgot password? Click here to reset