Online Boosting with Bandit Feedback

07/23/2020
by   Nataly Brukhim, et al.
0

We consider the problem of online boosting for regression tasks, when only limited information is available to the learner. We give an efficient regret minimization method that has two implications: an online boosting algorithm with noisy multi-point bandit feedback, and a new projection-free online convex optimization algorithm with stochastic gradient, that improves state-of-the-art guarantees in terms of efficiency.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset