Impossible Tuning Made Possible: A New Expert Algorithm and Its Applications

02/01/2021
by Liyu Chen, et al.

We resolve the long-standing "impossible tuning" issue for the classic expert problem and show that it is in fact possible to achieve regret O(√((ln d)∑_t ℓ_{t,i}^2)) simultaneously for all experts i in a T-round d-expert problem, where ℓ_{t,i} is the loss of expert i in round t. Our algorithm is based on the Mirror Descent framework with a correction term and a weighted entropy regularizer. While natural, the algorithm has not been studied before and requires a careful analysis. We also generalize the bound to O(√((ln d)∑_t (ℓ_{t,i}-m_{t,i})^2)) for any prediction vector m_t that the learner receives, and recover or improve many existing results by choosing different m_t. Furthermore, we use the same framework to create a master algorithm that combines a set of base algorithms and learns the best one with little overhead. The new guarantee of our master algorithm allows us to derive many new results for both the expert problem and, more generally, Online Linear Optimization.
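To make the quantities in the abstract concrete, the following minimal sketch measures per-expert regret and the data-dependent term √((ln d)∑_t ℓ_{t,i}^2) on a loss sequence. It runs the classic Hedge (exponential weights) update with a single fixed learning rate as a baseline, not the paper's Mirror Descent algorithm with a correction term and weighted entropy regularizer. The function names, the fixed `eta`, and the random loss sequence are illustrative assumptions, and the second function ignores the constants hidden in the O(·) notation.

```python
import math
import random


def hedge_regret(losses, eta):
    """Run classic Hedge and return the learner's regret against each expert.

    losses: list of T rounds, each a list of d per-expert losses.
    eta: one fixed learning rate (an assumption for this baseline sketch).
    """
    d = len(losses[0])
    cum = [0.0] * d      # cumulative loss of each expert
    learner = 0.0        # cumulative expected loss of the learner
    for loss in losses:
        # Weights proportional to exp(-eta * cumulative loss);
        # subtracting the minimum keeps the exponentials numerically stable.
        base = min(cum)
        w = [math.exp(-eta * (c - base)) for c in cum]
        z = sum(w)
        p = [x / z for x in w]
        learner += sum(pi * li for pi, li in zip(p, loss))
        cum = [c + li for c, li in zip(cum, loss)]
    return [learner - c for c in cum]


def second_order_term(losses, i):
    """Return sqrt((ln d) * sum_t loss[t][i]^2) for expert i, without constants."""
    d = len(losses[0])
    return math.sqrt(math.log(d) * sum(loss[i] ** 2 for loss in losses))


if __name__ == "__main__":
    rng = random.Random(0)   # hypothetical synthetic data
    T, d = 1000, 5
    losses = [[rng.uniform(-1.0, 1.0) for _ in range(d)] for _ in range(T)]
    regrets = hedge_regret(losses, eta=math.sqrt(math.log(d) / T))
    for i, r in enumerate(regrets):
        print(i, round(r, 3), round(second_order_term(losses, i), 3))
```

A single `eta` can only be tuned to one expert's loss magnitude at a time, which is the difficulty the "impossible tuning" label refers to; the abstract's claim is that one algorithm can match the per-expert term for every i simultaneously.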


