Dying Experts: Efficient Algorithms with Optimal Regret Bounds

10/29/2019
by   Hamid Shayestehmanesh, et al.
0

We study a variant of decision-theoretic online learning in which the set of experts that are available to Learner can shrink over time. This is a restricted version of the well-studied sleeping experts problem, itself a generalization of the fundamental game of prediction with expert advice. Similar to many works in this direction, our benchmark is the ranking regret. Various results suggest that achieving optimal regret in the fully adversarial sleeping experts problem is computationally hard. This motivates our relaxation where any expert that goes to sleep will never again wake up. We call this setting "dying experts" and study it in two different cases: the case where the learner knows the order in which the experts will die and the case where the learner does not. In both cases, we provide matching upper and lower bounds on the ranking regret in the fully adversarial setting. Furthermore, we present new, computationally efficient algorithms that obtain our optimal upper bounds.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/07/2020

Adversarial Online Learning with Changing Action Sets: Efficient Algorithms with Approximate Regret Bounds

We revisit the problem of online learning with sleeping experts/bandits:...
research
03/14/2023

Information-Theoretic Regret Bounds for Bandits with Fixed Expert Advice

We investigate the problem of bandits with expert advice when the expert...
research
10/15/2021

k – Online Policies and Fundamental Limits

This paper introduces and studies the k problem – a generalization of th...
research
04/13/2021

Sequential Ski Rental Problem

The classical 'buy or rent' ski-rental problem was recently considered i...
research
05/24/2023

No-Regret Online Prediction with Strategic Experts

We study a generalization of the online binary prediction with expert ad...
research
08/11/2022

Regret Analysis for Hierarchical Experts Bandit Problem

We study an extension of standard bandit problem in which there are R la...
research
01/28/2020

Fast Rates for Online Prediction with Abstention

In the setting of sequential prediction of individual {0, 1}-sequences w...

Please sign up or login with your details

Forgot password? Click here to reset