Thompson Sampling for Adversarial Bit Prediction

06/21/2019
by   Yuval Lewi, et al.

We study the Thompson sampling algorithm in an adversarial setting, specifically for adversarial bit prediction. We characterize the bit sequences with the smallest and largest expected regret. Among sequences of length T with k < T/2 zeros, the sequences of largest regret consist of alternating zeros and ones followed by the remaining ones, and the sequence of smallest regret consists of ones followed by zeros. We also bound the regret of those sequences: the worst-case sequences have regret O(√T), and the best-case sequence has regret O(1). We extend our results to a model where false positive and false negative errors have different weights. We characterize the sequences with the largest expected regret in this generalized setting and derive their regret bounds. We also show that there are sequences with O(1) regret.
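The contrast between the two sequence shapes named in the abstract can be explored numerically. The following is a minimal sketch, not the authors' implementation. It assumes a uniform Beta(1, 1) prior over the bias of the bit, a learner that samples theta from the Beta posterior each round and predicts 1 when theta > 1/2, and regret measured against the better of the two constant predictors. The prediction rule, the prior, and all function names are assumptions made for illustration. Because the posterior depends only on the observed bits, the expected regret can be computed exactly rather than by simulation.

```python
from math import comb


def prob_predict_one(ones, zeros):
    """P(theta > 1/2) for theta ~ Beta(ones + 1, zeros + 1).

    For integer parameters a, b this equals P(Binomial(a + b - 1, 1/2) <= a - 1),
    so no numerical integration is needed.
    """
    a = ones + 1
    n = ones + zeros + 1  # a + b - 1
    return sum(comb(n, j) for j in range(a)) / 2 ** n


def expected_regret(bits):
    """Exact expected regret of the assumed Thompson sampling rule on `bits`."""
    ones = zeros = 0
    expected_mistakes = 0.0
    for bit in bits:
        p_one = prob_predict_one(ones, zeros)
        # A mistake happens when the prediction differs from the revealed bit.
        expected_mistakes += (1.0 - p_one) if bit == 1 else p_one
        if bit == 1:
            ones += 1
        else:
            zeros += 1
    # The best constant predictor errs on the minority symbol only.
    return expected_mistakes - min(ones, zeros)


def alternating_then_ones(T, k):
    """k pairs of (0, 1) followed by the remaining T - 2k ones (assumes k < T/2)."""
    return [0, 1] * k + [1] * (T - 2 * k)


def ones_then_zeros(T, k):
    """T - k ones followed by k zeros."""
    return [1] * (T - k) + [0] * k


if __name__ == "__main__":
    T, k = 100, 20
    print("alternating then ones:", expected_regret(alternating_then_ones(T, k)))
    print("ones then zeros:      ", expected_regret(ones_then_zeros(T, k)))
```

Under these assumptions the alternating sequence should show noticeably larger regret than the ones-then-zeros sequence, which matches the qualitative claim in the abstract. The exact constants depend on the assumed prediction rule and may differ from those in the paper.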


research
01/28/2020

Fast Rates for Online Prediction with Abstention

In the setting of sequential prediction of individual {0, 1}-sequences w...
research
07/02/2018

Adaptation to Easy Data in Prediction with Limited Advice

We derive an online learning algorithm with improved regret guarantees f...
research
07/06/2022

Online Bilevel Optimization: Regret Analysis of Online Alternating Gradient Methods

Online optimization is a well-established optimization paradigm that aim...
research
08/18/2012

Online Learning with Predictable Sequences

We present methods for online linear optimization that take advantage of...
research
05/29/2018

Uniform regret bounds over R^d for the sequential linear regression problem with the square loss

We consider the setting of online linear regression for arbitrary determ...
research
11/20/2016

Linear Thompson Sampling Revisited

We derive an alternative proof for the regret of Thompson sampling in...
research
05/28/2021

On Stricter Reachable Repetitiveness Measures*

The size b of the smallest bidirectional macro scheme, which is arguably...
