On Equivalence of Martingale Tail Bounds and Deterministic Regret Inequalities

10/13/2015
by   Alexander Rakhlin, et al.
0

We study an equivalence of (i) deterministic pathwise statements appearing in the online learning literature (termed regret bounds), (ii) high-probability tail bounds for the supremum of a collection of martingales (of a specific form arising from uniform laws of large numbers for martingales), and (iii) in-expectation bounds for the supremum. By virtue of the equivalence, we prove exponential tail bounds for norms of Banach space valued martingales via deterministic regret bounds for the online mirror descent algorithm with an adaptive step size. We extend these results beyond the linear structure of the Banach space: we define a notion of martingale type for general classes of real-valued functions and show its equivalence (up to a logarithmic factor) to various sequential complexities of the class (in particular, the sequential Rademacher complexity and its offset version). For classes with the general martingale type 2, we exhibit a finer notion of variation that allows partial adaptation to the function indexing the martingale. Our proof technique rests on sequential symmetrization and on certifying the existence of regret minimization strategies for certain online prediction problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/22/2021

Adversarial Laws of Large Numbers and Optimal Regret in Online Classification

Laws of large numbers guarantee that given a large enough sample from so...
research
04/23/2019

On laws exhibiting universal ordering under stochastic restart

For each of (i) arbitrary stochastic reset, (ii) deterministic reset wit...
research
01/05/2011

Sparsity regret bounds for individual sequences in online linear regression

We consider the problem of online linear regression on arbitrary determi...
research
08/15/2023

High-Probability Risk Bounds via Sequential Predictors

Online learning methods yield sequential regret bounds under minimal ass...
research
05/29/2018

Uniform regret bounds over R^d for the sequential linear regression problem with the square loss

We consider the setting of online linear regression for arbitrary determ...
research
08/21/2015

Adaptive Online Learning

We propose a general framework for studying adaptive regret bounds in th...
research
08/12/2021

Agnostic Online Learning and Excellent Sets

We use algorithmic methods from online learning to revisit a key idea fr...

Please sign up or login with your details

Forgot password? Click here to reset