Adaptive Online Learning

08/21/2015
by   Dylan J. Foster, et al.
0

We propose a general framework for studying adaptive regret bounds in the online learning framework, including model selection bounds and data-dependent bounds. Given a data- or model-dependent bound we ask, "Does there exist some algorithm achieving this bound?" We show that modifications to recently introduced sequential complexity measures can be used to answer this question by providing sufficient conditions under which adaptive rates can be achieved. In particular each adaptive rate induces a set of so-called offset complexity measures, and obtaining small upper bounds on these quantities is sufficient to demonstrate achievability. A cornerstone of our analysis technique is the use of one-sided tail inequalities to bound suprema of offset random processes. Our framework recovers and improves a wide variety of adaptive bounds including quantile bounds, second-order data-dependent bounds, and small loss bounds. In addition we derive a new type of adaptive bound for online linear optimization based on the spectral norm, as well as a new online PAC-Bayes theorem that holds for countably infinite sets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/13/2017

ZigZag: A new approach to adaptive online learning

We develop a novel family of algorithms for the online learning setting ...
research
09/05/2019

More Adaptive Algorithms for Tracking the Best Expert

In this paper, we consider the problem of prediction with expert advice ...
research
05/31/2023

Online-to-PAC Conversions: Generalization Bounds via Regret Analysis

We present a new framework for deriving bounds on the generalization bou...
research
02/27/2015

Second-order Quantile Methods for Experts and Combinatorial Games

We aim to design strategies for sequential decision making that adjust t...
research
05/31/2022

Online PAC-Bayes Learning

Most PAC-Bayesian bounds hold in the batch learning setting where data i...
research
10/13/2015

On Equivalence of Martingale Tail Bounds and Deterministic Regret Inequalities

We study an equivalence of (i) deterministic pathwise statements appeari...
research
04/04/2012

Relax and Localize: From Value to Algorithms

We show a principled way of deriving online learning algorithms from a m...

Please sign up or login with your details

Forgot password? Click here to reset