Relax and Localize: From Value to Algorithms

04/04/2012
by   Alexander Rakhlin, et al.
0

We show a principled way of deriving online learning algorithms from a minimax analysis. Various upper bounds on the minimax value, previously thought to be non-constructive, are shown to yield algorithms. This allows us to seamlessly recover known methods and to derive new ones. Our framework also captures such "unorthodox" methods as Follow the Perturbed Leader and the R^2 forecaster. We emphasize that understanding the inherent complexity of the learning problem leads to the development of algorithms. We define local sequential Rademacher complexities and associated algorithms that allow us to obtain faster rates in online learning, similarly to statistical learning theory. Based on these localized complexities we build a general adaptive method that can take advantage of the suboptimality of the observed sequence. We present a number of new algorithms, including a family of randomized methods that use the idea of a "random playout". Several new versions of the Follow-the-Perturbed-Leader algorithms are presented, as well as methods based on the Littlestone's dimension, efficient methods for matrix completion with trace norm, and algorithms for the problems of transductive learning and prediction with static experts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/13/2011

Efficient Transductive Online Learning via Randomized Rounding

Most traditional online learning algorithms are based on variants of mir...
research
01/06/2017

Follow the Compressed Leader: Faster Online Learning of Eigenvectors and Faster MMWU

The online problem of computing the top eigenvector is fundamental to ma...
research
02/12/2013

Competing With Strategies

We study the problem of online learning with a notion of regret defined ...
research
01/26/2015

Online Nonparametric Regression with General Loss Functions

This paper establishes minimax rates for online regression with arbitrar...
research
06/13/2020

Follow the Perturbed Leader: Optimism and Fast Parallel Algorithms for Smooth Minimax Games

We consider the problem of online learning and its application to solvin...
research
02/14/2022

An Application of Online Learning to Spacecraft Memory Dump Optimization

In this paper, we present a real-world application of online learning wi...
research
08/21/2015

Adaptive Online Learning

We propose a general framework for studying adaptive regret bounds in th...

Please sign up or login with your details

Forgot password? Click here to reset