Online Learning with Predictable Sequences

08/18/2012
by   Alexander Rakhlin, et al.
0

We present methods for online linear optimization that take advantage of benign (as opposed to worst-case) sequences. Specifically if the sequence encountered by the learner is described well by a known "predictable process", the algorithms presented enjoy tighter bounds as compared to the typical worst case bounds. Additionally, the methods achieve the usual worst-case regret bounds if the sequence is not benign. Our approach can be seen as a way of adding prior knowledge about the sequence within the paradigm of online learning. The setting is shown to encompass partial and side information. Variance and path-length bounds can be seen as particular examples of online learning with simple predictable sequences. We further extend our methods and results to include competing with a set of possible predictable processes (models), that is "learning" the predictable process itself concurrently with using it to obtain better regret guarantees. We show that such model selection is possible under various assumptions on the available feedback. Our results suggest a promising direction of further research with potential applications to stock market and time series prediction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/17/2022

Oracle-Efficient Online Learning for Beyond Worst-Case Adversaries

In this paper, we study oracle-efficient algorithms for beyond worst-cas...
research
01/26/2015

Online Optimization : Competing with Dynamic Comparators

Recent literature on online learning has focused on developing adaptive ...
research
07/24/2018

Decision Variance in Online Learning

Online learning has classically focused on the expected behaviour of lea...
research
05/15/2020

On Learnability under General Stochastic Processes

Statistical learning theory under independent and identically distribute...
research
02/02/2021

Majorizing Measures, Sequential Complexities, and Online Learning

We introduce the technique of generic chaining and majorizing measures f...
research
03/24/2019

Algorithms and Improved bounds for online learning under finite hypothesis class

Online learning is the process of answering a sequence of questions base...
research
06/21/2019

Thompson Sampling for Adversarial Bit Prediction

We study the Thompson sampling algorithm in an adversarial setting, spec...

Please sign up or login with your details

Forgot password? Click here to reset