History-Restricted Online Learning

05/28/2022
βˆ™
by   Jon Schneider, et al.
βˆ™
0
βˆ™

We introduce the concept of history-restricted no-regret online learning algorithms. An online learning algorithm π’œ is M-history-restricted if its output at time t can be written as a function of the M previous rewards. This class of online learning algorithms is quite natural to consider from many perspectives: they may be better models of human agents and they do not store long-term information (thereby ensuring β€œthe right to be forgotten”). We first demonstrate that a natural approach to constructing history-restricted algorithms from mean-based no-regret learning algorithms (e.g. running Hedge over the last M rounds) fails, and that such algorithms incur linear regret. We then construct a history-restricted algorithm that achieves a per-round regret of Θ(1/√(M)), which we complement with a tight lower bound. Finally, we empirically explore distributions where history-restricted online learners have favorable performance compared to other no-regret algorithms.

READ FULL TEXT

page 36

page 37

page 38

research
βˆ™ 11/26/2012

The Interplay Between Stability and Regret in Online Learning

This paper considers the stability of online learning algorithms and its...
research
βˆ™ 02/02/2021

Strongly Adaptive OCO with Memory

Recent progress in online control has popularized online learning with m...
research
βˆ™ 02/11/2020

Online Learning with Imperfect Hints

We consider a variant of the classical online linear optimization proble...
research
βˆ™ 08/09/2014

Normalized Online Learning

We introduce online learning algorithms which are independent of feature...
research
βˆ™ 11/13/2018

A Local Regret in Nonconvex Online Learning

We consider an online learning process to forecast a sequence of outcome...
research
βˆ™ 11/28/2019

Communication-Efficient Distributed Online Learning with Kernels

We propose an efficient distributed online learning protocol for low-lat...
research
βˆ™ 06/30/2021

Koopman Spectrum Nonlinear Regulator and Provably Efficient Online Learning

Most modern reinforcement learning algorithms optimize a cumulative sing...

Please sign up or login with your details

Forgot password? Click here to reset