Online Prediction in Sub-linear Space

07/16/2022
by   Binghui Peng, et al.
0

We provide the first sub-linear space and sub-linear regret algorithm for online learning with expert advice (against an oblivious adversary), addressing an open question raised recently by Srinivas, Woodruff, Xu and Zhou (STOC 2022). We also demonstrate a separation between oblivious and (strong) adaptive adversaries by proving a linear memory lower bound of any sub-linear regret algorithm against an adaptive adversary. Our algorithm is based on a novel pool selection procedure that bypasses the traditional wisdom of leader selection for online learning, and a generic reduction that transforms any weakly sub-linear regret o(T) algorithm to T^1-α regret algorithm, which may be of independent interest. Our lower bound utilizes the connection of no-regret learning and equilibrium computation in zero-sum games, leading to a proof of a strong lower bound against an adaptive adversary.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/03/2023

Near Optimal Memory-Regret Tradeoff for Online Learning

In the experts problem, on each of T days, an agent needs to follow the ...
research
06/27/2012

Online Bandit Learning against an Adaptive Adversary: from Regret to Policy Regret

Online learning algorithms are designed to learn even when their input i...
research
05/22/2023

Hierarchical Partitioning Forecaster

In this work we consider a new family of algorithms for sequential predi...
research
01/24/2023

Gossiped and Quantized Online Multi-Kernel Learning

In instances of online kernel learning where little prior information is...
research
02/19/2019

A sub-quadratic algorithm for the longest common increasing subsequence problem

The Longest Common Increasing Subsequence problem (LCIS) is a natural va...
research
04/01/2018

Online learning with graph-structured feedback against adaptive adversaries

We derive upper and lower bounds for the policy regret of T-round online...
research
02/16/2021

Smoothed Analysis with Adaptive Adversaries

We prove novel algorithmic guarantees for several online problems in the...

Please sign up or login with your details

Forgot password? Click here to reset