New Potential-Based Bounds for Prediction with Expert Advice

11/05/2019
by   Vladimir A. Kobzar, et al.
0

This work addresses the classic machine learning problem of online prediction with expert advice. We consider the finite-horizon version of this zero-sum, two-person game. Using verification arguments from optimal control theory, we view the task of finding better lower and upper bounds on the value of the game (regret) as the problem of finding better sub- and supersolutions of certain partial differential equations (PDEs). These sub- and supersolutions serve as the potentials for player and adversary strategies, which lead to the corresponding bounds. Our techniques extend in a nonasymptotic setting the recent work of Drenska and Kohn (J. Nonlinear Sci. 2019), which showed that the asymptotically optimal value function is the unique solution of an associated nonlinear PDE. To get explicit bounds, we use closed-form solutions of specific PDEs. Our bounds hold for any fixed number of experts and any time-horizon T; in certain regimes (which we identify) they improve upon the previous state-of-the-art. For up to three experts, our bounds provide the asymptotically optimal leading order term. Therefore, we provide a continuum perspective on recent work on optimal strategies for the case of N ≤ 3 experts. We expect that our framework could be used to systematize and advance theory and applications of online learning in other settings as well.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/05/2019

New Potential-Based Bounds for the Geometric-Stopping Version of Prediction with Expert Advice

This work addresses the classic machine learning problem of online predi...
research
04/25/2019

Prediction with Expert Advice: a PDE Perspective

This work addresses a classic problem of online prediction with expert a...
research
07/23/2022

A New Approach to Drifting Games, Based on Asymptotically Optimal Potentials

We develop a new approach to drifting games, a class of two-person games...
research
07/24/2020

A PDE Approach to the Prediction of a Binary Sequence with Advice from Two History-Dependent Experts

The prediction of a binary sequence is a classic example of online machi...
research
07/31/2020

Online Prediction With History-Dependent Experts: The General Case

We study the problem of prediction of binary sequences with expert advic...
research
10/31/2020

Prediction against limited adversary

We study the problem of prediction with expert advice with adversarial c...
research
08/31/2020

Asymptotically optimal strategies for online prediction with history-dependent experts

We establish sharp asymptotically optimal strategies for the problem of ...

Please sign up or login with your details

Forgot password? Click here to reset