Hierarchies of Relaxations for Online Prediction Problems with Evolving Constraints

03/04/2015
by Alexander Rakhlin, et al.

We study online prediction where the regret of the algorithm is measured against a benchmark defined via evolving constraints. This framework captures online prediction on graphs, as well as other prediction problems with combinatorial structure. A key aspect is that finding the optimal benchmark predictor, even in hindsight with all the data available, may be computationally hard because of the combinatorial nature of the constraints. Despite this, we provide polynomial-time prediction algorithms that achieve low regret against combinatorial benchmark sets. We do so by building improper learning algorithms based on two complementary ideas. The first is to alleviate part of the computational burden through random playout; the second is to employ Lasserre semidefinite hierarchies to approximate the resulting integer program. Interestingly, our prediction algorithms only need the values of the semidefinite programs, not the rounded solutions. However, the integrality gap of the Lasserre hierarchy does enter the generic regret bound, which is stated in terms of the Rademacher complexity of the benchmark set. This establishes a trade-off between the computation time and the regret bound of the algorithm.
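
To make the "values, not rounded solutions" remark concrete, here is a minimal sketch of a random-playout prediction step. It is not the authors' implementation, and it rests on several assumptions not stated in the abstract: labels are ±1, the learner outputs a mixed prediction q in [-1, 1], the benchmark set is a tiny, fixed collection of labelings of a path graph with few disagreeing edges, and a brute-force maximum stands in for the value of a Lasserre relaxation. All function names (`benchmark_set`, `best_value`, `predict`) are hypothetical.

```python
import itertools
import random


def benchmark_set(n, max_cuts):
    """All +/-1 labelings of a path on n nodes with at most max_cuts
    disagreeing edges (an assumed toy combinatorial benchmark)."""
    return [
        f for f in itertools.product((-1, 1), repeat=n)
        if sum(f[i] != f[i + 1] for i in range(n - 1)) <= max_cuts
    ]


def best_value(F, weights):
    """max over f in F of <f, weights>. Brute force here; an assumed
    stand-in for the value of a semidefinite relaxation."""
    return max(sum(fi * wi for fi, wi in zip(f, weights)) for f in F)


def predict(F, past_labels, n, rng):
    """Mixed prediction q in [-1, 1] for round t = len(past_labels)."""
    t = len(past_labels)
    # Random playout: fill the unseen future rounds with random signs.
    playout = [rng.choice((-1, 1)) for _ in range(n - t - 1)]
    plus = list(past_labels) + [1] + playout
    minus = list(past_labels) + [-1] + playout
    # Only the two optimal values are used, never an optimal labeling.
    return 0.5 * (best_value(F, plus) - best_value(F, minus))


if __name__ == "__main__":
    n = 6
    F = benchmark_set(n, max_cuts=1)
    rng = random.Random(0)
    past = []
    for y in [1, 1, 1, -1, -1, -1]:
        q = predict(F, past, n, rng)
        print(f"round {len(past) + 1}: q = {q:+.1f}, label = {y:+d}")
        past.append(y)
```

Each prediction is half the difference of two optimization values, so an approximation that returns only values can be substituted for `best_value` without ever producing a feasible labeling. In such a substitution the prediction would need clipping to [-1, 1], and the approximation gap would presumably show up in the regret guarantee, in line with the trade-off the abstract describes.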

research
05/10/2022

Universal Caching

In the learning literature, the performance of an online policy is commo...
research
02/17/2020

Statistically Efficient, Polynomial Time Algorithms for Combinatorial Semi Bandits

We consider combinatorial semi-bandits over a set of arms X⊂{0,1}^d wher...
research
07/12/2019

Online-Learning for min-max discrete problems

We study various discrete nonlinear combinatorial optimization problems ...
research
08/31/2016

A Tutorial on Online Supervised Learning with Applications to Node Classification in Social Networks

We revisit the elegant observation of T. Cover '65 which, perhaps, is no...
research
01/18/2022

Safe Online Bid Optimization with Return-On-Investment and Budget Constraints subject to Uncertainty

In online marketing, the advertisers' goal is usually a tradeoff between...
research
07/15/2020

Improved algorithms for online load balancing

We consider an online load balancing problem and its extensions in the f...
