Smoothed Online Combinatorial Optimization Using Imperfect Predictions

04/23/2022
by   Kai Wang, et al.
0

Smoothed online combinatorial optimization considers a learner who repeatedly chooses a combinatorial decision to minimize an unknown changing cost function with a penalty on switching decisions in consecutive rounds. We study smoothed online combinatorial optimization problems when an imperfect predictive model is available, where the model can forecast the future cost functions with uncertainty. We show that using predictions to plan for a finite time horizon leads to regret dependent on the total predictive uncertainty and an additional switching cost. This observation suggests choosing a suitable planning window to balance between uncertainty and switching cost, which leads to an online algorithm with guarantees on the upper and lower bounds of the cumulative regret. Lastly, we provide an iterative algorithm to approximately solve the planning problem in real-time. Empirically, our algorithm shows a significant improvement in cumulative regret compared to other baselines in synthetic online distributed streaming problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/23/2015

First-order regret bounds for combinatorial semi-bandits

We consider the problem of online combinatorial optimization under semi-...
research
11/28/2019

Understand Dynamic Regret with Switching Cost for Online Decision Making

As a metric to measure the performance of an online method, dynamic regr...
research
03/21/2021

Online Convex Optimization with Continuous Switching Constraint

In many sequential decision making applications, the change of decision ...
research
04/20/2012

Regret in Online Combinatorial Optimization

We address online linear optimization problems when the possible actions...
research
05/02/2022

Smoothed Online Convex Optimization Based on Discounted-Normal-Predictor

In this paper, we investigate an online prediction strategy named as Dis...
research
11/24/2021

Combinatorial Optimization Problems with Balanced Regret

For decision making under uncertainty, min-max regret has been establish...
research
03/17/2015

Importance weighting without importance weights: An efficient algorithm for combinatorial semi-bandits

We propose a sample-efficient alternative for importance weighting for s...

Please sign up or login with your details

Forgot password? Click here to reset