An Online Learning Approach to Optimizing Time-Varying Costs of AoI

05/27/2021
by   Vishrant Tripathi, et al.
0

We consider systems that require timely monitoring of sources over a communication network, where the cost of delayed information is unknown, time-varying and possibly adversarial. For the single source monitoring problem, we design algorithms that achieve sublinear regret compared to the best fixed policy in hindsight. For the multiple source scheduling problem, we design a new online learning algorithm called Follow-the-Perturbed-Whittle-Leader and show that it has low regret compared to the best fixed scheduling policy in hindsight, while remaining computationally feasible. The algorithm and its regret analysis are novel and of independent interest to the study of online restless multi-armed bandit problems. We further design algorithms that achieve sublinear regret compared to the best dynamic policy when the environment is slowly varying. Finally, we apply our algorithms to a mobility tracking problem. We consider non-stationary and adversarial mobility models and illustrate the performance benefit of using our online learning algorithms compared to an oblivious scheduling policy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/06/2023

Online Learning under Adversarial Nonlinear Constraints

In many applications, learning systems are required to process continuou...
research
06/09/2021

ChaCha for Online AutoML

We propose the ChaCha (Champion-Challengers) algorithm for making an onl...
research
01/16/2021

Blind Optimal User Association in Small-Cell Networks

We learn optimal user association policies for traffic from different lo...
research
10/11/2022

Trading Off Resource Budgets for Improved Regret Bounds

In this work we consider a variant of adversarial online learning where ...
research
08/21/2022

Energy-aware Scheduling of Virtualized Base Stations in O-RAN with Online Learning

The design of Open Radio Access Network (O-RAN) compliant systems for co...
research
04/05/2022

Penalised FTRL With Time-Varying Constraints

In this paper we extend the classical Follow-The-Regularized-Leader (FTR...
research
01/22/2022

FALCON: Fast and Accurate Multipath Scheduling using Offline and Online Learning

Multipath transport protocols enable the concurrent use of different net...

Please sign up or login with your details

Forgot password? Click here to reset