An Index Policy for Minimizing the Uncertainty-of-Information of Markov Sources

12/06/2022
by   Gongpu Chen, et al.
0

This paper focuses on the information freshness of finite-state Markov sources, using the uncertainty of information (UoI) as the performance metric. Measured by Shannon's entropy, UoI can capture not only the transition dynamics of the Markov source but also the different evolutions of information quality caused by the different values of the last observation. We consider an information update system with M finite-state Markov sources transmitting information to a remote monitor via m communication channels. Our goal is to explore the optimal scheduling policy to minimize the sum-UoI of the Markov sources. The problem is formulated as a restless multi-armed bandit (RMAB). We relax the RMAB and then decouple the relaxed problem into M single bandit problems. Analyzing the single bandit problem provides useful properties with which the relaxed problem reduces to maximizing a concave and piecewise linear function, allowing us to develop a gradient method to solve the relaxed problem and obtain its optimal policy. By rounding up the optimal policy for the relaxed problem, we obtain an index policy for the original RMAB problem. Notably, the proposed index policy is universal in the sense that it applies to general RMABs with bounded cost functions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/12/2021

Uncertainty-of-Information Scheduling: A Restless Multi-armed Bandit Framework

This paper proposes using the uncertainty of information (UoI), measured...
research
08/27/2019

A Whittle Index Approach to Minimizing Functions of Age of Information

We consider a setting where multiple active sources send real-time updat...
research
09/01/2023

Controlled Martingale Problems And Their Markov Mimics

In this article we prove under suitable assumptions that the marginals o...
research
03/15/2016

Optimal Sensing via Multi-armed Bandit Relaxations in Mixed Observability Domains

Sequential decision making under uncertainty is studied in a mixed obser...
research
09/18/2022

DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs

We consider the problem of learning the optimal threshold policy for con...
research
07/06/2023

PCL-Indexability and Whittle Index for Restless Bandits with General Observation Models

In this paper, we consider a general observation model for restless mult...
research
03/19/2018

Impulsive Control for G-AIMD Dynamics with Relaxed and Hard Constraints

Motivated by various applications from Internet congestion control to po...

Please sign up or login with your details

Forgot password? Click here to reset