On Monotonicity of the Optimal Transmission Policy in Cross-layer Adaptive m-QAM Modulation

08/21/2015
by   Ni Ding, et al.
0

This paper considers a cross-layer adaptive modulation system that is modeled as a Markov decision process (MDP). We study how to utilize the monotonicity of the optimal transmission policy to relieve the computational complexity of dynamic programming (DP). In this system, a scheduler controls the bit rate of the m-quadrature amplitude modulation (m-QAM) in order to minimize the long-term losses incurred by the queue overflow in the data link layer and the transmission power consumption in the physical layer. The work is done in two steps. Firstly, we observe the L-natural-convexity and submodularity of DP to prove that the optimal policy is always nondecreasing in queue occupancy/state and derive the sufficient condition for it to be nondecreasing in both queue and channel states. We also show that, due to the L-natural-convexity of DP, the variation of the optimal policy in queue state is restricted by a bounded marginal effect: The increment of the optimal policy between adjacent queue states is no greater than one. Secondly, we use the monotonicity results to present two low complexity algorithms: monotonic policy iteration (MPI) based on L-natural-convexity and discrete simultaneous perturbation stochastic approximation (DSPSA). We run experiments to show that the time complexity of MPI based on L-natural-convexity is much lower than that of DP and the conventional MPI that is based on submodularity and DSPSA is able to adaptively track the optimal policy when the system parameters change.

READ FULL TEXT

page 14

page 18

research
10/08/2019

Counterexamples on the monotonicity of delay optimal strategies for energy harvesting transmitters

We consider cross-layer design of delay optimal transmission strategies ...
research
09/28/2020

Delay Optimal Cross-Layer Scheduling Over Markov Channels with Power Constraint

We consider a scenario where a power constrained transmitter delivers ra...
research
05/14/2012

Approximate Modified Policy Iteration

Modified policy iteration (MPI) is a dynamic programming (DP) algorithm ...
research
03/02/2020

Timely Synchronization with Sporadic Status Changes

In this paper, we consider a status updating system where the transmitte...
research
01/30/2023

SMDP-Based Dynamic Batching for Efficient Inference on GPU-Based Platforms

In up-to-date machine learning (ML) applications on cloud or edge comput...
research
01/13/2018

Queue-aware Energy Efficient Control for Dense Wireless Networks

We consider the problem of long term power allocation in dense wireless ...
research
01/24/2022

Structural Properties of Optimal Fidelity Selection Policies for Human-in-the-loop Queues

We study optimal fidelity selection for a human operator servicing a que...

Please sign up or login with your details

Forgot password? Click here to reset