Accelerated Structure-Aware Reinforcement Learning for Delay-Sensitive Energy Harvesting Wireless Sensors

07/22/2018
by   Nikhilesh Sharma, et al.
0

We investigate an energy-harvesting wireless sensor transmitting latency-sensitive data over a fading channel. The sensor injects captured data packets into its transmission queue and relies on ambient energy harvested from the environment to transmit them. We aim to find the optimal scheduling policy that decides whether or not to transmit the queue's head-of-line packet at each transmission opportunity such that the expected packet queuing delay is minimized given the available harvested energy. No prior knowledge of the stochastic processes that govern the channel, captured data, or harvested energy dynamics are assumed, thereby necessitating the use of online learning to optimize the scheduling policy. We formulate this scheduling problem as a Markov decision process (MDP) and analyze the structural properties of its optimal value function. In particular, we show that it is non-decreasing and has increasing differences in the queue backlog and that it is non-increasing and has increasing differences in the battery state. We exploit this structure to formulate a novel accelerated reinforcement learning (RL) algorithm to solve the scheduling problem online at a much faster learning rate, while limiting the induced computational complexity. Our experiments demonstrate that the proposed algorithm closely approximates the performance of an optimal offline solution that requires a priori knowledge of the channel, captured data, and harvested energy dynamics. Simultaneously, by leveraging the value function's structure, our approach achieves competitive performance relative to a state-of-the-art RL algorithm, at potentially orders of magnitude lower complexity. Finally, considerable performance gains are demonstrated over the well-known Q-learning algorithm.

READ FULL TEXT
research
03/26/2018

Structural Properties of Optimal Transmission Policies for Delay-Sensitive Energy Harvesting Wireless Sensors

We consider an energy harvesting sensor transmitting latency-sensitive d...
research
10/08/2019

Counterexamples on the monotonicity of delay optimal strategies for energy harvesting transmitters

We consider cross-layer design of delay optimal transmission strategies ...
research
08/23/2013

Delay Optimal Scheduling for Energy Harvesting Based Communications

Green communication attracts increasing research interest recently. Equi...
research
09/28/2020

Delay Optimal Cross-Layer Scheduling Over Markov Channels with Power Constraint

We consider a scenario where a power constrained transmitter delivers ra...
research
08/23/2018

Reinforcement Learning Approach for RF-Powered Cognitive Radio Network with Ambient Backscatter

For an RF-powered cognitive radio network with ambient backscattering ca...
research
05/17/2018

Fast reinforcement learning for decentralized MAC optimization

In this paper, we propose a novel decentralized framework for optimizing...
research
03/01/2022

Distributional Reinforcement Learning for Scheduling of (Bio)chemical Production Processes

Reinforcement Learning (RL) has recently received significant attention ...

Please sign up or login with your details

Forgot password? Click here to reset