Deep Reinforcement Learning with Symmetric Prior for Predictive Power Allocation to Mobile Users

02/10/2021
by   Jianyu Zhao, et al.
0

Deep reinforcement learning has been applied for a variety of wireless tasks, which is however known with high training and inference complexity. In this paper, we resort to deep deterministic policy gradient (DDPG) algorithm to optimize predictive power allocation among K mobile users requesting video streaming, which minimizes the energy consumption of the network under the no-stalling constraint of each user. To reduce the sampling complexity and model size of the DDPG, we exploit a kind of symmetric prior inherent in the actor and critic networks: permutation invariant and equivariant properties, to design the neural networks. Our analysis shows that the free model parameters of the DDPG can be compressed by 2/K^2. Simulation results demonstrate that the episodes required by the learning model with the symmetric prior to achieve the same performance as the vanilla policy reduces by about one third when K = 10.

READ FULL TEXT
research
03/08/2022

Graph Reinforcement Learning for Predictive Power Allocation to Mobile Users

Allocating resources with future channels can save resource to ensure qu...
research
03/21/2020

Accelerating Deep Reinforcement Learning With the Aid of a Partial Model: Power-Efficient Predictive Video Streaming

Predictive power allocation is conceived for power-efficient video strea...
research
01/22/2019

Power Allocation in Multi-User Cellular Networks: Deep Reinforcement Learning Approaches

The model-based power allocation algorithm has been investigated for dec...
research
09/14/2020

Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile Networks

Deep reinforcement learning offers a model-free alternative to supervise...
research
02/04/2023

Generalization of Deep Reinforcement Learning for Jammer-Resilient Frequency and Power Allocation

We tackle the problem of joint frequency and power allocation while emph...
research
01/20/2022

A Prescriptive Dirichlet Power Allocation Policy with Deep Reinforcement Learning

Prescribing optimal operation based on the condition of the system and, ...
research
03/30/2018

Cache-Enabled Dynamic Rate Allocation via Deep Self-Transfer Reinforcement Learning

Caching and rate allocation are two promising approaches to support vide...

Please sign up or login with your details

Forgot password? Click here to reset