Deep Reinforcement Learning with Symmetric Prior for Predictive Power Allocation to Mobile Users

by Jianyu Zhao, et al.
Beihang University
NetEase, Inc

Deep reinforcement learning has been applied to a variety of wireless tasks, but it is known to incur high training and inference complexity. In this paper, we resort to the deep deterministic policy gradient (DDPG) algorithm to optimize predictive power allocation among K mobile users requesting video streaming, with the goal of minimizing the energy consumption of the network under the no-stalling constraint of each user. To reduce the sampling complexity and model size of the DDPG, we exploit a symmetric prior inherent in the actor and critic networks, namely their permutation invariance and equivariance properties, to design the neural networks. Our analysis shows that the free model parameters of the DDPG can be compressed by 2/K^2. Simulation results demonstrate that, when K = 10, the number of episodes required by the learning model with the symmetric prior to achieve the same performance as the vanilla policy is reduced by about one third.
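
To make the symmetric prior concrete, the following is a minimal sketch of a permutation-equivariant layer and a permutation-invariant readout. It is not the authors' architecture: it assumes one scalar feature per user, and the names `equivariant_layer`, `invariant_readout`, and `weight_ratio` are hypothetical. In this simplified setting, a dense K x K weight matrix is replaced by one shared diagonal weight `u` and one shared off-diagonal weight `v`, which leaves 2 free weights out of K^2. That ratio matches the 2/K^2 figure quoted in the abstract, but whether the paper derives it this way is an assumption.

```python
import random

def equivariant_layer(x, u, v, b=0.0):
    """Permutation-equivariant layer for K users, one scalar feature each.

    Assumption for illustration: equivalent to a dense layer whose K x K
    weight matrix has `u` on the diagonal and `v` everywhere else, so only
    2 weights are free instead of K * K.
    """
    total = sum(x)
    # y_k = ReLU(u * x_k + v * (sum of the other users' features) + b)
    return [max(0.0, u * xk + v * (total - xk) + b) for xk in x]

def invariant_readout(h):
    """Permutation-invariant readout (e.g. a critic-style scalar): sum over users."""
    return sum(h)

def weight_ratio(K):
    """Free weights of the shared layer relative to a dense K x K layer."""
    return 2 / (K * K)

K = 10
x = [random.uniform(0.0, 1.0) for _ in range(K)]
perm = list(range(K))
random.shuffle(perm)

y = equivariant_layer(x, u=0.7, v=-0.1)
y_perm = equivariant_layer([x[i] for i in perm], u=0.7, v=-0.1)

# Equivariance: permuting the inputs permutes the outputs in the same way.
equivariant_ok = all(abs(y_perm[j] - y[perm[j]]) < 1e-9 for j in range(K))
# Invariance: the summed readout does not depend on the user ordering.
invariant_ok = abs(invariant_readout(y) - invariant_readout(y_perm)) < 1e-9
print(equivariant_ok, invariant_ok, weight_ratio(K))
```

The design choice illustrated here is weight sharing across users: because reordering the users should only reorder the per-user outputs (actor) or leave a scalar value unchanged (critic), the layer does not need a separate weight for every user pair.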


Graph Reinforcement Learning for Predictive Power Allocation to Mobile Users

Allocating resources with future channels can save resource to ensure qu...

Accelerating Deep Reinforcement Learning With the Aid of a Partial Model: Power-Efficient Predictive Video Streaming

Predictive power allocation is conceived for power-efficient video strea...

Power Allocation in Multi-User Cellular Networks: Deep Reinforcement Learning Approaches

The model-based power allocation algorithm has been investigated for dec...

Joint Power Allocation and Beamformer for mmW-NOMA Downlink Systems by Deep Reinforcement Learning

The high demand for data rate in the next generation of wireless communi...

A Prescriptive Dirichlet Power Allocation Policy with Deep Reinforcement Learning

Prescribing optimal operation based on the condition of the system and, ...

Generalization of Deep Reinforcement Learning for Jammer-Resilient Frequency and Power Allocation

We tackle the problem of joint frequency and power allocation while emph...

AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning

Deep reinforcement learning has achieved great success in various fields...