Reinforcement Learning Based Power Control for Reliable Wireless Transmission

02/13/2022
by Chongtao Guo, et al.

In this paper, we investigate a sequential power allocation problem over fast-varying channels, aiming to minimize the expected sum power while guaranteeing the transmission success probability. In particular, we construct a reinforcement learning framework with an appropriately designed reward so that the optimal policy maximizes the Lagrangian of the primal problem, and we show that the maximizer of the Lagrangian has several desirable properties. For the model-based case, we propose a fast-converging algorithm that finds the optimal Lagrange multiplier and thus the corresponding optimal policy. For the model-free case, we develop a three-stage strategy consisting, in order, of online sampling, offline learning, and online operation, in which a backward Q-learning scheme that fully exploits the sampled channel realizations is designed to accelerate the learning process. Simulation results indicate that the proposed reinforcement learning framework solves the primal optimization problem from the dual perspective. Moreover, the model-free strategy achieves performance close to that of the optimal model-based algorithm.
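
The dual approach described in the abstract can be illustrated with a minimal sketch. The toy model here is an assumption made for illustration and is not taken from the paper: a packet must be delivered within `horizon` slots, the channel state is i.i.d. across slots and observed before transmitting, and `cost[i]` is the hypothetical minimum power that guarantees success in state `i`. For a fixed multiplier `lam`, backward induction maximizes the Lagrangian (a reward of `lam` on success minus the power spent), and a bisection on `lam` then looks for the smallest multiplier that meets the success-probability target. The function names and the numbers are likewise hypothetical.

```python
import numpy as np


def evaluate_multiplier(lam, cost, prob, horizon):
    """Backward induction for a fixed multiplier (toy model, assumed).

    Returns (expected sum power, success probability) of the policy that
    maximizes lam * P(success) - E[sum power].
    """
    value = power = success = 0.0  # after the deadline nothing is gained or spent
    for _ in range(horizon):  # iterate from the last slot back to the first
        # Transmit in states where the immediate net reward beats waiting.
        transmit = (lam - cost) > value
        p_tx = prob[transmit].sum()
        value = float(np.sum(prob * np.where(transmit, lam - cost, value)))
        power = float(np.sum(prob[transmit] * cost[transmit])) + (1.0 - p_tx) * power
        success = p_tx + (1.0 - p_tx) * success
    return power, success


def find_multiplier(cost, prob, horizon, target, tol=1e-9):
    """Bisection for the smallest multiplier whose policy meets the target."""
    lo, hi = 0.0, 1.0
    while evaluate_multiplier(hi, cost, prob, horizon)[1] < target:
        hi *= 2.0  # grow the bracket until the constraint is satisfied
        if hi > 1e12:
            raise ValueError("target success probability is not reachable")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if evaluate_multiplier(mid, cost, prob, horizon)[1] >= target:
            hi = mid
        else:
            lo = mid
    return hi


# Hypothetical channel model: four states with their required powers.
cost = np.array([4.0, 2.0, 1.0, 0.5])
prob = np.array([0.4, 0.3, 0.2, 0.1])
target = 0.95

lam = find_multiplier(cost, prob, horizon=4, target=target)
power, success = evaluate_multiplier(lam, cost, prob, horizon=4)
print(f"multiplier={lam:.4f}, power={power:.4f}, success={success:.4f}")
```

In this toy model the success probability is a nondecreasing step function of the multiplier, which is why a simple bisection suffices. With discrete channel states the target may be exceeded rather than met with equality, and the paper's own algorithm and system model may differ substantially from this sketch.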

