Model-Free Learning of Optimal Ergodic Policies in Wireless Systems

11/10/2019
by   Dionysios S. Kalogerias, et al.
15

Learning optimal resource allocation policies in wireless systems can be effectively achieved by formulating finite dimensional constrained programs which depend on system configuration, as well as the adopted learning parameterization. The interest here is in cases where system models are unavailable, prompting methods that probe the wireless system with candidate policies, and then use observed performance to determine better policies. This generic procedure is difficult because of the need to cull accurate gradient estimates out of these limited system queries. This paper constructs and exploits smoothed surrogates of constrained ergodic resource allocation problems, the gradients of the former being representable exactly as averages of finite differences that can be obtained through limited system probing. Leveraging this unique property, we develop a new model-free primal-dual algorithm for learning optimal ergodic resource allocations, while we rigorously analyze the relationships between original policy search problems and their surrogates, in both primal and dual domains. First, we show that both primal and dual domain surrogates are uniformly consistent approximations of their corresponding original finite dimensional counterparts. Upon further assuming the use of near-universal policy parameterizations, we also develop explicit bounds on the gap between optimal values of initial, infinite dimensional resource allocation problems, and dual values of their parameterized smoothed surrogates. In fact, we show that this duality gap decreases at a linear rate relative to smoothing and universality parameters. Thus, it can be made arbitrarily small at will, also justifying our proposed primal-dual algorithmic recipe. Numerical simulations confirm the effectiveness of our approach.

READ FULL TEXT

page 1

page 9

page 10

page 11

page 12

research
07/21/2018

Learning Optimal Resource Allocations in Wireless Systems

This paper considers the design of optimal resource allocation policies ...
research
08/23/2021

Model-Free Learning of Optimal Deterministic Resource Allocations in Wireless Systems via Action-Space Exploration

Wireless systems resource allocation refers to perpetual and challenging...
research
05/01/2023

Robust and Reliable Stochastic Resource Allocation via Tail Waterfilling

Stochastic allocation of resources in the context of wireless systems ul...
research
11/22/2020

Primal-dual Learning for the Model-free Risk-constrained Linear Quadratic Regulator

Risk-aware control, though with promise to tackle unexpected events, req...
research
06/21/2019

Optimal WDM Power Allocation via Deep Learning for Radio on Free Space Optics Systems

Radio on Free Space Optics (RoFSO), as a universal platform for heteroge...
research
03/07/2022

Learning Resilient Radio Resource Management Policies with Graph Neural Networks

We consider the problems of downlink user selection and power control in...
research
01/29/2022

Learning Stochastic Graph Neural Networks with Constrained Variance

Stochastic graph neural networks (SGNNs) are information processing arch...

Please sign up or login with your details

Forgot password? Click here to reset