Learning Robust Scheduling with Search and Attention

11/15/2021
by   David Sandberg, et al.
0

Allocating physical layer resources to users based on channel quality, buffer size, requirements and constraints represents one of the central optimization problems in the management of radio resources. The solution space grows combinatorially with the cardinality of each dimension making it hard to find optimal solutions using an exhaustive search or even classical optimization algorithms given the stringent time requirements. This problem is even more pronounced in MU-MIMO scheduling where the scheduler can assign multiple users to the same time-frequency physical resources. Traditional approaches thus resort to designing heuristics that trade optimality in favor of feasibility of execution. In this work we treat the MU-MIMO scheduling problem as a tree-structured combinatorial problem and, borrowing from the recent successes of AlphaGo Zero, we investigate the feasibility of searching for the best performing solutions using a combination of Monte Carlo Tree Search and Reinforcement Learning. To cater to the nature of the problem at hand, like the lack of an intrinsic ordering of the users as well as the importance of dependencies between combinations of users, we make fundamental modifications to the neural network architecture by introducing the self-attention mechanism. We then demonstrate that the resulting approach is not only feasible but vastly outperforms state-of-the-art heuristic-based scheduling approaches in the presence of measurement uncertainties and finite buffers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/04/2012

Intelligent Search Heuristics for Cost Based Scheduling

Nurse scheduling is a difficult optimization problem with multiple const...
research
10/22/2020

Exploring search space trees using an adapted version of Monte Carlo tree search for combinatorial optimization problems

In this article, a novel approach to solve combinatorial optimization pr...
research
07/24/2020

Monte-Carlo Tree Search as Regularized Policy Optimization

The combination of Monte-Carlo tree search (MCTS) with deep reinforcemen...
research
04/04/2021

SOLO: Search Online, Learn Offline for Combinatorial Optimization Problems

We study combinatorial problems with real world applications such as mac...
research
10/12/2011

Proactive Algorithms for Job Shop Scheduling with Probabilistic Durations

Most classical scheduling formulations assume a fixed and known duration...
research
05/18/2018

Multifunction Cognitive Radar Task Scheduling Using Monte Carlo Tree Search and Policy Networks

A modern radar may be designed to perform multiple functions, such as su...
research
12/24/2021

Gibbs Distribution Based Antenna Splitting and User Scheduling in Full Duplex Massive MIMO Systems

A Gibbs distribution based combinatorial optimization algorithm for join...

Please sign up or login with your details

Forgot password? Click here to reset