Learning and Information in Stochastic Networks and Queues

05/18/2021
by   Neil Walton, et al.
9

We review the role of information and learning in the stability and optimization of queueing systems. In recent years, techniques from supervised learning, bandit learning and reinforcement learning have been applied to queueing systems supported by increasing role of information in decision making. We present observations and new results that help rationalize the application of these areas to queueing systems. We prove that the MaxWeight and BackPressure policies are an application of Blackwell's Approachability Theorem. This connects queueing theoretic results with adversarial learning. We then discuss the requirements of statistical learning for service parameter estimation. As an example, we show how queue size regret can be bounded when applying a perceptron algorithm to classify service. Next, we discuss the role of state information in improved decision making. Here we contrast the roles of epistemic information (information on uncertain parameters) and aleatoric information (information on an uncertain state). Finally we review recent advances in the theory of reinforcement learning and queueing, as well as, provide discussion on current research challenges.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/27/2021

The Statistical Complexity of Interactive Decision Making

A fundamental challenge in interactive learning and decision making, ran...
research
06/27/2022

On the Complexity of Adversarial Decision Making

A central problem in online learning and decision making – from bandits ...
research
11/02/2020

Augmenting Organizational Decision-Making with Deep Learning Algorithms: Principles, Promises, and Challenges

The current expansion of theory and research on artificial intelligence ...
research
01/12/2023

Approximate Information States for Worst-Case Control and Learning in Uncertain Systems

In this paper, we investigate discrete-time decision-making problems in ...
research
11/16/2020

Blind Decision Making: Reinforcement Learning with Delayed Observations

Reinforcement learning typically assumes that the state update from the ...
research
04/19/2021

Generalized-TODIM Method for Multi-criteria Decision Making with Basic Uncertain Information and its Application

Due to the fact that basic uncertain information provides a simple form ...
research
06/26/2019

Rethinking Formal Models of Partially Observable Multiagent Decision Making

Multiagent decision-making problems in partially observable environments...

Please sign up or login with your details

Forgot password? Click here to reset