Transportation Polytope and its Applications in Parallel Server Systems

08/11/2021
by   Sushil Mahavir Varma, et al.
0

Parallel server system is a stochastic processing network widely studied in the context of manufacturing, supply chain, ride-hailing, call centers, etc. Heterogeneous customers arrive into the system and only a subset of servers can serve any given customer type depending on the flexibility graph. As the flexibility can be overlapping, scheduling decisions must be made to minimize the delay experienced by the customers. Exact analysis of delay is not possible and so, we consider the heavy traffic asymptotic regime, wherein the arrival rate is loaded up to approach the service rate. We consider the general case when the so called complete resource pooling (CRP) is not satisfied. Recent work established that when the MaxWeight scheduling algorithm is used, the state space collapses (SSC) into a lower dimensional sub-space. Building upon this result, the goal of our paper is to design, analyze and improve the flexibility graph such that the dimension of SSC is minimized. First, we characterize the SSC and thus, the mean delay performance in terms of a given flexibility graph. Using this result, we next study the problem of designing the sparsest flexibility graph that leads to a target SSC dimension. We establish a necessary and sufficient condition on the number of edges required, and provide an algorithm to construct such a graph. Finally, we consider the question of how to improve a given flexibility graph if one is allowed to add a single additional edge. The above results are obtained by identifying a connection to the transportation polytope, and adding to a long line of literature, we develop new theoretical results for it. These results are therefore of independent interest. In particular, we obtain new results on the extreme points and the so-called support graphs of the transportation polytope.

READ FULL TEXT
research
06/08/2018

Flexible Load Balancing with Multi-dimensional State-space Collapse: Throughput and Heavy-traffic Delay Optimality

Heavy traffic analysis for load balancing policies has relied heavily on...
research
08/21/2018

Heavy-traffic Delay Optimality in Pull-based Load Balancing Systems: Necessary and Sufficient Conditions

In this paper, we consider a load balancing system under a general pull-...
research
05/28/2021

Parallel server systems with cancel-on-completion redundancy

We consider a parallel server system with so-called cancel-on-completion...
research
06/11/2017

Group-Server Queues

By analyzing energy-efficient management of data centers, this paper pro...
research
09/23/2017

GB-PANDAS: Throughput and heavy-traffic optimality analysis for affinity scheduling

Dynamic affinity scheduling has been an open problem for nearly three de...
research
10/01/2017

Delay Asymptotics and Bounds for Multi-Task Parallel Jobs

We study delay of jobs that consist of multiple parallel tasks, which is...
research
07/03/2020

Flexibility in an asymmetric system with prolonged service time at non-dedicated servers

The prolonged service time at non-dedicated servers has been observed in...

Please sign up or login with your details

Forgot password? Click here to reset