Asymptotic response time analysis for multi-task parallel jobs

10/01/2017
by   Weina Wang, et al.
0

The response time of jobs with multiple parallel tasks is a critical performance metric in many systems, including MapReduce systems, coded data storage systems, etc. However, tight analytical characterizations of the response time of such jobs are largely unknown except for highly degenerate cases. The difficulty is rooted in the fact that a job with multiple tasks is considered complete only when all of its tasks complete processing; i.e., the job response time is the maximum of the response times of its tasks, which is hard to analyze since these task response times are generally not independent. In this paper, we approach this problem by studying when the response times of a job's tasks are close to being independent. We consider a limited fork-join model with n servers, where each job consists of k^(n)< n tasks. Upon arrival, each job chooses k^(n) distinct servers uniformly at random and sends one task to each server. We assume Poisson job arrivals and generally distributed task service times. We establish that under the condition k^(n) = o(n^1/4), the steady state response times at any k^(n) servers are asymptotically independent, as n grows large. This result greatly generalizes the asymptotic-independence type of results in the literature where asymptotic independence is shown only for a fixed constant number of queues. We then further show that the job response time converges to the maximum of independent task response times, in a proper sense. This gives the first asymptotically tight analytical characterization of the response time of a multi-task parallel job. To complement the asymptotic independence result, we also show that when k^(n)=Θ(n), any number of multiple queues are not asymptotically independent. Analysis for the regime of k^(n) between o(n^1/4) and Θ(n) remains open.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/01/2017

Delay Asymptotics and Bounds for Multi-Task Parallel Jobs

We study delay of jobs that consist of multiple parallel tasks, which is...
research
10/21/2019

Delay-optimal policies in partial fork-join systems with redundancy and random slowdowns

We consider a large distributed service system consisting of n homogeneo...
research
10/15/2021

Time to critical condition in emergency services

Providing uninterrupted response service is of paramount importance for ...
research
04/05/2020

Achieving Zero Asymptotic Queueing Delay for Parallel Jobs

Zero queueing delay is highly desirable in large-scale computing systems...
research
06/09/2021

Non-Parametric Stochastic Sequential Assignment With Random Arrival Times

We consider a problem wherein jobs arrive at random times and assume ran...
research
07/08/2022

Tackling Heterogeneous Traffic in Multi-access Systems via Erasure Coded Servers

Most data generated by modern applications is stored in the cloud, and t...
research
06/10/2020

Product Forms for FCFS Queueing Models with Arbitrary Server-Job Compatibilities: An Overview

In recent years a number of models involving different compatibilities b...

Please sign up or login with your details

Forgot password? Click here to reset