Detecting Straggler MapReduce Tasks in Big Data Processing Infrastructure by Neural Network

04/13/2020
by   Amir Javadpour, et al.
0

Straggler task detection is one of the main challenges in applying MapReduce for parallelizing and distributing large-scale data processing. It is defined as detecting running tasks on weak nodes. Considering two stages in the Map phase copy, combine and three stages of Reduce shuffle, sort and reduce, the total execution time is the total sum of the execution time of these five stages. Estimating the correct execution time in each stage that results in correct total execution time is the primary purpose of this paper. The proposed method is based on the application of a backpropagation Neural Network NN on the Hadoop for the detection of straggler tasks, to estimate the remaining execution time of tasks that is very important in straggler task detection. Results achieved have been compared with popular algorithms in this domain such as LATE, ESAMR and the real remaining time for WordCount and Sort benchmarks, and shown able to detect straggler tasks and estimate execution time accurately. Besides, it supports to accelerate task execution time.

READ FULL TEXT

page 9

page 14

research
06/26/2018

Improving tasks throughput on accelerators using OpenCL command concurrency

A heterogeneous architecture composed by a host and an accelerator must ...
research
09/28/2019

Profiling minisat based on user defined execution time – GPROF

This paper focuses on the explanation of the architecture of profilers p...
research
07/08/2021

Optimizing Data Processing in Space for Object Detection in Satellite Imagery

There is a proliferation in the number of satellites launched each year,...
research
12/10/2020

Descriptive and Predictive Analysis of Aggregating Functions in Serverless Clouds: the Case of Video Streaming

Serverless clouds allocate multiple tasks (e.g., micro-services) from mu...
research
11/25/2022

strategFTO: Untimed control for timed opacity

We introduce a prototype tool strategFTO addressing the verification of ...
research
08/29/2023

SimSched: A tool for Simulating Autosar Implementaion in Simulink

AUTOSAR (AUTomotive Open System ARchitecture) is an open industry standa...
research
04/26/2019

Simulating Execution Time of Tensor Programs using Graph Neural Networks

Optimizing the execution time of tensor program, e.g., a convolution, in...

Please sign up or login with your details

Forgot password? Click here to reset