Scheduling Algorithms for Efficient Execution of Stream Workflow Applications in Multicloud Environments

12/18/2019
by Mutaz Barika, et al.

Big data processing applications are becoming increasingly complex. They are no longer monolithic; instead, they are composed of decoupled analytical processes arranged as a workflow. One such type is the stream workflow application, which integrates multiple streaming big data applications to support decision making. Each analytical component of such an application runs continuously and processes data streams whose velocity depends on several factors, such as network bandwidth and the processing rate of the parent analytical component. As a consequence, executing these applications in cloud environments requires advanced scheduling techniques that adhere to the end user's requirements for data processing and for decision-making deadlines. In this paper, we propose two multicloud scheduling and resource allocation techniques for the efficient execution of stream workflow applications in multicloud environments, adhering to workflow application and user performance requirements while reducing execution cost. Results show that the proposed genetic algorithm is adequate and effective in all experiments.

