Optimal Data Placement for Data-Sharing Scientific Workflows in Heterogeneous Edge-Cloud Computing Environments

04/13/2021
by   Xin Du, et al.

The heterogeneous edge-cloud computing paradigm offers a better way to deploy scientific workflows than traditional distributed computing or cloud computing environments. Because scientific datasets differ in size and some of them must be kept private, finding a data placement strategy that minimizes both data transmission time and placement cost remains a difficult problem. To address this issue, this paper combines the advantages of edge and cloud computing to construct a data placement model that balances data transfer time and data placement cost using intelligent computation. The most difficult research challenge the model addresses is handling the many constraints of this hybrid computing environment, including datasets shared within individual workflows and among multiple workflows across various geographical regions. Based on the constructed model, the study proposes a new data placement strategy named DE-DPSO-DPS, which uses a discrete particle swarm optimization algorithm with differential evolution (DE-DPSO-DPA) to distribute these scientific datasets. The strategy not only considers characteristics such as the number and storage capacity of edge micro-datacenters, the bandwidth between different datacenters, and the proportion of private datasets, but also analyzes the performance of the algorithm during workflow execution. Comprehensive experiments in simulated heterogeneous edge-cloud computing environments demonstrate that the data placement strategy can effectively reduce data transmission time and placement cost compared to traditional strategies for data-sharing scientific workflows.
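To make the trade-off described in the abstract concrete, the following is a minimal sketch of a placement fitness function and a small discrete swarm search with a differential-evolution-style donor step. It is not the paper's DE-DPSO-DPA: the dataset sizes, capacities, prices, bandwidths, task locations, weighting, and update probabilities are all hypothetical values chosen for illustration, and the update rule is a simplified stand-in.

```python
import random

# All values below are hypothetical illustration data, not taken from the paper.
SIZES = [4.0, 2.0, 6.0, 3.0]                 # dataset sizes
CAPACITY = [8.0, 8.0, float("inf")]          # datacenters 0 and 1 are edge, 2 is cloud
PRICE = [0.5, 0.5, 0.1]                      # storage price per unit of data
BANDWIDTH = [[0, 10, 2], [10, 0, 2], [2, 2, 0]]  # link speed between datacenters
PRIVATE = {0: 0}                             # dataset 0 is private to datacenter 0
TASKS = [(0, [0, 1]), (1, [1, 2]), (2, [2, 3])]  # (datacenter running the task, datasets it reads)


def fitness(placement, weight=0.5):
    """Weighted sum of transfer time and storage cost; infeasible placements score inf."""
    for ds, dc in PRIVATE.items():
        if placement[ds] != dc:
            return float("inf")              # private dataset moved: not allowed
    used = [0.0] * len(CAPACITY)
    for ds, dc in enumerate(placement):
        used[dc] += SIZES[ds]
    if any(u > c for u, c in zip(used, CAPACITY)):
        return float("inf")                  # storage capacity exceeded
    transfer = sum(SIZES[ds] / BANDWIDTH[placement[ds]][dc]
                   for dc, needed in TASKS for ds in needed
                   if placement[ds] != dc)   # only remote datasets are transferred
    cost = sum(SIZES[ds] * PRICE[dc] for ds, dc in enumerate(placement))
    return weight * transfer + (1 - weight) * cost


def search(particles=12, iterations=60, seed=0):
    """Simplified discrete swarm search (assumed update rule, not the paper's)."""
    rng = random.Random(seed)
    n_ds, n_dc = len(SIZES), len(CAPACITY)
    # Feasible baseline: private data stays put, everything else goes to the cloud.
    baseline = [PRIVATE.get(ds, n_dc - 1) for ds in range(n_ds)]
    swarm = [baseline[:]] + [
        [PRIVATE.get(ds, rng.randrange(n_dc)) for ds in range(n_ds)]
        for _ in range(particles - 1)
    ]
    personal = [p[:] for p in swarm]
    best = min(swarm, key=fitness)[:]
    for _ in range(iterations):
        for i in range(particles):
            donor = rng.choice(swarm)        # DE-style donor drawn from the population
            trial = []
            for ds in range(n_ds):
                r = rng.random()
                if r < 0.4:
                    trial.append(best[ds])           # follow the global best
                elif r < 0.7:
                    trial.append(personal[i][ds])    # follow the personal best
                elif r < 0.9:
                    trial.append(donor[ds])          # borrow from the donor
                else:
                    trial.append(rng.randrange(n_dc))  # random mutation
            for ds, dc in PRIVATE.items():
                trial[ds] = dc               # repair: keep private datasets fixed
            swarm[i] = trial
            if fitness(trial) < fitness(personal[i]):
                personal[i] = trial[:]
            if fitness(trial) < fitness(best):
                best = trial[:]
    return best, fitness(best)
```

Calling `search()` returns a placement list (one datacenter index per dataset) and its score. Seeding the swarm with the all-cloud baseline guarantees that at least one feasible placement exists, so the returned score is never worse than that baseline.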

research
05/14/2022

Scientific Workflows in Heterogeneous Edge-Cloud Computing: A Data Placement Strategy Based on Reinforcement learning

The heterogeneous edge-cloud computing paradigm can provide an optimal s...
research
01/22/2019

A Time-driven Data Placement Strategy for a Scientific Workflow Combining Edge Computing and Cloud Computing

Compared to traditional distributed computing environments such as grids...
research
02/09/2018

Heterogeneous and Multidimensional Clairvoyant Dynamic Bin Packing for Virtual Machine Placement

Although the public cloud still occupies the largest portion of the tota...
research
01/26/2023

A Cloud-Edge Continuum Experimental Methodology Applied to a 5G Core Study

There is an increasing interest in extending traditional cloud-native te...
research
01/18/2023

HLC2: a highly efficient cross-matching framework for large astronomical catalogues on heterogeneous computing environments

Cross-matching operation, which is to find corresponding data for the sa...
research
03/13/2022

Network Bandwidth Allocation Problem For Cloud Computing

Cloud computing enables ubiquitous, convenient, and on-demand network ac...
research
02/05/2018

On Distributed Algorithms for Cost-Efficient Data Center Placement in Cloud Computing

The increasing popularity of cloud computing has resulted in a prolifera...
