Predicting Dynamic Replication based on Fuzzy System in Data Grid

04/09/2018
by Mahnaz Khojand, et al.

Data grid replication is an effective way to achieve efficient, fault-tolerant data access while reducing access latency and bandwidth consumption in grids. Because storage is limited, each replica should be created at the most suitable site. An evaluation of previously proposed algorithms shows that blindly creating replicas on different sites after each request may improve response time. In practice, however, most of the replicas created this way are never used, so grid resources are wasted on them. In this paper, we propose a new dynamic replication algorithm called Predictive Fuzzy Replication (PFR). PFR not only redefines the Balanced Ant Colony Optimization (BACO) algorithm, which is used for job scheduling in grids, but also uses it to place replicas at appropriate sites in the data grid. The new algorithm considers the usage history of files, file size, the level of the sites, and the free space available for replication. It tries to predict future needs and pre-replicates files on the most suitable resources, or decides which replica should be deleted when there is not enough space for a new one. The algorithm also considers the files related to a replicated file and replicates them according to their own usage history. PFR performs more efficiently than the Cascading method, one of the algorithms designed to make optimized use of existing replicas.
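The abstract names the inputs PFR weighs (usage history, file size, site level, free space) but not its membership functions or rule base. The sketch below is therefore not the authors' method; it only illustrates how a fuzzy score over those four inputs could rank candidate sites. Every name, threshold, and rule in it is a hypothetical choice made for illustration, including the assumption that a higher site level means the site is closer to the users.

```python
from dataclasses import dataclass
from typing import List, Optional


def ramp_up(x: float, lo: float, hi: float) -> float:
    """Membership that rises linearly from 0 at lo to 1 at hi."""
    if x <= lo:
        return 0.0
    if x >= hi:
        return 1.0
    return (x - lo) / (hi - lo)


@dataclass
class Candidate:
    site: str
    recent_accesses: int   # usage history of the file at this site
    file_size_mb: float    # must be positive
    site_level: int        # assumed: higher level = closer to the users
    free_space_mb: float


def replication_score(c: Candidate) -> float:
    """Hypothetical fuzzy score in [0, 1]; higher means a better site."""
    # Thresholds are illustrative, not taken from the paper.
    popular = ramp_up(c.recent_accesses, 2, 20)
    fits = ramp_up(c.free_space_mb / c.file_size_mb, 1.0, 4.0)
    near_users = ramp_up(c.site_level, 0, 3)
    # Single rule: replicate IF popular AND fits AND near_users,
    # using min as the fuzzy AND.
    return min(popular, fits, near_users)


def best_site(candidates: List[Candidate]) -> Optional[str]:
    """Return the highest-scoring site, or None if no site scores above 0."""
    if not candidates:
        return None
    score, site = max((replication_score(c), c.site) for c in candidates)
    return site if score > 0 else None
```

A real system would use several rules and a defuzzification step; the single min-rule here is only meant to show how the four inputs can be combined into one ranking.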

