Replication in Data Grids: Metrics and Strategies

12/18/2019
by   Tarek Hamrouni, et al.
0

We focus in this report on two main axes. The first is dedicated to the study of the effect of replicas distribution on data grid performances. In this respect, our main contributions are as follows: 1) An overview of replication strategies mainly from the viewpoints of the considered parameters in their associated steps as well as the used metrics in the literature for their evaluation. 2) A study of the impact of placement strategies on data grid performance which motivated the analysis of the effect of the replicas distribution quality on the performance results of replication strategies. 3) The proposal of new evaluation metrics dedicated to the evaluation of the distribution quality. 4) The setting of an objective evaluation of replication strategies which is based on a beforehand assessment of the distribution quality. The second axis is mainly dedicated to exploiting results of data mining techniques to enhance performances of replication strategies. With respect to this axis, we mainly concentrate on the following contributions listed below: 1) The study of the strengths and the drawbacks of the main replication strategies based on data mining techniques and how these latter are applied in this context. 2) The proposal of a new guideline to data mining application in the context of data grid replication strategies. 3) The proposal of a new algorithm for mining maximal frequent correlated patterns. The input of this algorithm is obtained through a preliminary step focusing on how to adapt the required grid concepts to the data mining algorithm. 4) The design and the implementation of a new replication strategy based on a data mining technique, and more precisely correlated patterns.

READ FULL TEXT
research
11/23/2018

Contributions to Biclustering of Microarray Data Using Formal Concept Analysis

Biclustering is an unsupervised data mining technique that aims to unvei...
research
10/26/2020

Quality Prediction in Interlinked Manufacturing Processes based on Supervised & Unsupervised Machine Learning

In the context of a rolling mill case study, this paper presents a metho...
research
04/09/2018

Predicting Dynamic Replication based on Fuzzy System in Data Grid

Data grid replication is an effective method to achieve efficient and fa...
research
08/17/2017

Human Uncertainty and Ranking Error -- The Secret of Successful Evaluation in Predictive Data Mining

One of the most crucial issues in data mining is to model human behaviou...
research
02/09/2019

Replication Can Improve Prior Results: A GitHub Study of Pull Request Acceptance

Crowdsourcing and data mining can be used to effectively reduce the effo...
research
08/06/2015

Replication and Generalization of PRECISE

This report describes an initial replication study of the PRECISE system...
research
05/03/2021

[Re] Three-dimensional wake topology and propulsive performance of low-aspect-ratio pitching-rolling plates

This article reports on a full replication study in computational fluid ...

Please sign up or login with your details

Forgot password? Click here to reset