Data Synchronization: A Complete Theoretical Solution for Filesystems

10/10/2022
by Elod P. Csirmaz, et al.

Data reconciliation in general, and filesystem synchronization in particular, lacks a rigorous theoretical foundation. This paper presents, for the first time, a complete analysis of synchronization for two replicas of a theoretical filesystem. Synchronization has two main stages: identifying the conflicts and resolving them. All existing synchronizers, both theoretical and practical, are operation-based: they define, using some rationale or heuristics, how conflicts are to be resolved, without considering the effect of a resolution on subsequent conflicts. Our approach is instead declaration-based: we define what constitutes the resolution of all conflicts, and for each possible scenario we prove the existence of sequences of operations (commands) that convert the replicas into a common synchronized state. These sequences consist of operations that roll back some local changes, followed by operations performed on the other replica. The set of rolled-back operations gives the user clear and intuitive information on the proposed changes, so she can easily decide whether to accept them or ask for alternatives. All possible synchronized states are described by specifying a set of conflicts, a partial order on the conflicts describing the order in which they need to be resolved, and the effect of each decision on subsequent conflicts. Using this classification, the outcomes of different conflict resolution policies can be investigated easily.
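
The "roll back some local changes, then apply the other replica's operations" shape of a synchronized state can be illustrated with a small toy. The sketch below is not the paper's construction: it assumes a flat filesystem (a plain mapping from path to content, with no directories), so conflicts are independent of one another and the partial order on conflicts is trivial. The names `diff`, `find_conflicts`, `synchronize`, and the `winner` policy argument are hypothetical and introduced only for illustration.

```python
from typing import Dict, Optional, Set, Tuple

# Toy model (an assumption, not the paper's filesystem): path -> content.
Filesystem = Dict[str, str]
Changes = Dict[str, Optional[str]]  # None means the path was deleted


def diff(original: Filesystem, replica: Filesystem) -> Changes:
    """Local changes of a replica relative to the common original state."""
    changes: Changes = {}
    for path in set(original) | set(replica):
        if original.get(path) != replica.get(path):
            changes[path] = replica.get(path)
    return changes


def find_conflicts(changes_a: Changes, changes_b: Changes) -> Set[str]:
    """Paths that both replicas changed, but to different results."""
    return {p for p in changes_a.keys() & changes_b.keys()
            if changes_a[p] != changes_b[p]}


def synchronize(
    original: Filesystem,
    replica_a: Filesystem,
    replica_b: Filesystem,
    winner: Dict[str, str],
) -> Tuple[Filesystem, Dict[str, Set[str]]]:
    """Build a common state; `winner` maps each conflicting path to "a" or "b".

    Returns the merged state and, per replica, the set of paths whose
    local change is rolled back (the losing side of each conflict).
    """
    changes = {"a": diff(original, replica_a), "b": diff(original, replica_b)}
    conflicts = find_conflicts(changes["a"], changes["b"])
    rolled_back: Dict[str, Set[str]] = {"a": set(), "b": set()}
    merged = dict(original)
    for name in ("a", "b"):
        for path, content in changes[name].items():
            if path in conflicts and winner[path] != name:
                rolled_back[name].add(path)  # losing change is undone
                continue
            if content is None:
                merged.pop(path, None)       # deletion carried over
            else:
                merged[path] = content       # creation or edit carried over
    return merged, rolled_back


original = {"/doc": "v0", "/old": "x"}
replica_a = {"/doc": "A", "/old": "x", "/new": "n"}  # edits /doc, creates /new
replica_b = {"/doc": "B"}                             # edits /doc, deletes /old
merged, rolled_back = synchronize(original, replica_a, replica_b, {"/doc": "a"})
```

In this toy run, the only conflict is on `/doc`; choosing replica `a` as the winner means replica `b` rolls back its edit to `/doc`, and every non-conflicting change from either side is carried over. In the tree-structured setting the paper analyzes, a decision on one conflict can affect later ones, which this flat model deliberately leaves out.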


