RPDP: An Efficient Data Placement based on Residual Performance for P2P Storage Systems

04/18/2023
by   Fitrio Pakana, et al.
0

Storage systems using Peer-to-Peer (P2P) architecture are an alternative to the traditional client-server systems. They offer better scalability and fault tolerance while at the same time eliminate the single point of failure. The nature of P2P storage systems (which consist of heterogeneous nodes) introduce however data placement challenges that create implementation trade-offs (e.g., between performance and scalability). Existing Kademlia-based DHT data placement method stores data at closest node, where the distance is measured by bit-wise XOR operation between data and a given node. This approach is highly scalable because it does not require global knowledge for placing data nor for the data retrieval. It does not however consider the heterogeneous performance of the nodes, which can result in imbalanced resource usage affecting the overall latency of the system. Other works implement criteria-based selection that addresses heterogeneity of nodes, however often cause subsequent data retrieval to require global knowledge of where the data stored. This paper introduces Residual Performance-based Data Placement (RPDP), a novel data placement method based on dynamic temporal residual performance of data nodes. RPDP places data to most appropriate selected nodes based on their throughput and latency with the aim to achieve lower overall latency by balancing data distribution with respect to the individual performance of nodes. RPDP relies on Kademlia-based DHT with modified data structure to allow data subsequently retrieved without the need of global knowledge. The experimental results indicate that RPDP reduces the overall latency of the baseline Kademlia-based P2P storage system (by 4.87 the nodes, with minimal impact to the data retrieval complexity.

READ FULL TEXT
research
12/05/2022

Evaluation of Locality, Latency and Geospace Aware Data Placement Strategies at the Edge

With the rise in the adaptation of edge computing frameworks for applica...
research
03/21/2018

A Robust Fault-Tolerant and Scalable Cluster-wide Deduplication for Shared-Nothing Storage Systems

Deduplication has been largely employed in distributed storage systems t...
research
02/06/2019

Blockchain Storage Load Balancing Among DHT Clustered Nodes

In Bitcoin, to independently verify whether new transactions are correct...
research
12/05/2020

Optimal Caching for Low Latency in Distributed Coded Storage Systems

Erasure codes have been widely considered a promising solution to enhanc...
research
06/28/2023

Blockchain Nodes are Heterogeneous and Your P2P Overlay Should be Too: PODS

At the core of each blockchain system, parties communicate through a pee...
research
02/20/2018

CASPaxos: Replicated State Machines without logs

CASPaxos is a replicated state machine (RSM) protocol, an extension of S...
research
04/08/2021

Read-Write Quorum Systems Made Practical

Quorum systems are a powerful mechanism for ensuring the consistency of ...

Please sign up or login with your details

Forgot password? Click here to reset