Repair rate lower bounds for distributed storage

02/18/2020
by   Michael Luby, et al.
0

One of the primary objectives of a distributed storage system is to reliably store a large amount dsize of source data for a long duration using a large number N of unreliable storage nodes, each with capacity nsize. The storage overhead β is the fraction of system capacity available beyond dsize, i.e., β = 1- dsize/N · nsize. Storage nodes fail randomly over time and are replaced with initially empty nodes, and thus data is erased from the system at an average rate erate = λ· N · nsize, where 1/λ is the average lifetime of a node before failure. To maintain recoverability of the source data, a repairer continually reads data over a network from nodes at some average rate rrate, and generates and writes data to nodes based on the read data. The main result is that, for any repairer, if the source data is recoverable at each point in time then it must be the case that rrate >erate/2 ·β asymptotically as N goes to infinity and beta goes to zero. This inequality provides a fundamental lower bound on the average rate that any repairer needs to read data from the system in order to maintain recoverability of the source data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/13/2021

Distributed storage algorithms with optimal tradeoffs

One of the primary objectives of a distributed storage system is to reli...
research
08/26/2019

Capacity of dynamical storage systems

We introduce a dynamical model of node repair in distributed storage sys...
research
05/22/2017

Liquid Cloud Storage

A liquid system provides durable object storage based on spreading redun...
research
10/23/2020

Efficient Storage Schemes for Desired Service Rate Regions

A major concern in cloud/edge storage systems is serving a large number ...
research
01/13/2018

Distributed Multi-User Secret Sharing

A distributed secret sharing system is considered that consists of a dea...
research
01/06/2018

A Class of MSR Codes for Clustered Distributed Storage

Clustered distributed storage models real data centers where intra- and ...
research
05/15/2020

Data Freshness in Leader-Based Replicated Storage

Leader-based data replication improves consistency in highly available d...

Please sign up or login with your details

Forgot password? Click here to reset