Distributed storage algorithms with optimal tradeoffs

01/13/2021
by   Michael Luby, et al.
0

One of the primary objectives of a distributed storage system is to reliably store large amounts of source data for long durations using a large number N of unreliable storage nodes, each with c bits of storage capacity. Storage nodes fail randomly over time and are replaced with nodes of equal capacity initialized to zeroes, and thus bits are erased at some rate e. To maintain recoverability of the source data, a repairer continually reads data over a network from nodes at an average rate r, and generates and writes data to nodes based on the read data. The distributed storage source capacity is the maximum amount of source that can be reliably stored for long periods of time. Previous research shows that asymptotically the distributed storage source capacity is at most (1-e/2 · r) · N · c as N and r grow. In this work we introduce and analyze algorithms such that asymptotically the distributed storage source data capacity is at least the above equation. Thus, the above equation expresses a fundamental trade-off between network traffic and storage overhead to reliably store source data.

READ FULL TEXT

page 4

page 5

page 15

page 16

research
02/18/2020

Repair rate lower bounds for distributed storage

One of the primary objectives of a distributed storage system is to reli...
research
02/27/2023

An algorithm for geo-distributed and redundant storage in Garage

This paper presents an optimal algorithm to compute the assignment of da...
research
04/13/2022

On Extremal Rates of Secure Storage over Graphs

A secure storage code maps K source symbols, each of L_w bits, to N code...
research
10/12/2022

On Extremal Rates of Storage over Graphs

A storage code over a graph maps K independent source symbols, each of L...
research
11/19/2019

Audita: A Blockchain-based Auditing Framework for Off-chain Storage

The cloud changed the way we manage and store data. Today, cloud storage...
research
06/24/2020

Fetch-Directed Instruction Prefetching Revisited

Prior work has observed that fetch-directed prefetching (FDIP) is highly...
research
01/15/2020

Secrecy and Accessibility in Distributed Storage

A distributed storage system (DSS) needs to be efficiently accessible an...

Please sign up or login with your details

Forgot password? Click here to reset