Modeling and Optimization of Latency in Erasure-coded Storage Systems

05/21/2020
by   Vaneet Aggarwal, et al.

As consumers increasingly engage in social networking and e-commerce, businesses come to rely on Big Data analytics for intelligence, and traditional IT infrastructures continue to migrate to the cloud and edge, demand for distributed data storage is rising at an unprecedented speed. Erasure coding has quickly emerged as a promising technique to reduce storage cost while providing reliability similar to that of replicated systems, and it has been widely adopted by companies like Facebook, Microsoft and Google. However, it also brings new challenges in characterizing and optimizing access latency when erasure codes are used in distributed storage. The aim of this monograph is to provide a review of recent progress (both theoretical and practical) on systems that employ erasure codes for distributed storage. We first identify the key challenges and a taxonomy of the research problems, and then give an overview of the different approaches that have been developed to quantify and model the latency of erasure-coded storage. This includes recent work leveraging MDS-Reservation, Fork-Join, Probabilistic, and Delayed-Relaunch scheduling policies, as well as their applications to characterizing the access latency (e.g., mean, tail, and asymptotic latency) of erasure-coded distributed storage systems. We also extend the problem to the case where users stream videos from erasure-coded distributed storage systems. Next, we bridge the gap between theory and practice and discuss lessons learned from prototype implementations. In particular, we discuss exemplary implementations of erasure-coded storage, illuminate key design degrees of freedom and tradeoffs, and summarize remaining challenges in real-world storage systems, such as those in content delivery and caching. Open problems for future research are discussed at the end of each chapter.
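
To give a rough feel for what "access latency" means under an (n, k) MDS code, the sketch below estimates the mean time to retrieve a file when a request is sent to all n storage nodes and completes as soon as any k chunks arrive. It is an illustration only and not one of the models from the monograph: it assumes chunk download times are i.i.d. exponential and ignores queueing entirely, and the function names and the example parameters (n = 7, k = 4, rate 1.0) are hypothetical. Under these assumptions the latency is the k-th order statistic of n exponentials, whose mean is (H_n - H_{n-k}) / rate, where H_m is the m-th harmonic number.

```python
import random


def harmonic(m):
    """Return the m-th harmonic number H_m = 1 + 1/2 + ... + 1/m (H_0 = 0)."""
    return sum(1.0 / i for i in range(1, m + 1))


def expected_kth_of_n(n, k, rate=1.0):
    """Closed-form mean of the k-th smallest of n i.i.d. Exp(rate) times.

    Assumption: no queueing, every node starts serving immediately.
    """
    return (harmonic(n) - harmonic(n - k)) / rate


def simulate_kth_of_n(n, k, rate=1.0, trials=20_000, seed=0):
    """Monte Carlo estimate of the same quantity.

    Each trial draws n independent chunk download times and records the
    moment the k-th chunk finishes, which is when the file can be decoded.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        times = sorted(rng.expovariate(rate) for _ in range(n))
        total += times[k - 1]  # completion time of the k-th fastest chunk
    return total / trials


if __name__ == "__main__":
    n, k = 7, 4  # hypothetical code parameters, chosen only for illustration
    print("closed form:", expected_kth_of_n(n, k))
    print("simulated:  ", simulate_kth_of_n(n, k))
    # Baseline under the same assumptions: wait for all k of exactly k nodes.
    print("no redundancy (k of k):", expected_kth_of_n(k, k))
```

In this toy setting, requesting from more nodes than needed shortens the wait because the slowest n - k chunks never have to finish. Real systems add queueing at each node, which is what makes the scheduling policies named in the abstract nontrivial to analyze.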


research
12/05/2020

Optimal Caching for Low Latency in Distributed Coded Storage Systems

Erasure codes have been widely considered a promising solution to enhanc...
research
08/25/2019

Babel Storage: Uncoordinated Content Delivery from Multiple Coded Storage Systems

In future content-centric networks, content is identified independently ...
research
01/22/2023

Durability and Availability of Erasure-Coded Storage Systems with Concurrent Maintenance

This initial version of this document was written back in 2014 for the s...
research
02/03/2021

Optimizing QoS for Erasure-Coded Wireless Data Centers

Cloud computing facilitates the access of applications and data from any...
research
10/02/2018

Harnessing Correlations in Distributed Erasure Coded Key-Value Stores

Motivated by applications of distributed storage systems to cloud-based ...
research
03/23/2022

Rank-Metric Codes and Their Applications

The rank metric measures the distance between two matrices by the rank o...
research
03/24/2017

Video Streaming in Distributed Erasure-coded Storage Systems: Stall Duration Analysis

The demand for global video has been burgeoning across industries. With ...
