Harnessing Correlations in Distributed Erasure Coded Key-Value Stores

10/02/2018
by   Ramy E. Ali, et al.
0

Motivated by applications of distributed storage systems to cloud-based key-value stores, the multi-version coding problem has been recently formulated to efficiently store frequently updated data in asynchronous decentralized storage systems. Inspired by consistency requirements in distributed systems, the main goal in multi-version coding is to ensure that the latest possible version of the data is decodable, even if the data updates have not reached some servers in the system. In this paper, we study the storage cost of ensuring consistency for the case where the data versions are correlated, in contrast to previous work where data versions were treated as being independent. We provide multi-version code constructions that show that the storage cost can be significantly smaller than the previous constructions depending on the degree of correlation between the different versions of the data. Our achievability results are based on Reed-Solomon codes and random binning. Through an information-theoretic converse, we show that our multi-version codes are nearly-optimal in certain regimes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/11/2018

Multi-version Coding with Side Information

In applications of storage systems to modern key-value stores, the store...
research
12/02/2019

Multi-version Indexing in Flash-based Key-Value Stores

Maintaining multiple versions of data is popular in key-value stores sin...
research
10/29/2019

Locally recoverable codes on surfaces

A linear error correcting code is a subspace of a finite dimensional spa...
research
02/26/2021

CausalEC: A Causally Consistent Data Storage Algorithm based on Cross-Object Erasure Coding

Causally consistent distributed storage systems have received significan...
research
02/06/2018

Erasure correction of scalar codes in the presence of stragglers

Recent advances in coding for distributed storage systems have reignited...
research
05/21/2020

Modeling and Optimization of Latency in Erasure-coded Storage Systems

As consumers are increasingly engaged in social networking and E-commerc...
research
04/16/2020

ForkBase: Immutable, Tamper-evident Storage Substrate for Branchable Applications

Data collaboration activities typically require systematic or protocol-b...

Please sign up or login with your details

Forgot password? Click here to reset