Latency optimal storage and scheduling of replicated fragments for memory-constrained servers

10/04/2020
by   Rooji Jinan, et al.
0

We consider the setting of distributed storage system where a single file is subdivided into smaller fragments of same size which are then replicated with a common replication factor across servers of identical cache size. An incoming file download request is sent to all the servers, and the download is completed whenever request gathers all the fragments. At each server, we are interested in determining the set of fragments to be stored, and the sequence in which fragments should be accessed, such that the mean file download time for a request is minimized. We model the fragment download time as an exponential random variable independent and identically distributed for all fragments across all servers, and show that the mean file download time can be lower bounded in terms of the expected number of useful servers summed over all distinct fragment downloads. We present deterministic storage schemes that attempt to maximize the number of useful servers. We show that finding the optimal sequence of accessing the fragments is a Markov decision problem, whose complexity grows exponentially with the number of fragments. We propose heuristic algorithms that determine the sequence of access to the fragments which are empirically shown to perform well.

READ FULL TEXT
research
07/22/2020

Fragments-Expert: A Graphical User Interface MATLAB Toolbox for Classification of File Fragments

The classification of file fragments of various file formats is an essen...
research
07/19/2023

Fundamental Limits of Reference-Based Sequence Reordering

The problem of reconstructing a sequence of independent and identically ...
research
03/17/2021

Capacity Achieving Uncoded PIR Protocol based on Combinatorial Designs

In this paper we study the problem of private information retrieval wher...
research
04/14/2023

A Byte Sequence is Worth an Image: CNN for File Fragment Classification Using Bit Shift and n-Gram Embeddings

File fragment classification (FFC) on small chunks of memory is essentia...
research
02/25/2021

Fragmented Objects: Boosting Concurrency of Shared Large Objects

This work examines strategies to handle large shared data objects in dis...
research
09/16/2014

Improving files availability for BitTorrent using a diffusion model

The BitTorrent mechanism effectively spreads file fragments by copying t...
research
06/06/2022

Managing Bufferbloat in Storage Systems

Today, companies and data centers are moving towards distributed and ser...

Please sign up or login with your details

Forgot password? Click here to reset