On Exact and Approximate Policies for Linear Tape Scheduling in Data Centers

12/13/2021
by   Carlos H. Cardonha, et al.
0

This paper investigates scheduling policies for file retrieval in linear storage devices, such as magnetic tapes. Tapes are the technology of choice for long-term storage in data centers due to their low cost per capacity, reliability, and data security. While scheduling problems associated with data retrieval in tapes are classical, existing works focus on more straightforward heuristic approaches due to limited computational times imposed by standard tape specifications. Our first contribution is a theoretical investigation of three standard policies, presenting their worst-case performance and special cases of practical relevance for which they are optimal. Next, we show that the problem is polynomially solvable via two interleaved recursive models, albeit with high computational complexity. We leverage our previous results to develop two new scheduling policies with constant-ratio performance and low computational cost. Finally, we investigate properties associated with the online variant of the problem, presenting a new constant-factor competitive algorithm. Our numerical analysis on synthetic and real-world tapes from an industry partner provides insights into dataset configurations where each policy is more effective, which is of relevance to data center managers. In particular, our new best-performing policy is practical for large datasets and significantly improves upon standard algorithms in the area.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/21/2018

Theoretical and Practical Aspects of the Linear Tape Scheduling Problem

Magnetic tapes have been playing a key role as means for storage of digi...
research
12/17/2021

An Exact Algorithm for the Linear Tape Scheduling Problem

Magnetic tapes are often considered as an outdated storage technology, y...
research
01/07/2020

On Competitive Analysis for Polling Systems

Polling systems have been widely studied, however most of these studies ...
research
08/03/2018

A Stochastic Model for File Lifetime and Security in Data Center Networks

Data center networks are an important infrastructure in various applicat...
research
06/12/2018

Techniques for Efficiently Handling Power Surges in Fuel Cell Powered Data Centers: Modeling, Analysis, Results

Fuel cells are a promising power source for future data centers, offerin...
research
06/30/2023

Algorithms for Shipping Container Delivery Scheduling

Motivated by distribution problems arising in the supply chain of Haleon...

Please sign up or login with your details

Forgot password? Click here to reset