Towards Learned Predictability of Storage Systems

07/30/2023
by   Chenyuan Wu, et al.

With the rapid development of cloud computing and big data technologies, storage systems have become a fundamental building block of datacenters, incorporating hardware innovations such as flash solid-state drives and non-volatile memories, as well as software infrastructures such as RAID and distributed file systems. Despite the growing popularity of and interest in storage, designing and implementing reliable storage systems remains challenging because of their performance instability and prevalent hardware failures. Proactive prediction greatly strengthens the reliability of storage systems. There are two dimensions of prediction: performance and failure. Ideally, by detecting slow IO requests in advance and predicting device failures before they occur, we can build storage systems with especially low tail latency and high availability. Although its importance is well recognized, such proactive prediction in storage systems is particularly difficult. To move towards predictability of storage systems, various mechanisms have been proposed and field studies conducted in the past few years. In this report, we present a survey of these mechanisms and field studies, focusing on machine-learning-based black-box approaches. Based on three representative research works, we discuss where and how machine learning should be applied in this field. The strengths and limitations of each research work are also evaluated in detail.
