An Analytical Model-based Capacity Planning Approach for Building CSD-based Storage Systems

06/07/2023
by Hongsu Byun, et al.

Data movement in large-scale computing facilities (from compute nodes to data nodes) is one of the major contributors to high cost and energy consumption. To address this, in-storage processing (ISP) within storage devices such as solid-state drives (SSDs) has been actively explored. The introduction of computational storage drives (CSDs) enabled ISP within the same form factor as regular SSDs and made it easy to replace SSDs within traditional compute nodes. With CSDs, host systems can offload various operations such as search, filter, and count. However, commercialized CSDs differ in their hardware resources and performance characteristics. Building a CSD-based storage system within a compute node therefore requires careful consideration of hardware, performance, and workload characteristics. Storage architects are hesitant to build storage systems based on CSDs because no tools exist to determine whether CSD-based compute nodes offer a benefit over traditional SSD-based nodes in meeting performance requirements. In this work, we propose CSDPlan, an analytical model-based storage capacity planner that helps system architects build performance-effective CSD-based compute nodes. Our model takes into account the performance characteristics of the host system, the targeted workloads, and the hardware and performance characteristics of the CSDs to be deployed, and provides an optimal configuration based on the number of CSDs for a compute node. Furthermore, CSDPlan estimates and reduces the total cost of ownership (TCO) of building a CSD-based compute node. To evaluate the efficacy of CSDPlan, we selected two commercially available CSDs and four representative big data analysis workloads.

