Dynamically Provisioning Cray DataWarp Storage

11/27/2019
by   François Tessier, et al.
0

Complex applications and workflows needs are often exclusively expressed in terms of computational resources on HPC systems. In many cases, other resources like storage or network are not allocatable and are shared across the entire HPC system. By looking at the storage resource in particular, any workflow or application should be able to select both its preferred data manager and its required storage capability or capacity. To achieve such a goal, new mechanisms should be introduced. In this work, we introduce such a mechanism for dynamically provision a data management system on top of storage devices. We particularly focus our effort on deploying a BeeGFS instance across multiple DataWarp nodes on a Cray XC50 system. However, we also demonstrate that the same mechanism can be used to deploy BeeGFS on non-Cray system.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/16/2018

A Cross-Layer Solution in Scientific Workflow System for Tackling Data Movement Challenge

Scientific applications in HPC environment are more com-plex and more da...
research
07/01/2021

Toward Interoperable Cyberinfrastructure: Common Descriptions for Computational Resources and Applications

The user-facing components of the Cyberinfrastructure (CI) ecosystem, sc...
research
12/22/2021

Survey the storage systems used in HPC and BDA ecosystems

The advancement in HPC and BDA ecosystem demands a better understanding ...
research
01/04/2023

Analyzing I/O Performance of a Hierarchical HPC Storage System for Distributed Deep Learning

Today, deep learning is an essential technology for our life. To solve m...
research
03/13/2023

Integration of storage endpoints into a Rucio data lake, as an activity to prototype a SKA Regional Centres Network

The Square Kilometre Array (SKA) infrastructure will consist of two radi...
research
11/07/2018

Data Pallets: Containerizing Storage For Reproducibility and Traceability

Trusting simulation output is crucial for Sandia's mission objectives. W...
research
02/25/2021

Optimized Memoryless Fair-Share HPC Resources Scheduling using Transparent Checkpoint-Restart Preemption

Common resource management methods in supercomputing systems usually inc...

Please sign up or login with your details

Forgot password? Click here to reset