Analysis of parallel I/O use on the UK national supercomputing service, ARCHER using Cray LASSi and EPCC SAFE

06/10/2019
by   Andrew Turner, et al.
0

In this paper, we describe how we have used a combination of the LASSi tool (developed by Cray) and the SAFE software (developed by EPCC) to collect and analyse Lustre I/O performance data for all jobs running on the UK national supercomputing service, ARCHER; and to provide reports on I/O usage for users in our standard reporting framework. We also present results from analysis of parallel I/O use on ARCHER and analysis on the potential impact of different applications on file system performance using metrics we have derived from the LASSi data. We show that the performance data from LASSi reveals how the same application can stress different components of the file system depending on how it is run, and how the LASSi risk metrics allow us to identify use cases that could potentially cause issues for global I/O performance and work with users to improve their I/O use. We use the IO-500 benchmark to help us understand how LASSi risk metrics correspond to observed performance on the ARCHER file systems. We also use LASSi data imported into SAFE to identify I/O use patterns associated with different research areas, understand how the research workflow gives rise to the observed patterns and project how this will affect I/O requirements in the future. Finally, we provide an overview of likely future directions for the continuation of this work.

READ FULL TEXT

page 1

page 8

research
10/23/2017

Directory Service Provided by DSCloud Platform

When there are huge volumes of information dispersing in the various mac...
research
04/12/2022

Dementia in England: Quantifying and analysing modifiable risk

The prevalence of dementia is set to explode throughout the 21st century...
research
03/21/2014

File System Design Approaches

In this article, the file system development design approaches are discu...
research
08/27/2019

Performance modeling of a distributed file-system

Data centers have become center of big data processing. Most programs ru...
research
05/19/2020

High Velocity Kernel File Systems with Bento

High development velocity is critical for modern systems. This is especi...
research
06/05/2023

Evaluation of software impact designed for biomedical research: Are we measuring what's meaningful?

Software is vital for the advancement of biology and medicine. Analysis ...
research
09/30/2021

Mac Users Do It Differently: the Role of Operating System and Individual Differences in File Management

Despite much discussion in HCI research about how individual differences...

Please sign up or login with your details

Forgot password? Click here to reset