accelerating wrf i/o performance with adios2 and network-based streaming

04/13/2023
by   Erick Fredj, et al.
0

With the approach of Exascale computing power for large-scale High Performance Computing (HPC) clusters, the gap between compute capabilities and storage systems is growing larger. This is particularly problematic for the Weather Research and Forecasting Model (WRF), a widely-used HPC application for high-resolution forecasting and research that produces sizable datasets, especially when analyzing transient weather phenomena. Despite this issue, the I/O modules within WRF have not been updated in the past ten years, resulting in subpar parallel I/O performance. This research paper demonstrates the positive impact of integrating ADIOS2, a next-generation parallel I/O framework, as a new I/O backend option in WRF. It goes into detail about the challenges encountered during the integration process and how they were addressed. The resulting I/O times show an over tenfold improvement when using ADIOS2 compared to traditional MPI-I/O based solutions. Furthermore, the study highlights the new features available to WRF users worldwide, such as the Sustainable Staging Transport (SST) enabling Unified Communication X (UCX) DataTransport, the node-local burst buffer write capabilities and in-line lossless compression capabilities of ADIOS2. Additionally, the research shows how ADIOS2's in-situ analysis capabilities can be smoothly integrated with a simple WRF forecasting pipeline, resulting in a significant improvement in overall time to solution. This study serves as a reminder to legacy HPC applications that incorporating modern libraries and tools can lead to considerable performance enhancements with minimal changes to the core application.

READ FULL TEXT

page 1

page 5

page 7

page 8

research
01/20/2022

High Performance Parallel I/O and In-Situ Analysis in the WRF Model with ADIOS2

As the computing power of large-scale HPC clusters approaches the Exasca...
research
06/29/2022

Accelerating Parallel Write via Deeply Integrating Predictive Lossy Compression with HDF5

Lossy compression is one of the most efficient solutions to reduce stora...
research
11/23/2020

Integrating Deep Learning in Domain Sciences at Exascale

This paper presents some of the current challenges in designing deep lea...
research
07/26/2023

Gleam: An RDMA-accelerated Multicast Protocol for Datacenter Networks

RDMA has been widely adopted for high-speed datacenter networks. However...
research
07/20/2020

BeeSwarm: Enabling Scalability Tests in Continuous Integration

Testing is one of the most important steps in software development. It e...
research
08/14/2022

DAOS as HPC Storage, a view from Numerical Weather Prediction

Novel object storage solutions potentially address long-standing scalabi...
research
04/15/2022

Preparing for the Future – Rethinking Proxy Apps

A considerable amount of research and engineering went into designing pr...

Please sign up or login with your details

Forgot password? Click here to reset