Modernizing the HPC System Software Stack

07/20/2020
by   Benjamin S. Allen, et al.
0

Through the 1990s, HPC centers at national laboratories, universities, and other large sites designed distributed system architectures and software stacks that enabled extreme-scale computing. By the 2010s, these centers were eclipsed by the scale of web-scale and cloud computing architectures, and today even upcoming exascale HPC systems are magnitudes of scale smaller than those of datacenters employed by large web companies. Meanwhile, the HPC community has allowed system software designs to stagnate, relying on incremental changes to tried-and-true designs to move between generations of systems. We contend that a modern system software stack that focuses on manageability, scalability, security, and modern methods will benefit the entire HPC community. In this paper, we break down the logical parts of a typical HPC system software stack, look at more modern ways to meet their needs, and make recommendations of future work that would help the community move in that direction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/23/2023

Survey of adaptive containerization architectures for HPC

Containers offer an array of advantages that benefit research reproducib...
research
06/24/2020

Integrating LHCb workflows on HPC resources: status and strategies

High Performance Computing (HPC) supercomputers are expected to play an ...
research
03/29/2023

Overcoming Challenges to Continuous Integration in HPC

Continuous integration (CI) has become a ubiquitous practice in modern s...
research
03/22/2018

SCISPACE: A Scientific Collaboration Workspace for File Systems in Geo-Distributed HPC Data Centers

Future terabit networks are committed to dramatically improving big data...
research
05/16/2018

A Software-Defined QoS Provisioning Framework for HPC Applications

With the emergence of large-scale data-intensive high-performance applic...
research
09/27/2020

A highly scalable Met Office NERC Cloud model

Large Eddy Simulation is a critical modelling tool for scientists invest...
research
10/27/2022

Noise in the Clouds: Influence of Network Performance Variability on Application Scalability

Cloud computing represents an appealing opportunity for cost-effective d...

Please sign up or login with your details

Forgot password? Click here to reset