Reproducibility and Performance: Why Choose?

03/15/2022
by Ludovic Courtès, et al.

Research processes often rely on high-performance computing (HPC), yet HPC is commonly seen as antithetical to "reproducibility": one would have to choose between software that achieves high performance and software that can be deployed in a reproducible fashion. However, by giving up on reproducibility we would give up on verifiability, a foundation of the scientific process. How can we reconcile performance and reproducibility? This article looks at two performance-critical aspects of HPC: message passing (MPI) and CPU micro-architecture tuning. Engineering work that has gone into performance portability has already proved fruitful, but some areas remain unaddressed when it comes to CPU tuning. We propose package multi-versioning, a technique developed for GNU Guix, a tool for reproducible software deployment, and show that it allows us to implement CPU tuning without compromising on reproducibility and provenance tracking.

