DMTCP Checkpoint/Restart of MPI Programs via Proxies

03/25/2018
by   Gregory Michael Price, et al.
0

MPI accomplishes portable, standardized message-passing between processes by exposing a standard API that hides the implementation of the underlying mechanism for message passing. Until now, checkpointing an MPI program required knowledge of these underlying mechanisms. Through the addition of a proxy, we demonstrate that MPI programs can be checkpointed and restarted regardless of the MPI implementation utilized. Further, proxies may enable MPI programs to be checkpointed on one MPI implementation, and restarted on another.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/13/2023

MPI Advance : Open-Source Message Passing Optimizations

The large variety of production implementations of the message passing i...
research
10/06/2003

Poster on MPI application in Computational Fluid Dynamics

Poster-presentation of the paper "Message Passing Fluids: molecules as p...
research
12/10/2013

Towards deductive verification of MPI programs against session types

The Message Passing Interface (MPI) is the de facto standard message-pas...
research
01/15/2023

Synthesizing Proxy Applications for MPI Programs

Proxy applications (proxy-apps) are basic tools for evaluating the perfo...
research
03/16/2018

Combining Symbolic Execution and Model Checking to Verify MPI Programs

Message Passing Interface (MPI) is the standard paradigm of programming ...
research
05/15/2020

Elastic execution of checkpointed MPI applications

MPI applications begin with a fixed number of rank and, by default, the ...
research
12/30/2019

Performance Evaluation of Dynamic Scaling on MPI

Dynamic scaling aims to elastically change the number of processes durin...

Please sign up or login with your details

Forgot password? Click here to reset