Code modernization strategies for short-range non-bonded molecular dynamics simulations

09/22/2021
by   James Vance, et al.
0

As modern HPC systems increasingly rely on greater core counts and wider vector registers, applications need to be adapted to fully utilize these hardware capabilities. One class of applications that can benefit from this increase in parallelism are molecular dynamics simulations. In this paper, we describe our efforts at modernizing the ESPResSo++ molecular dynamics simulation package by restructuring its particle data layout for efficient memory accesses and applying vectorization techniques to benefit the calculation of short-range non-bonded forces, which results in an overall 3 times speedup and serves as a baseline for further optimizations. We also implement finer-grain parallelism for multi-core CPUs through HPX, a C++ runtime system which uses lightweight threads and an asynchronous many-task approach to maximize parallelism. Our goal is to evaluate the performance of an HPX-based approach compared to the bulk-synchronous MPI-based implementation. This requires the introduction of an additional layer to the domain decomposition scheme that defines the task granularity. On spatially inhomogeneous systems, which impose a corresponding load-imbalance in traditional MPI-based approaches, we demonstrate that by choosing an optimal task size, the efficient work-stealing mechanisms of HPX can overcome the overhead of communication resulting in an overall 1.3 times speedup compared to the baseline MPI version.

READ FULL TEXT

page 1

page 7

research
09/09/2021

Performance Analysis of CP2K Code for Ab Initio Molecular Dynamics

Using a realistic molecular catalyst system, we conduct scaling studies ...
research
07/21/2022

Quantifying Overheads in Charm++ and HPX using Task Bench

Asynchronous Many-Task (AMT) runtime systems take advantage of multi-cor...
research
02/08/2018

Tuning Streamed Applications on Intel Xeon Phi: A Machine Learning Based Approach

Many-core accelerators, as represented by the XeonPhi coprocessors and G...
research
06/23/2017

Optimizing the Performance of Reactive Molecular Dynamics Simulations for Multi-Core Architectures

Reactive molecular dynamics simulations are computationally demanding. R...
research
08/03/2017

Long range forces in a performance portable Molecular Dynamics framework

Molecular Dynamics (MD) codes predict the fundamental properties of matt...
research
08/22/2019

Network-Accelerated Non-Contiguous Memory Transfers

Applications often communicate data that is non-contiguous in the send- ...
research
11/25/2020

Rapid Exploration of Optimization Strategies on Advanced Architectures using TestSNAP and LAMMPS

The exascale race is at an end with the announcement of the Aurora and F...

Please sign up or login with your details

Forgot password? Click here to reset