GROMACS in the cloud: A global supercomputer to speed up alchemical drug design

01/17/2022
by   Carsten Kutzner, et al.
0

We assess costs and efficiency of state-of-the-art high performance cloud computing compared to a traditional on-premises compute cluster. Our use case are atomistic simulations carried out with the GROMACS molecular dynamics (MD) toolkit with a focus on alchemical protein-ligand binding free energy calculations. We set up a compute cluster in the Amazon Web Services (AWS) cloud that incorporates various different instances with Intel, AMD, and ARM CPUs, some with GPU acceleration. Using representative biomolecular simulation systems we benchmark how GROMACS performs on individual instances and across multiple instances. Thereby we assess which instances deliver the highest performance and which are the most cost-efficient ones for our use case. We find that, in terms of total costs including hardware, personnel, room, energy and cooling, producing MD trajectories in the cloud can be as cost-efficient as an on-premises cluster given that optimal cloud instances are chosen. Further, we find that high-throughput ligand-screening for protein-ligand binding affinity estimation can be accelerated dramatically by using global cloud resources. For a ligand screening study consisting of 19,872 independent simulations, we used all hardware that was available in the cloud at the time of the study. The computations scaled-up to reach peak performances using more than 10,000 instances, 140,000 cores, and 3,000 GPUs simultaneously around the globe. Our simulation ensemble finished in about two days in the cloud, while weeks would be required to complete the task on a typical on-premises cluster consisting of several hundred nodes. We demonstrate that the costs of such and similar studies can be drastically reduced with a checkpoint-restart protocol that allows to use cheap Spot pricing and by using instance types with optimal cost-efficiency.

READ FULL TEXT

page 3

page 5

page 10

page 11

page 24

page 32

page 36

page 42

research
06/16/2020

High-performance cloud computing for exhaustive protein-protein docking

Public cloud computing environments, such as Amazon AWS, Microsoft Azure...
research
01/03/2018

Rapid, concurrent and adaptive extreme scale binding free energy calculation

The recently demonstrated ability to perform accurate, precise and rapid...
research
02/09/2017

Comparative benchmarking of cloud computing vendors with High Performance Linpack

We present a comparative analysis of the maximum performance achieved by...
research
09/09/2020

CASH: A Credit Aware Scheduling for Public Cloud Platforms

The public cloud offers a myriad of services which allows its tenants to...
research
07/20/2020

Parallel Performance of ARM ThunderX2 for Atomistic Simulation Algorithms

Atomistic simulation drives scientific advances in modern material scien...
research
12/26/2017

High-throughput Binding Affinity Calculations at Extreme Scales

Resistance to chemotherapy and molecularly targeted therapies is a major...
research
06/26/2019

HEATS: Heterogeneity- and Energy-Aware Task-based Scheduling

Cloud providers usually offer diverse types of hardware for their users....

Please sign up or login with your details

Forgot password? Click here to reset