Colmena: Scalable Machine-Learning-Based Steering of Ensemble Simulations for High Performance Computing

10/06/2021
by   Logan Ward, et al.
17

Scientific applications that involve simulation ensembles can be accelerated greatly by using experiment design methods to select the best simulations to perform. Methods that use machine learning (ML) to create proxy models of simulations show particular promise for guiding ensembles but are challenging to deploy because of the need to coordinate dynamic mixes of simulation and learning tasks. We present Colmena, an open-source Python framework that allows users to steer campaigns by providing just the implementations of individual tasks plus the logic used to choose which tasks to execute when. Colmena handles task dispatch, results collation, ML model invocation, and ML model (re)training, using Parsl to execute tasks on HPC systems. We describe the design of Colmena and illustrate its capabilities by applying it to electrolyte design, where it both scales to 65536 CPUs and accelerates the discovery rate for high-performance molecules by a factor of 100 over unguided searches.

READ FULL TEXT
research
04/10/2021

Achieving 100X faster simulations of complex biological phenomena by coupling ML to HPC ensembles

The use of ML methods to dynamically steer ensemble-based simulations pr...
research
12/05/2019

Merlin: Enabling Machine Learning-Ready HPC Ensembles

With the growing complexity of computational and experimental facilities...
research
04/13/2021

Using Machine Learning at Scale in HPC Simulations with SmartSim: An Application to Ocean Climate Modeling

We demonstrate the first climate-scale, numerical ocean simulations impr...
research
08/24/2020

Integrating Machine Learning with HPC-driven Simulations for Enhanced Student Learning

We explore the idea of integrating machine learning (ML) with high perfo...
research
09/05/2019

Understanding ML driven HPC: Applications and Infrastructure

We recently outlined the vision of "Learning Everywhere" which captures ...
research
06/09/2021

StreamBrain: An HPC Framework for Brain-like Neural Networks on CPUs, GPUs and FPGAs

The modern deep learning method based on backpropagation has surged in p...

Please sign up or login with your details

Forgot password? Click here to reset