Understanding ML driven HPC: Applications and Infrastructure

09/05/2019
by   Geoffrey Fox, et al.

We recently outlined the vision of "Learning Everywhere", which captures the possibility and impact of coupling learning methods with traditional HPC methods. A primary driver of such coupling is the promise that Machine Learning (ML) will give major performance improvements for traditional HPC simulations. Motivated by this potential, the ML around HPC (MLaroundHPC) class of integration is of particular significance. In a related follow-up paper, we provided an initial taxonomy for integrating learning around HPC methods. In this paper, which is part of the Learning Everywhere series, we discuss "how" learning methods and HPC simulations are being integrated to enhance the effective performance of computations. We identify several modes (substitution, assimilation, and control) in which learning methods integrate with HPC simulations, and we provide representative applications in each mode. We also discuss some open research questions, which we hope will motivate and clear the ground for MLaroundHPC benchmarks.

research
02/27/2019

Learning Everywhere: Pervasive Machine Learning for Effective High-Performance Computation

The convergence of HPC and data-intensive methodologies provide a promis...
research
08/24/2022

AI-coupled HPC Workflows

Increasingly, scientific discovery requires sophisticated and scalable w...
research
04/13/2021

Using Machine Learning at Scale in HPC Simulations with SmartSim: An Application to Ocean Climate Modeling

We demonstrate the first climate-scale, numerical ocean simulations impr...
research
08/24/2020

Integrating Machine Learning with HPC-driven Simulations for Enhanced Student Learning

We explore the idea of integrating machine learning (ML) with high perfo...
research
04/18/2022

A Taxonomy of Error Sources in HPC I/O Machine Learning Models

I/O efficiency is crucial to productivity in scientific computing, but t...
research
10/06/2021

Colmena: Scalable Machine-Learning-Based Steering of Ensemble Simulations for High Performance Computing

Scientific applications that involve simulation ensembles can be acceler...
research
12/18/2018

A Preliminary Study of Neural Network-based Approximation for HPC Applications

Machine learning, as a tool to learn and model complicated (non)linear r...
