Developing Distributed High-performance Computing Capabilities of an Open Science Platform for Robust Epidemic Analysis

04/27/2023
by   Nicholson Collier, et al.
0

COVID-19 had an unprecedented impact on scientific collaboration. The pandemic and its broad response from the scientific community has forged new relationships among domain experts, mathematical modelers, and scientific computing specialists. Computationally, however, it also revealed critical gaps in the ability of researchers to exploit advanced computing systems. These challenging areas include gaining access to scalable computing systems, porting models and workflows to new systems, sharing data of varying sizes, and producing results that can be reproduced and validated by others. Informed by our team's work in supporting public health decision makers during the COVID-19 pandemic and by the identified capability gaps in applying high-performance computing (HPC) to the modeling of complex social systems, we present the goals, requirements, and initial implementation of OSPREY, an open science platform for robust epidemic analysis. The prototype implementation demonstrates an integrated, algorithm-driven HPC workflow architecture, coordinating tasks across federated HPC resources, with robust, secure and automated access to each of the resources. We demonstrate scalable and fault-tolerant task execution, an asynchronous API to support fast time-to-solution algorithms, an inclusive, multi-language approach, and efficient wide-area data management. The example OSPREY code is made available on a public repository.

READ FULL TEXT

page 1

page 9

research
08/08/2023

NSF RESUME HPC Workshop: High-Performance Computing and Large-Scale Data Management in Service of Epidemiological Modeling

The NSF-funded Robust Epidemic Surveillance and Modeling (RESUME) projec...
research
11/29/2019

FirecREST: RESTful API on Cray XC systems

As science gateways are becoming an increasingly popular digital interfa...
research
05/13/2021

Toward Real-time Analysis of Experimental Science Workloads on Geographically Distributed Supercomputers

Massive upgrades to science infrastructure are driving data velocities u...
research
03/26/2021

Secure Platform for Processing Sensitive Data on Shared HPC Systems

High performance computing clusters operating in shared and batch mode p...
research
06/12/2020

Workflow environments for advanced cyberinfrastructure platforms

Progress in science is deeply bound to the effective use of high-perform...
research
03/24/2020

AiiDA 1.0, a scalable computational infrastructure for automated reproducible workflows and data provenance

The ever-growing availability of computing power and the sustained devel...
research
05/28/2022

HPC Extensions to the OpenKIM Processing Pipeline

The Open Knowledgebase of Interatomic Models (OpenKIM) is an NSF Science...

Please sign up or login with your details

Forgot password? Click here to reset