Linking Scientific Instruments and HPC: Patterns, Technologies, Experiences

04/11/2022
by   Rafael Vescovi, et al.
0

Powerful detectors at modern experimental facilities routinely collect data at multiple GB/s. Online analysis methods are needed to enable the collection of only interesting subsets of such massive data streams, such as by explicitly discarding some data elements or by directing instruments to relevant areas of experimental space. Such online analyses require methods for configuring and running high-performance distributed computing pipelines–what we call flows–linking instruments, HPC (e.g., for analysis, simulation, AI model training), edge computing (for analysis), data stores, metadata catalogs, and high-speed networks. In this article, we review common patterns associated with such flows and describe methods for instantiating those patterns. We also present experiences with the application of these methods to the processing of data from five different scientific instruments, each of which engages HPC resources for data inversion, machine learning model training, or other purposes. We also discuss implications of these new methods for operators and users of scientific facilities.

READ FULL TEXT

page 2

page 7

research
03/31/2023

Workflows Community Summit 2022: A Roadmap Revolution

Scientific workflows have become integral tools in broad scientific comp...
research
07/06/2023

Applying Process Mining on Scientific Workflows: a Case Study

Computer-based scientific experiments are becoming increasingly data-int...
research
03/22/2018

SCISPACE: A Scientific Collaboration Workspace for File Systems in Geo-Distributed HPC Data Centers

Future terabit networks are committed to dramatically improving big data...
research
11/22/2021

High-Performance Ptychographic Reconstruction with Federated Facilities

Beamlines at synchrotron light source facilities are powerful scientific...
research
06/16/2022

Modifying the Asynchronous Jacobi Method for Data Corruption Resilience

Moving scientific computation from high-performance computing (HPC) and ...
research
08/18/2023

Towards a Modular Architecture for Science Factories

Advances in robotic automation, high-performance computing (HPC), and ar...
research
12/19/2022

Pseudonymization at Scale: OLCF's Summit Usage Data Case Study

The analysis of vast amounts of data and the processing of complex compu...

Please sign up or login with your details

Forgot password? Click here to reset