Managed Network Services for Exascale Data Movement Across Large Global Scientific Collaborations

09/27/2022
by   Frank Würthwein, et al.
0

Unique scientific instruments designed and operated by large global collaborations are expected to produce Exabyte-scale data volumes per year by 2030. These collaborations depend on globally distributed storage and compute to turn raw data into science. While all of these infrastructures have batch scheduling capabilities to share compute, Research and Education networks lack those capabilities. There is thus uncontrolled competition for bandwidth between and within collaborations. As a result, data "hogs" disk space at processing facilities for much longer than it takes to process, leading to vastly over-provisioned storage infrastructures. Integrated co-scheduling of networks as part of high-level managed workflows might reduce these storage needs by more than an order of magnitude. This paper describes such a solution, demonstrates its functionality in the context of the Large Hadron Collider (LHC) at CERN, and presents the next-steps towards its use in production.

READ FULL TEXT
research
02/27/2023

An algorithm for geo-distributed and redundant storage in Garage

This paper presents an optimal algorithm to compute the assignment of da...
research
03/15/2022

Data Transfer and Network Services management for Domain Science Workflows

This paper describes a vision and work in progress to elevate network re...
research
12/22/2022

A Moveable Beast: Partitioning Data and Compute for Computational Storage

Over the years, hardware trends have introduced various heterogeneous co...
research
12/01/2017

DAOS for Extreme-scale Systems in Scientific Applications

Exascale I/O initiatives will require new and fully integrated I/O model...
research
07/06/2018

The SAGE Project: a Storage Centric Approach for Exascale Computing

SAGE (Percipient StorAGe for Exascale Data Centric Computing) is a Europ...
research
06/22/2022

ROIBIN-SZ: Fast and Science-Preserving Compression for Serial Crystallography

Crystallography is the leading technique to study atomic structures of p...

Please sign up or login with your details

Forgot password? Click here to reset