The LBNL Superfacility Project Report

06/23/2022
by   Deborah Bard, et al.
0

The Superfacility model is designed to leverage HPC for experimental science. It is more than simply a model of connected experiment, network, and HPC facilities; it encompasses the full ecosystem of infrastructure, software, tools, and expertise needed to make connected facilities easy to use. The three-year Lawrence Berkeley National Laboratory (LBNL) Superfacility project was initiated in 2019 to coordinate work being performed at LBNL to support this model, and to provide a coherent and comprehensive set of science requirements to drive existing and new work. A key component of the project was the in-depth engagements with eight science teams that represent challenging use cases across the DOE Office of Science. By the close of the project, we met our project goal by enabling our science application engagements to demonstrate automated pipelines that analyze data from remote facilities at large scale, without routine human intervention. In several cases, we have gone beyond demonstrations and now provide production-level services. To achieve this goal, the Superfacility team developed tools, infrastructure, and policies for near-real-time computing support, dynamic high-performance networking, data management and movement tools, API-driven automation, HPC-scale notebooks via Jupyter, authentication using Federated Identity and container-based edge services supported. The lessons we learned during this project provide a valuable model for future large, complex, cross-disciplinary collaborations. There is a pressing need for a coherent computing infrastructure across national facilities, and LBNL's Superfacility project is a unique model for success in tackling the challenges that will be faced in hardware, software, policies, and services across multiple science domains.

READ FULL TEXT

page 1

page 10

page 12

page 13

page 14

page 15

page 18

page 19

research
10/06/2018

Supporting High-Performance and High-Throughput Computing for Experimental Science

The advent of experimental science facilities, instruments and observato...
research
12/01/2019

LEGaTO: Low-Energy, Secure, and Resilient Toolset for Heterogeneous Computing

The LEGaTO project leverages task-based programming models to provide a ...
research
11/29/2019

FirecREST: RESTful API on Cray XC systems

As science gateways are becoming an increasingly popular digital interfa...
research
10/09/2020

Analyzing HPC Support Tickets: Experience and Recommendations

High performance computing (HPC) user support teams are the first line o...
research
03/10/2020

The Locus Algorithm III: A Grid Computing system to generate catalogues of optimised pointings for Differential Photometry

This paper discusses the hardware and software components of the Grid Co...
research
05/13/2021

Toward Real-time Analysis of Experimental Science Workloads on Geographically Distributed Supercomputers

Massive upgrades to science infrastructure are driving data velocities u...
research
12/05/2022

Where the Bee Sucks – A Dynamic Bayesian Network Approach to Decision Support for Pollinator Abundance Strategies

For policymakers wishing to make evidence-based decisions, one of the ch...

Please sign up or login with your details

Forgot password? Click here to reset