Integration of storage endpoints into a Rucio data lake, as an activity to prototype a SKA Regional Centres Network

03/13/2023
by   Manuel Parra-Royon, et al.
0

The Square Kilometre Array (SKA) infrastructure will consist of two radio telescopes that will be the most sensitive telescopes on Earth. The SKA community will have to process and manage near exascale data, which will be a technical challenge for the coming years. In this respect, the SKA Global Network of Regional Centres plays a key role in data distribution and management. The SRCNet will provide distributed computing and data storage capacity, as well as other important services for the network. Within the SRCNet, several teams have been set up for the research, design and development of 5 prototypes. One of these prototypes is related to data management and distribution, where a data lake has been deployed using Rucio. In this paper we focus on the tasks performed by several of the teams to deploy new storage endpoints within the SKAO data lake. In particular, we will describe the steps and deployment instructions for the services required to provide the Rucio data lake with a new Rucio Storage Element based on StoRM and WebDAV within the Spanish SRC prototype.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/08/2019

Enabling Microsoft OneDrive Integration with HTCondor

Accessing data from distributed computing is essential in many workflows...
research
03/21/2023

Asymmetric distribution of data products from WALLABY, an SKA precursor neutral hydrogen survey

The Widefield ASKAP L-band Legacy All-sky Blind surveY (WALLABY) is a ne...
research
01/27/2022

A distributed computing infrastructure for LOFAR Italian community

The LOw-Frequency ARray is a low-frequency radio interferometer composed...
research
11/27/2019

Dynamically Provisioning Cray DataWarp Storage

Complex applications and workflows needs are often exclusively expressed...
research
10/03/2022

Distributed-Something: scripts to leverage AWS storage and computing for distributed workflows at scale

Distributed-Something coordinates the distribution of any Dockerized wor...
research
12/01/2016

Operations in the era of large distributed telescopes

The previous generation of astronomical instruments tended to consist of...

Please sign up or login with your details

Forgot password? Click here to reset