Asymmetric distribution of data products from WALLABY, an SKA precursor neutral hydrogen survey

03/21/2023
by   Manuel Parra-Royon, et al.
0

The Widefield ASKAP L-band Legacy All-sky Blind surveY (WALLABY) is a neutral hydrogen survey (HI) that is running on the Australian SKA Pathfinder (ASKAP), a precursor telescope for the Square Kilometre Array (SKA). The goal of WALLABY is to use ASKAP's powerful wide-field phased array feed technology to observe three quarters of the entire sky at the 21 cm neutral hydrogen line with an angular resolution of 30 arcseconds. Post-processing activities at the Australian SKA Regional Centre (AusSRC), Canadian Initiative for Radio Astronomy Data Analysis (CIRADA) and Spanish SKA Regional Centre prototype (SPSRC) will then produce publicly available advanced data products in the form of source catalogues, kinematic models and image cutouts, respectively. These advanced data products will be generated locally at each site and distributed across the network. Over the course of the full survey we expect to replicate data up to 10 MB per source detection, which could imply an ingestion of tens of GB to be consolidated in the other locations near real time. Here, we explore the use of an asymmetric database replication model and strategy, using PostgreSQL as the engine and Bucardo as the asynchronous replication service to enable robust multi-source pools operations with data products from WALLABY. This work would serve to evaluate this type of data distribution solution across globally distributed sites. Furthermore, a set of benchmarks have been developed to confirm that the deployed model is sufficient for future scalability and remote collaboration needs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/13/2023

Integration of storage endpoints into a Rucio data lake, as an activity to prototype a SKA Regional Centres Network

The Square Kilometre Array (SKA) infrastructure will consist of two radi...
research
02/07/2018

Interpolating Distributions for Populations in Nested Geographies using Public-use Data with Application to the American Community Survey

Statistical agencies often publish multiple data products from the same ...
research
03/11/2020

Constellation: A High Performance Geo-Distributed Middlebox Framework

Middleboxes are increasingly deployed across geographically distributed ...
research
11/30/2021

Flood Analytics Information System (FAIS) Version 4.00 Manual

This project was the first attempt to use big data analytics approaches ...
research
04/04/2022

Automated generalisation of buildings using CartAGen platform

In this paper, we present a methodology to automatically derive the gene...
research
08/12/2020

The network footprint of replication in popular DBMSs

Database replication is an important component of reliable, disaster tol...

Please sign up or login with your details

Forgot password? Click here to reset