Leveraging User Access Patterns and Advanced Cyberinfrastructure to Accelerate Data Delivery from Shared-use Scientific Observatories

12/30/2020
by Yubo Qin, et al.

With the growing number and increasing availability of shared-use instruments and observatories, observational data is becoming an essential part of application workflows and a contributor to scientific discoveries in a range of disciplines. However, the corresponding growth in the number of users accessing these facilities, coupled with the expansion in the scale and variety of the data, is making it challenging for these facilities to ensure their data can be accessed, integrated, and analyzed in a timely manner, and is resulting in significant demands on their cyberinfrastructure (CI). In this paper, we present the design of a push-based data delivery framework that leverages emerging in-network capabilities, along with data pre-fetching techniques based on a hybrid data management model. Specifically, we analyze data access traces for two large-scale observatories, the Ocean Observatories Initiative (OOI) and the Geodetic Facility for the Advancement of Geoscience (GAGE), to identify typical user access patterns and to develop a model that can be used for data pre-fetching. Furthermore, we evaluate our data pre-fetching model and the proposed framework using a simulation of the Virtual Data Collaboratory (VDC) platform, which provides in-network data staging and processing capabilities. The results demonstrate the ability of the framework to significantly improve data delivery performance and reduce network traffic at the observatories' facilities.
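To make the idea of trace-driven pre-fetching concrete, the following is a minimal sketch, not the paper's hybrid data management model. It assumes each access trace is simply an ordered list of object identifiers requested by one user, and it uses a first-order successor count as an illustrative stand-in for the access-pattern model. All names (`build_successor_model`, `predict_next`, `staged_hit_rate`) are hypothetical.

```python
from collections import Counter, defaultdict


def build_successor_model(traces):
    """Count, for each object, which object was requested immediately after it.

    `traces` is an iterable of per-user request sequences (an assumed format).
    """
    model = defaultdict(Counter)
    for trace in traces:
        for current, following in zip(trace, trace[1:]):
            model[current][following] += 1
    return model


def predict_next(model, current, k=1):
    """Return up to k objects most often requested right after `current`."""
    successors = model.get(current, Counter())
    return [obj for obj, _ in successors.most_common(k)]


def staged_hit_rate(train_traces, test_traces, k=1):
    """Replay test traces and measure how often a request was already staged.

    After each request, the k predicted successors are "pushed" to a staging
    set, standing in for an in-network cache close to the user.
    """
    model = build_successor_model(train_traces)
    hits = 0
    total = 0
    for trace in test_traces:
        staged = set()
        for obj in trace:
            total += 1
            if obj in staged:
                hits += 1
            # Stage the predicted successors of the object just requested.
            staged = set(predict_next(model, obj, k))
    return hits / total if total else 0.0


if __name__ == "__main__":
    # Toy identifiers only; these are not real OOI or GAGE data products.
    train = [["a", "b", "c"], ["a", "b", "d"], ["a", "b", "c"]]
    test = [["a", "b", "c"]]
    print(predict_next(build_successor_model(train), "b", k=2))
    print(staged_hit_rate(train, test, k=1))
```

A higher hit rate in such a replay means more requests are served from the staging location rather than from the observatory itself, which is the kind of effect the evaluation in the abstract refers to.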


research
12/13/2021

Toward Democratizing Access to Facilities Data: A Framework for Intelligent Data Discovery and Delivery

Data collected by large-scale instruments, observatories, and sensor net...
research
05/03/2021

Analyzing scientific data sharing patterns for in-network data caching

The volume of data moving through a network increases with new scientifi...
research
04/22/2022

Some Optimization Solutions for Relief Distribution

Humanitarian logistics remain a challenging area of application for oper...
research
04/28/2023

Timely Mobile Routing: An Experimental Study

Time-critical cyber-physical applications demand the timely delivery of ...
research
05/29/2021

Log2NS: Enhancing Deep Learning Based Analysis of Logs With Formal to Prevent Survivorship Bias

Analysis of large observational data sets generated by a reactive system...
research
11/26/2020

Communication, Computing, Caching, and Sensing for Next Generation Aerial Delivery Networks

This paper describes the envisioned interactions between the information...
