Design and Evaluation of a Simple Data Interface for Efficient Data Transfer Across Diverse Storage

09/07/2020
by Zhengchun Liu, et al.

Modern science and engineering computing environments often feature storage systems of different types, from parallel file systems in high-performance computing centers to object stores operated by cloud providers. To enable easy, reliable, secure, and performant data exchange among these different systems, we propose Connector, a pluggable data access architecture for diverse, distributed storage. By hiding low-level storage system details, this abstraction permits a managed data transfer service (Globus in our case) to interact with a large and easily extended set of storage systems. Equally important, it supports third-party transfers: that is, direct data transfers from source to destination that are initiated by a third-party client but do not engage that third party in the data path. The abstraction also enables management of transfers for performance optimization, error handling, and end-to-end integrity. We present the Connector design, describe implementations for different storage services, evaluate tradeoffs inherent in managed vs. direct transfers, motivate recommended deployment options, and propose a performance model-based method that allows for easy characterization of performance in different contexts without exhaustive benchmarking.
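
To make the idea of a pluggable data access layer more concrete, the following is a minimal sketch in Python. It is not the paper's Connector API: the class names `Connector` and `InMemoryConnector`, the `read`/`write` methods, and the function `third_party_transfer` are all hypothetical and chosen only for illustration. The sketch shows a storage-agnostic interface, one toy backend, and a transfer routine that streams data from source to destination and compares checksums as a stand-in for end-to-end integrity checking. In a real third-party transfer the initiating client would sit outside the data path; here everything runs in one process for simplicity.

```python
import hashlib
from abc import ABC, abstractmethod
from typing import Dict, Iterable, Iterator


class Connector(ABC):
    """Hypothetical storage-agnostic interface (not the paper's actual API)."""

    @abstractmethod
    def read(self, path: str, chunk_size: int = 4) -> Iterator[bytes]:
        """Yield the object's bytes in chunks."""

    @abstractmethod
    def write(self, path: str, chunks: Iterable[bytes]) -> None:
        """Store the concatenation of the given chunks under path."""


class InMemoryConnector(Connector):
    """Toy backend standing in for a file system or object store."""

    def __init__(self) -> None:
        self.objects: Dict[str, bytes] = {}

    def read(self, path: str, chunk_size: int = 4) -> Iterator[bytes]:
        data = self.objects[path]
        for start in range(0, len(data), chunk_size):
            yield data[start:start + chunk_size]

    def write(self, path: str, chunks: Iterable[bytes]) -> None:
        self.objects[path] = b"".join(chunks)


def third_party_transfer(src: Connector, src_path: str,
                         dst: Connector, dst_path: str,
                         chunk_size: int = 4) -> bool:
    """Stream src_path to dst_path; return True if checksums match."""
    sent = hashlib.sha256()

    def checksummed_stream() -> Iterator[bytes]:
        # Hash each chunk as it leaves the source.
        for chunk in src.read(src_path, chunk_size):
            sent.update(chunk)
            yield chunk

    dst.write(dst_path, checksummed_stream())

    # Re-read from the destination to verify what was actually stored.
    received = hashlib.sha256()
    for chunk in dst.read(dst_path, chunk_size):
        received.update(chunk)
    return sent.hexdigest() == received.hexdigest()


if __name__ == "__main__":
    source, destination = InMemoryConnector(), InMemoryConnector()
    source.objects["a.dat"] = b"hello connector"
    ok = third_party_transfer(source, "a.dat", destination, "b.dat")
    print("integrity verified:", ok)
```

Because the transfer routine depends only on the abstract interface, supporting another storage system in this sketch would mean adding one more subclass, which is the kind of extensibility the abstract describes.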


research
07/08/2021

A Multi-Protocol, Secure, and Dynamic Data Storage Integration Framework for Multi-tenanted Science Gateway Middleware

Science gateways are user-centric, end-to-end cyberinfrastructure for ma...
research
10/26/2020

Enhancing Cloud Storage with Shareable Instances for Social Computing

Cloud storage plays an important role in social computing. This paper ai...
research
05/16/2018

Client-side Straggler-Aware I/O Scheduler for Object-based Parallel File Systems

Object-based parallel file systems have emerged as promising storage sol...
research
06/29/2018

Complying with Data Handling Requirements in Cloud Storage Systems

In past years, cloud storage systems saw an enormous rise in usage. Howe...
research
08/01/2017

Performance Measurements of Supercomputing and Cloud Storage Solutions

Increasing amounts of data from varied sources, particularly in the fiel...
research
04/13/2020

Software-Defined Network for End-to-end Networked Science at the Exascale

Domain science applications and workflow processes are currently forced ...
research
04/24/2023

A Secure Third-Party Auditing Scheme Based on Blockchain Technology in Cloud Storage

With the help of a shared pool of reconfigurable computing resources, cl...
