Demystifying the Performance of Data Transfers in High-Performance Research Networks

08/20/2023
by Ehsan Saeedizade, et al.

High-speed research networks are built to meet the ever-increasing needs of data-intensive distributed workflows. However, data transfers in these networks often fail to attain the promised transfer rates for several reasons, including I/O and network interference, server misconfigurations, and network anomalies. Although understanding the root causes of performance issues is critical to mitigating them and increasing the utilization of expensive network infrastructures, there is currently no available mechanism to monitor data transfers in these networks. In this paper, we present a scalable, end-to-end monitoring framework to gather and store key performance metrics for file transfers to shed light on the performance of transfers. The evaluation results show that the proposed framework can monitor up to 400 transfers per host and more than 40,000 transfers in total while collecting performance statistics at one-second precision. We also introduce a heuristic method to automatically process the gathered performance metrics and identify the root causes of performance anomalies with an F-score of 87-98%.
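For readers unfamiliar with the F-score reported above, the following is a minimal sketch of how such a score is typically computed from classification counts. It assumes the common F1 variant (the harmonic mean of precision and recall); the abstract does not say which variant the authors use, and the function name `f_score` and the counts in the usage line are purely illustrative, not taken from the paper.

```python
def f_score(true_pos: int, false_pos: int, false_neg: int) -> float:
    """Return the F1 score (harmonic mean of precision and recall).

    Assumption: F1 is used here for illustration; the paper's exact
    variant is not stated in the abstract.
    """
    if true_pos == 0:
        # No correct detections: precision and recall are both zero
        # (or undefined), so the score is reported as 0.
        return 0.0
    precision = true_pos / (true_pos + false_pos)
    recall = true_pos / (true_pos + false_neg)
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts: 90 anomalies labeled with the correct root cause,
# 10 labeled incorrectly, 10 missed.
example = f_score(90, 10, 10)
```

With these made-up counts, precision and recall are both 0.9, so the score is 0.9, which would sit inside the 87-98% range quoted in the abstract.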

research
09/05/2022

FIRED: a fine-grained robust performance diagnosis framework for cloud applications

To run a cloud application with the required service quality, operators ...
research
03/30/2022

CMMD: Cross-Metric Multi-Dimensional Root Cause Analysis

In large-scale online services, crucial metrics, a.k.a., key performance...
research
07/24/2019

Live Forensics for Distributed Storage Systems

We present Kaleidoscope an innovative system that supports live forensic...
research
03/07/2023

Root Cause Identification for Collective Anomalies in Time Series given an Acyclic Summary Causal Graph with Loops

This paper presents an approach for identifying the root causes of colle...
research
06/29/2023

A Survey on Enterprise Network Security: Asset Behavioral Monitoring and Distributed Attack Detection

Enterprise networks that host valuable assets and services are popular a...
research
01/22/2019

RICERCANDO: Data Mining Toolkit for Mobile Broadband Measurements

Increasing reliance on mobile broadband (MBB) networks for communication...
research
04/18/2023

Understand Data Preprocessing for Effective End-to-End Training of Deep Neural Networks

In this paper, we primarily focus on understanding the data preprocessin...
