A Study of Network Congestion in Two Supercomputing High-Speed Interconnects

07/11/2019
by   Saurabh Jha, et al.
0

Network congestion in high-speed interconnects is a major source of application run time performance variation. Recent years have witnessed a surge of interest from both academia and industry in the development of novel approaches for congestion control at the network level and in application placement, mapping, and scheduling at the system-level. However, these studies are based on proxy applications and benchmarks that are not representative of field-congestion characteristics of high-speed interconnects. To address this gap, we present (a) an end-to-end framework for monitoring and analysis to support long-term field-congestion characterization studies, and (b) an empirical study of network congestion in petascale systems across two different interconnect technologies: (i) Cray Gemini, which uses a 3-D torus topology, and (ii) Cray Aries, which uses the DragonFly topology.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2021

An Algorithm for Flow Control in Computer Networks Based in Discrete Control Theory

Developing of an effective flow control algorithm to avoid congestion is...
research
03/09/2021

Congestion control in high-speed networks using the probabilistic estimation approach

Nowadays, the bulk of Internet traffic uses TCP protocol for reliable tr...
research
12/14/2020

Application-aware Congestion Mitigation for High-Performance Computing Systems

High-performance computing (HPC) systems frequently experience congestio...
research
07/22/2022

Impact of RoCE Congestion Control Policies on Distributed Training of DNNs

RDMA over Converged Ethernet (RoCE) has gained significant attraction fo...
research
02/27/2019

Security Function Analysis on Performance of High-Speed Router Networking

A router is a device, in the computer networks, that is used to forward ...
research
01/29/2018

Using High-Speed WANs and Network Data Caches to Enable Remote and Distributed Visualization

Visapult is a prototype application and framework for remote visualizati...
research
09/05/2019

Is two greater than one?: Analyzing Multipath TCP over Dual-LTE in the Wild

Multipath TCP (MPTCP) is a standardized TCP extension which allows end-h...

Please sign up or login with your details

Forgot password? Click here to reset