PowerTCP: Pushing the Performance Limits of Datacenter Networks

12/28/2021
by   Vamsi Addanki, et al.
Increasingly stringent throughput and latency requirements in datacenter networks demand fast and accurate congestion control. We observe that the reaction time and accuracy of existing datacenter congestion control schemes are inherently limited. They either rely only on explicit feedback about the network state (e.g., queue lengths in DCTCP) or only on variations of state (e.g., RTT gradient in TIMELY). To overcome these limitations, we propose a novel congestion control algorithm, PowerTCP, which achieves much more fine-grained congestion control by adapting to the bandwidth-window product (henceforth called power). PowerTCP leverages in-band network telemetry to react to changes in the network instantaneously without loss of throughput and while keeping queues short. Due to its fast reaction time, our algorithm is particularly well-suited for dynamic network environments and bursty traffic patterns. We show analytically and empirically that PowerTCP can significantly outperform the state-of-the-art in both traditional datacenter topologies and emerging reconfigurable datacenters where frequent bandwidth changes make congestion control challenging. In traditional datacenter networks, PowerTCP reduces tail flow completion times of short flows by 80% compared to DCQCN and TIMELY, and by 33% compared to HPCC. In reconfigurable datacenters, PowerTCP achieves 85% circuit utilization without incurring additional latency and cuts tail latency by at least 2x compared to existing approaches.
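To make the idea of "adapting to the bandwidth-window product" more concrete, the following is a minimal sketch and not the algorithm from the paper. It assumes that power is the product of an observed bandwidth and an observed window (in-flight bytes), that it is normalized by the value it would take at full link utilization with an empty queue, and that the sender smooths its congestion window toward `cwnd / normalized_power`. The function names, the smoothing factor `gamma`, and the additive term `beta` are hypothetical and chosen only for illustration.

```python
def normalized_power(bandwidth, window, link_capacity, base_rtt):
    """Observed bandwidth-window product relative to an assumed equilibrium.

    bandwidth, link_capacity: bytes per second
    window: bytes in flight (assumed here to be queued bytes plus BDP)
    base_rtt: seconds
    The equilibrium is assumed to be link_capacity * BDP, i.e. a fully
    utilized link with an empty queue.
    """
    equilibrium = link_capacity * (link_capacity * base_rtt)
    return (bandwidth * window) / equilibrium


def next_cwnd(cwnd, norm_power, gamma=0.9, beta=0.0):
    """Hypothetical window update: move cwnd toward cwnd / norm_power.

    norm_power > 1 suggests congestion (shrink the window),
    norm_power < 1 suggests spare capacity (grow the window).
    gamma is an assumed smoothing weight; beta an assumed additive increase.
    """
    if norm_power <= 0:
        raise ValueError("normalized power must be positive")
    target = cwnd / norm_power + beta
    return gamma * target + (1.0 - gamma) * cwnd


if __name__ == "__main__":
    link_capacity = 12.5e9          # 100 Gbit/s expressed in bytes per second
    base_rtt = 20e-6                # 20 microseconds
    bdp = link_capacity * base_rtt  # bandwidth-delay product in bytes

    # Assumed telemetry sample: link fully used, one BDP of bytes queued.
    p = normalized_power(link_capacity, 2 * bdp, link_capacity, base_rtt)
    print("normalized power:", p)
    print("next cwnd:", next_cwnd(bdp, p))
```

In this sketch a queue holding one bandwidth-delay product of data doubles the normalized power, so the window is pulled most of the way toward half its previous value in a single update. In a real deployment the bandwidth and window samples would come from in-band network telemetry rather than from constants.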

research
09/25/2018

Pulser: Fast Congestion Response using Explicit Incast Notifications for Datacenter Networks

Datacenter applications frequently cause incast congestion, which degrad...
research
07/05/2018

Slytherin: Dynamic, Network-assisted Prioritization of Tail Packets in Datacenter Networks

Datacenter applications demand both low latency and high throughput; whi...
research
05/28/2018

Dart: Divide and Specialize for Fast Response to Congestion in RDMA-based Datacenter Networks

Though Remote Direct Memory Access (RDMA) promises to reduce datacenter ...
research
09/22/2019

Backpressure Flow Control

Effective congestion control in a multi-tenant data center is becoming i...
research
02/24/2023

Machine Learning-based Low Overhead Congestion Control Algorithm for Industrial NoCs

Network-on-Chip (NoC) congestion builds up during heavy traffic load and...
research
08/07/2023

Robustifying Measurement-Based Congestion Control Algorithms

The design methodology of congestion control algorithms (CCAs) has shift...
research
12/16/2018

Fast and Efficient Bulk Multicasting over Dedicated Inter-Datacenter Networks

Several organizations have built multiple datacenters connected via dedi...
