Optimizing Tail Latency in Commodity Datacenters using Forward Error Correction

10/28/2021
by   Zeng Gaoxiong, et al.
0

Long tail latency of short flows (or messages) greatly affects user-facing applications in datacenters. Prior solutions to the problem introduce significant implementation complexities, such as global state monitoring, complex network control, or non-trivial switch modifications. While promising superior performance, they are hard to implement in practice. This paper presents CloudBurst, a simple, effective yet readily deployable solution achieving similar or even better results without introducing the above complexities. At its core, CloudBurst explores forward error correction (FEC) over multipath - it proactively spreads FEC-coded packets generated from messages over multipath in parallel, and recovers them with the first few arriving ones. As a result, CloudBurst is able to obliviously exploit underutilized paths, thus achieving low tail latency. We have implemented CloudBurst as a user-space library, and deployed it on a testbed with commodity switches. Our testbed and simulation experiments show the superior performance of CloudBurst. For example, CloudBurst achieves 63.69 99th percentile message/flow completion time (FCT) compared to DCTCP and PIAS, respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/04/2014

RepNet: Cutting Tail Latency in Data Center Networks with Flow Replication

Data center networks need to provide low latency, especially at the tail...
research
07/05/2018

Slytherin: Dynamic, Network-assisted Prioritization of Tail Packets in Datacenter Networks

Datacenter applications demand both low latency and high throughput; whi...
research
03/02/2023

Automorphism Ensemble Polar Code Decoders for 6G URLLC

The URLLC scenario in the upcoming 6G standard requires low latency and ...
research
07/11/2022

Forward Error Correction applied to JPEG-XS codestreams

JPEG-XS offers low complexity image compression for applications with co...
research
09/29/2018

On Minimizing the Completion Times of Long Flows over Inter-Datacenter WAN

Long flows contribute huge volumes of traffic over inter-datacenter WAN....
research
05/26/2019

Evaluation of basic modules for isolated spelling error correction in Polish texts

Spelling error correction is an important problem in natural language pr...

Please sign up or login with your details

Forgot password? Click here to reset