FlexTOE: Flexible TCP Offload with Fine-Grained Parallelism

10/21/2021
by Rajath Shashidhara, et al.

FlexTOE is a flexible, yet high-performance TCP offload engine (TOE) for SmartNICs. FlexTOE eliminates almost all host data-path TCP processing and is fully customizable. FlexTOE interoperates well with other TCP stacks, is robust under adverse network conditions, and supports POSIX sockets. FlexTOE focuses on data-path offload of established connections, avoiding complex control logic and packet buffering in the NIC. FlexTOE leverages fine-grained parallelization of the TCP data-path and segment reordering for high performance on wimpy SmartNIC architectures, while remaining flexible via a modular design. We compare FlexTOE on an Agilio-CX40 to the host TCP stacks Linux and TAS, and to the Chelsio Terminator TOE. We find that Memcached scales up to 38% better on FlexTOE versus TAS, while saving up to 81% host CPU cycles versus Chelsio. FlexTOE provides competitive performance for RPCs, even with wimpy SmartNICs. FlexTOE cuts 99.99th-percentile RPC RTT by 3.2× and 50% versus Chelsio and TAS, respectively. FlexTOE's data-path parallelism generalizes across hardware architectures, improving single-connection RPC throughput up to 2.4× on x86 and 4× on BlueField. FlexTOE supports C and XDP programs written in eBPF. It allows us to implement popular data center transport features, such as TCP tracing, packet filtering and capture, VLAN stripping, flow classification, firewalling, and connection splicing.
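The idea of pairing fine-grained parallel processing with segment reordering can be illustrated with a small sketch. This is not FlexTOE's implementation: the names `process_segment`, `ReorderBuffer`, and `run_pipeline` are hypothetical, the per-segment work is a stand-in, and Python threads only approximate NIC cores. It assumes segments carry consecutive sequence indices, are processed concurrently with variable latency, and must be delivered in their original order.

```python
import heapq
import random
import time
from concurrent.futures import ThreadPoolExecutor, as_completed


def process_segment(seq, payload):
    """Hypothetical per-segment work; the sleep simulates variable latency."""
    time.sleep(random.uniform(0, 0.005))
    return seq, payload.upper()


class ReorderBuffer:
    """Holds out-of-order results and releases them strictly in sequence order."""

    def __init__(self, first_seq=0):
        self.next_seq = first_seq
        self.pending = []  # min-heap of (seq, item)

    def push(self, seq, item):
        """Add one result; return the (possibly empty) list now deliverable in order."""
        heapq.heappush(self.pending, (seq, item))
        released = []
        while self.pending and self.pending[0][0] == self.next_seq:
            released.append(heapq.heappop(self.pending)[1])
            self.next_seq += 1
        return released


def run_pipeline(payloads, workers=4):
    """Process segments in parallel, then restore their original order."""
    rob = ReorderBuffer()
    delivered = []
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = [pool.submit(process_segment, i, p) for i, p in enumerate(payloads)]
        for fut in as_completed(futures):  # completion order is arbitrary
            seq, out = fut.result()
            delivered.extend(rob.push(seq, out))
    return delivered


if __name__ == "__main__":
    print(run_pipeline(["syn", "data1", "data2", "fin"]))
```

The reorder buffer is what lets the workers run without coordinating with each other: each one finishes whenever it finishes, and ordering is re-established at a single point afterwards.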


