NetClone: Fast, Scalable, and Dynamic Request Cloning for Microsecond-Scale RPCs

07/25/2023
by   Gyuyeong Kim, et al.
0

Spawning duplicate requests, called cloning, is a powerful technique to reduce tail latency by masking service-time variability. However, traditional client-based cloning is static and harmful to performance under high load, while a recent coordinator-based approach is slow and not scalable. Both approaches are insufficient to serve modern microsecond-scale Remote Procedure Calls (RPCs). To this end, we present NetClone, a request cloning system that performs cloning decisions dynamically within nanoseconds at scale. Rather than the client or the coordinator, NetClone performs request cloning in the network switch by leveraging the capability of programmable switch ASICs. Specifically, NetClone replicates requests based on server states and blocks redundant responses using request fingerprints in the switch data plane. To realize the idea while satisfying the strict hardware constraints, we address several technical challenges when designing a custom switch data plane. NetClone can be integrated with emerging in-network request schedulers like RackSched. We implement a NetClone prototype with an Intel Tofino switch and a cluster of commodity servers. Our experimental results show that NetClone can improve the tail latency of microsecond-scale RPCs for synthetic and real-world application workloads and is robust to various system conditions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/12/2020

RackSched: A Microsecond-Scale Scheduler for Rack-Scale Computers (Technical Report)

Low-latency online services have strict Service Level Objectives (SLOs) ...
research
10/26/2020

LB Scalability: Achieving the Right Balance Between Being Stateful and Stateless

A high performance Layer-4 load balancer (LB) is one of the most importa...
research
06/09/2020

Parking Packet Payload with P4

Network Function (NF) deployments suffer from poor link goodput, because...
research
02/12/2020

Taurus: An Intelligent Data Plane

Emerging applications – cloud computing, the internet of things, and aug...
research
05/29/2020

Programmable In-Network Obfuscation of Traffic

Recent advances in programmable switch hardware offer a fresh opportunit...
research
09/19/2023

EdgeP4: A P4-Programmable Edge Intelligent Ethernet Switch for Tactile Cyber-Physical Systems

Tactile Internet based operations, e.g., telesurgery, rely on end-to-end...
research
03/27/2022

Algorithmic support of a personal virtual assistant for automating the processing of client requests

This article describes creating algorithmic support for the functioning ...

Please sign up or login with your details

Forgot password? Click here to reset