KRCORE: a microsecond-scale RDMA control plane for elastic computing

12/29/2021
by   Xingda Wei, et al.
0

This paper presents KRCORE, an RDMA library with a microsecond-scale control plane on commodity RDMA-capable network interface cards. KRCORE can establish a network connection to any node in the cluster within 10μs (hundreds or thousands of times faster than existing approaches), while only maintaining a (small) fixed-size connection metadata at each node, regardless of the cluster scale. The key ideas include virtualizing a pre-initialized kernel-space RDMA connection instead of creating one from scratch, and retrofitting advanced RDMA dynamic connected transport with static transport for both low connection overhead and high resource utilization. KRCORE can shorten the worker bootstrap time of existing disaggregated key-value store (namely RACE Hashing) by 83 under load spikes. In serverless computing (namely Fn), another popular elastic and ephemeral computing system, KRCORE can reduce the latency for transferring data through RDMA by 99

READ FULL TEXT
research
06/20/2023

λFS: A Scalable and Elastic Distributed File System Metadata Service using Serverless Functions

The metadata service (MDS) sits on the critical path for distributed fil...
research
04/05/2020

Kollaps: Decentralized and Dynamic Topology Emulation

The performance and behavior of large-scale distributed applications is ...
research
03/28/2023

EJ-FAT Joint ESnet JLab FPGA Accelerated Transport Load Balancer

To increase the science rate for high data rates/volumes, Thomas Jeffers...
research
07/12/2023

Faster Control Plane Experimentation with Horse

Simulation and emulation are popular approaches for experimentation in C...
research
02/04/2022

On the Efficiency and Quality of Protection of Preprovisioning in Elastic Optical Networks

The study of protection techniques, such as pre-provisioning (off-line) ...
research
08/05/2022

Scale-friendly In-network Coordination

The programmability of modern network devices has led to innovative rese...
research
12/02/2019

GPU Algorithm for Earliest Arrival Time Problem in Public Transport Networks

Given a temporal graph G, a source vertex s, and a departure time at sou...

Please sign up or login with your details

Forgot password? Click here to reset