Fault Tolerance for Service Function Chains

01/10/2020
by Milad Ghaznavi, et al.

Traffic in enterprise networks typically traverses a sequence of middleboxes forming a service function chain, or simply a chain. The ability to tolerate failures along chains is imperative to the availability and reliability of enterprise applications. Service outages due to chain failures severely impact customers and cause significant financial losses. Making a chain fault-tolerant is challenging because, when a failure occurs, the state of the faulty middleboxes must be recovered correctly and quickly while the chain continues to provide high throughput and low latency. In this paper, we present FTC, a novel system design and protocol for fault-tolerant service function chaining. FTC provides strong consistency with up to f middlebox failures for chains of length f + 1 or longer without requiring dedicated replica nodes. In FTC, state updates caused by packet processing at a middlebox are collected, piggybacked onto the packet, and sent along the chain to be replicated. We implement and evaluate a prototype of FTC. Our results for a chain of 2-5 middleboxes show that FTC improves throughput by 2-3.5x compared with the state of the art [50] and adds only 20 µs latency overhead per middlebox. In a geo-distributed cloud deployment, our system recovers lost state in 271 ms.
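The piggybacking idea in the abstract can be illustrated with a toy simulation. This is only a sketch under stated assumptions, not FTC's actual protocol: the names `Packet`, `Middlebox`, and `run_chain` are hypothetical, the "processing" is a simple per-payload counter, and each update is assumed to be copied by the f middleboxes that follow its origin in the chain. How the real system replicates state produced near the end of the chain, and how it recovers after a failure, is not described in the abstract and is not modelled here.

```python
from dataclasses import dataclass, field


@dataclass
class Packet:
    payload: str
    # Piggybacked entries: (origin index, key, value, replicas still needed).
    piggyback: list = field(default_factory=list)


@dataclass
class Middlebox:
    index: int
    state: dict = field(default_factory=dict)     # this middlebox's own state
    replicas: dict = field(default_factory=dict)  # origin index -> copied state

    def process(self, pkt: Packet, f: int) -> Packet:
        # 1. Store a replica of every update piggybacked by upstream middleboxes.
        remaining = []
        for origin, key, value, hops in pkt.piggyback:
            self.replicas.setdefault(origin, {})[key] = value
            if hops > 1:
                # Still needs more replicas: keep it on the packet.
                remaining.append((origin, key, value, hops - 1))

        # 2. Toy packet processing (assumption): count packets per payload.
        count = self.state.get(pkt.payload, 0) + 1
        self.state[pkt.payload] = count

        # 3. Piggyback this middlebox's own update for the next f middleboxes.
        if f > 0:
            remaining.append((self.index, pkt.payload, count, f))
        pkt.piggyback = remaining
        return pkt


def run_chain(chain, payloads, f):
    """Send each payload through the chain in order, one packet at a time."""
    for payload in payloads:
        pkt = Packet(payload)
        for mb in chain:
            pkt = mb.process(pkt, f)


if __name__ == "__main__":
    chain = [Middlebox(i) for i in range(3)]
    run_chain(chain, ["a", "b", "a"], f=1)
    for mb in chain:
        print(mb.index, mb.state, mb.replicas)
```

In this sketch no separate replication messages or dedicated replica nodes are needed: the copies travel with the data packets, and each middlebox doubles as a replica for its predecessors. Updates generated by the last middlebox leave the chain without being copied, which is a limitation of the toy model rather than a statement about FTC.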


research
11/22/2021

Reliable Actors with Retry Orchestration

Enterprise cloud developers have to build applications that are resilien...
research
02/22/2018

NetChain: Scale-Free Sub-RTT Coordination (Extended Version)

Coordination services are a fundamental building block of modern cloud s...
research
05/15/2020

Efficient Network Function Backup by Update Piggybacking

Network Function Virtualization (NFV) and Service Function Chaining (SFC...
research
02/11/2021

Chiron: Optimizing Fault Tolerance in QoS-aware Distributed Stream Processing Jobs

Fault tolerance is a property which needs deeper consideration when deal...
research
05/26/2018

PAM: When Overloaded, Push Your Neighbor Aside!

Recently SmartNICs are widely used to accelerate service chains in NFV. ...
research
09/06/2021

Khaos: Dynamically Optimizing Checkpointing for Dependable Distributed Stream Processing

Distributed Stream Processing systems are becoming an increasingly essen...
research
08/01/2020

Service Chain Composition with Failures in NFV Systems: A Game-Theoretic Perspective

For state-of-the-art network function virtualization (NFV) systems, it r...
