Rama: Controller Fault Tolerance in Software-Defined Networking Made Practical

02/05/2019
by André Mantas, et al.

In Software-Defined Networking (SDN), network applications use the logically centralized network view provided by the controller to remotely orchestrate the network switches. To keep the controller from becoming a single point of failure, traditional fault-tolerance techniques are employed to guarantee availability, a fundamental requirement in production environments. Unfortunately, these techniques fall short of ensuring correct network behaviour under controller failures, because they address only part of the problem: keeping application and controller state consistent between replicas. In an SDN, however, the switches also maintain hard state that must be handled consistently. Fault-tolerant SDN must therefore incorporate switch state into the problem. A recently proposed fault-tolerant controller platform, Ravana, solves this problem by extending fault-tolerant SDN control with mechanisms that guarantee that control messages are processed transactionally and exactly once, at both the controllers and the switches. These guarantees hold even in the face of controller and switch crashes. The elegance of this solution comes at a cost: Ravana requires switches to be modified and OpenFlow to be extended with hitherto unforeseen additions to the protocol. To address this challenge, we propose Rama, a fault-tolerant SDN controller platform that offers the same strong guarantees as Ravana without requiring modifications to switches or to the OpenFlow protocol. Experiments with our prototype implementation show the additional overhead to be modest, making Rama the first fault-tolerant SDN solution that is immediately deployable.


