Fault Localization in Large-Scale Network Policy Deployment

12/21/2017
by   Praveen Tammana, et al.
0

The recent advances in network management automation and Software-Defined Networking (SDN) are easing network policy management tasks. At the same time, these new technologies create a new mode of failure in the management cycle itself. Network policies are presented in an abstract model at a centralized controller and deployed as low-level rules across network devices. Thus, any software and hardware element in that cycle can be a potential cause of underlying network problems. In this paper, we present and solve a network policy fault localization problem that arises in operating policy management frameworks for a production network. We formulate our problem via risk modeling and propose a greedy algorithm that quickly localizes faulty policy objects in the network policy. We then design and develop SCOUT---a fully-automated system that produces faulty policy objects and further pinpoints physical-level failures which made the objects faulty. Evaluation results using a real testbed and extensive simulations demonstrate that SCOUT detects faulty objects with small false positives and false negatives.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/27/2020

A Security Policy Model Transformation and Verification Approach for Software Defined Networking

Software defined networking (SDN) has been adopted to enforce the securi...
research
01/10/2023

A Practical Runtime Security Policy Transformation Framework for Software Defined Networks

Software-defined networking (SDN) has been widely utilized to enforce th...
research
02/05/2019

Rama: Controller Fault Tolerance in Software-Defined Networking Made Practical

In Software-Defined Networking (SDN), network applications use the logic...
research
04/01/2019

Smart Routing: Towards Proactive Fault-Handling in Software-Defined Networks

Software-defined networking offers numerous benefits against the legacy ...
research
05/05/2023

Flock: Accurate network fault localization at scale

Inferring the root cause of failures among thousands of components in a ...
research
08/21/2018

NFV and SDN - based Distributed IoT Gateway for Large-Scale Disaster Management

Large-scale disaster management applications are among the several reali...
research
01/23/2019

Enhancing MapReduce Fault Recovery Through Binocular Speculation

MapReduce speculation plays an important role in finding potential task ...

Please sign up or login with your details

Forgot password? Click here to reset