ThorFI: A Novel Approach for Network Fault Injection as a Service

01/19/2022
by Domenico Cotroneo, et al.

In this work, we present ThorFI, a novel fault injection solution for virtual networks in cloud computing infrastructures. ThorFI is designed to provide non-intrusive fault injection capabilities for a cloud tenant, and to isolate injections so that they do not interfere with other tenants on the infrastructure. We present the solution in the context of the OpenStack cloud management platform, and release this implementation as open-source software. Finally, we present two case studies of ThorFI, one on an NFV IMS and one on a high-availability cloud application. The case studies show that ThorFI can enhance functional tests with fault injection, as in 4 the IMS is unable to handle faults; and that, despite redundancy in virtual networks, faults in one virtual network segment can propagate to other segments and degrade the throughput and response time of the cloud application as a whole by about 3 times in the worst case.

research
09/30/2020

Fault Injection Analytics: A Novel Approach to Discover Failure Modes in Cloud-Computing Systems

Cloud computing systems fail in complex and unexpected ways due to unexp...
research
03/13/2023

CHESS: A Framework for Evaluation of Self-adaptive Systems based on Chaos Engineering

There is an increasing need to assess the correct behavior of self-adapt...
research
01/23/2021

Resilient Virtualized Systems Using ReHype

System-level virtualization introduces critical vulnerabilities to failu...
research
12/11/2020

IBIR: Bug Report driven Fault Injection

Much research on software engineering and software testing relies on exp...
research
05/28/2021

ARMORY: Fully Automated and Exhaustive Fault Simulation on ARM-M Binaries

Embedded systems are ubiquitous. However, physical access of users and l...
research
02/05/2020

Component-aware Orchestration of Cloud-based Enterprise Applications, from TOSCA to Docker and Kubernetes

Enterprise IT is currently facing the challenge of coordinating the mana...
research
01/26/2018

Enhancing Byzantine fault tolerance using MD5 checksum and delay variation in Cloud services

Cloud computing management are beyond typical human narratives. However ...
