Dynamic Fault Tolerance Through Resource Pooling

02/22/2019
by   Christian M. Fuchs, et al.
0

Miniaturized satellites are currently not considered suitable for critical, high-priority, and complex multi-phased missions, due to their low reliability. As hardware-side fault tolerance (FT) solutions designed for larger spacecraft can not be adopted aboard very small satellites due to budget, energy, and size constraints, we developed a hybrid FT-approach based upon only COTS components, commodity processor cores, library IP, and standard software. This approach facilitates fault detection, isolation, and recovery in software, and utilizes fault-coverage techniques across the embedded stack within an multiprocessor system-on-chip (MPSoC). This allows our FPGA-based proof-of-concept implementation to deliver strong fault-coverage even for missions with a long duration, but also to adapt to varying performance requirements during the mission. The operator of a spacecraft utilizing this approach can define performance profiles, which allow an on-board computer (OBC) to trade between processing capacity, fault coverage, and energy consumption using simple heuristics. The software-side FT approach developed also offers advantages if deployed aboard larger spacecraft through spare resource pooling, enabling an OBC to more efficiently handle permanent faults. This FT approach in part mimics a critical biological systems's way of tolerating and adjusting to failures, enabling graceful ageing of an MPSoC.

READ FULL TEXT
research
03/21/2019

Fault-Tolerant Nanosatellite Computing on a Budget

Micro- and nanosatellites have become popular platforms for a variety of...
research
08/23/2017

Bringing Fault-Tolerant GigaHertz-Computing to Space: A Multi-Stage Software-Side Fault-Tolerance Approach for Miniaturized Spacecraft

Modern embedded technology is a driving factor in satellite miniaturizat...
research
03/15/2022

A Survey of fault models and fault tolerance methods for 2D bus-based multi-core systems and TSV based 3D NOC many-core systems

Reliability has taken centre stage in the development of high-performanc...
research
05/04/2016

A Qualitative Comparison of MPSoC Mobile and Embedded Virtualization Techniques

Virtualization is generally adopted in server and desktop environments t...
research
09/02/2021

A Novel Compaction Approach for SBST Test Programs

In-field test of processor-based devices is a must when considering safe...
research
09/23/2022

Analysis of Fault Tolerant Multi-stage Switch Architecture for TSN

We conducted the feasibility analysis of utilizing a highly available mu...
research
02/05/2019

Enhancing Fault Tolerance of Neural Networks for Security-Critical Applications

Neural Networks (NN) have recently emerged as backbone of several sensit...

Please sign up or login with your details

Forgot password? Click here to reset