Micro-Architectural features as soft-error induced fault executions markers in embedded safety-critical systems: a preliminary study

11/23/2022
by   Deniz Kasap, et al.
0

Radiation-induced soft errors are one of the most challenging issues in Safety Critical Real-Time Embedded System (SACRES) reliability, usually handled using different flavors of Double Modular Redundancy (DMR) techniques. This solution is becoming unaffordable due to the complexity of modern micro-processors in all domains. This paper addresses the promising field of using Artificial Intelligence (AI) based hardware detectors for soft errors. To create such cores and make them general enough to work with different software applications, microarchitectural attributes are a fascinating option as candidate fault detection features. Several processors already track these features through dedicated Performance Monitoring Unit (PMU). However, there is an open question to understand to what extent they are enough to detect faulty executions. Exploiting the capability of gem5 to simulate real computing systems, perform fault injection experiments and profile microarchitectural attributes (i.e., gem5 Stats), this paper presents the results of a comprehensive analysis regarding the potential attributes to detect soft error and the associated models that can be trained with these features.

READ FULL TEXT

page 1

page 4

research
10/23/2013

Quest-V: A Virtualized Multikernel for Safety-Critical Real-Time Systems

Modern processors are increasingly featuring multiple cores, as well as ...
research
06/04/2022

Fast and Accurate Error Simulation for CNNs against Soft Errors

The great quest for adopting AI-based computation for safety-/mission-cr...
research
06/17/2022

Experimental evaluation of neutron-induced errors on a multicore RISC-V platform

RISC-V architectures have gained importance in the last years due to the...
research
01/24/2020

Accelerating Transient Fault Injection Campaigns by using Dynamic HDL Slicing

Along with the complexity of electronic systems for safety-critical appl...
research
10/18/2020

A CAD-Based tool for fault tolerant distributed embedded systems

Reliability and availability analysis are essential in dependable critic...
research
10/12/2022

Statistical Modeling of Soft Error Influence on Neural Networks

Soft errors in large VLSI circuits pose dramatic influence on computing-...
research
07/28/2023

SafeLS: Toward Building a Lockstep NOEL-V Core

Safety-critical systems such as those in automotive, avionics and space,...

Please sign up or login with your details

Forgot password? Click here to reset