Artificial neural networks for online error detection

11/27/2021
by Vassilis Vassiliadis, et al.

Hardware reliability is adversely affected by the downscaling of semiconductor devices and by the scale-out of systems that modern applications require. Apart from crashes, this unreliability often manifests as silent data corruptions (SDCs), which affect application output. We therefore need low-cost solutions that require little human effort to reduce both the incidence of SDCs and their effect on the quality of application outputs. We propose Artificial Neural Networks (ANNs) as an effective mechanism for online error detection. We train the ANNs using software fault injection. We find that the average overhead of our approach, followed by a costly error correction by re-execution, is 6.45% in terms of CPU cycles. We also report that ANNs discover 94.85% [...], thereby resulting in minimal output quality degradation. To validate our approach, we overclock ARM Cortex A53 CPUs, execute benchmarks on them, and record the program outputs. ANNs prove to be an efficient error detection mechanism, better than a state-of-the-art approximate error detection mechanism (Topaz), both in terms of performance (12.81% [...]) and [...] application output (94.11% [...]).
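
As a rough illustration of the workflow the abstract describes (inject faults in software to obtain labelled samples, then detect and re-execute at run time), the following is a minimal sketch and not the authors' implementation. The single-bit-flip fault model, the `kernel` function, and the pluggable `detector` callback are all assumptions made for this example; in the paper's setting the detector would be a trained ANN.

```python
import random
import struct


def flip_bit(value: float, bit: int) -> float:
    """Return `value` with one bit of its IEEE-754 double encoding flipped."""
    if not 0 <= bit < 64:
        raise ValueError("bit must be in [0, 64)")
    (raw,) = struct.unpack("<Q", struct.pack("<d", value))
    (flipped,) = struct.unpack("<d", struct.pack("<Q", raw ^ (1 << bit)))
    return flipped


def kernel(x: float) -> float:
    """Hypothetical computation to protect (assumption, not from the paper)."""
    return 3.0 * x + 1.0


def make_training_set(n: int, rng: random.Random):
    """Build (input, output, label) samples; label 1 marks an injected fault."""
    samples = []
    for _ in range(n):
        x = rng.uniform(-10.0, 10.0)
        y = kernel(x)
        if rng.random() < 0.5:
            # Corrupt the output with a single random bit flip.
            samples.append((x, flip_bit(y, rng.randrange(64)), 1))
        else:
            samples.append((x, y, 0))
    return samples


def run_with_detection(x, compute, detector, max_retries=3):
    """Run compute(x); re-execute while the detector flags the output."""
    y = compute(x)
    retries = 0
    while detector(x, y) and retries < max_retries:
        y = compute(x)
        retries += 1
    return y, retries
```

The samples returned by `make_training_set` could be used to train any binary classifier. `run_with_detection` pays the cost of re-execution only when the detector raises a flag, which is why detector accuracy affects both overhead and output quality.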


