Efficient Soft-Error Detection for Low-precision Deep Learning Recommendation Models

02/27/2021
by Sihuan Li, et al.

Soft errors, namely silent corruptions of signals or data in a computer system, cannot be cavalierly ignored as compute and communication density grow exponentially. Soft-error detection has been studied in the context of enterprise computing, high-performance computing and, more recently, convolutional neural networks related to autonomous driving. Deep learning recommendation systems (DLRMs) have by now become ubiquitous and serve billions of users per day. Nevertheless, DLRM-specific soft-error detection methods have so far been missing. To fill the gap, this paper presents the first set of soft-error detection methods for low-precision quantized-arithmetic operators in DLRMs, including general matrix multiplication (GEMM) and EmbeddingBag. A practical method must detect errors and do so with low overhead, lest reduced inference speed degrade user experience. Exploiting the characteristics of both quantized arithmetic and the operators, we achieved more than 95% detection accuracy for GEMM with an overhead below 20% [...] achieved 99% [...] 10% [...]
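The abstract does not spell out the detection methods. As a general illustration only, the sketch below shows a classic checksum-style check for integer GEMM (often called algorithm-based fault tolerance): the row sums of C = A·B must equal A·(B·1), which is much cheaper to compute than the product itself. This is an assumption for illustration, not the paper's method; the function names, matrix sizes and the injected bit flip are all hypothetical.

```python
import random

def matmul(a, b):
    """Plain integer matrix product C = A @ B on lists of lists."""
    n, k, m = len(a), len(b), len(b[0])
    return [[sum(a[i][p] * b[p][j] for p in range(k)) for j in range(m)]
            for i in range(n)]

def row_checksums(a, b):
    """Expected row sums of A @ B, computed as A @ (B @ ones)."""
    b_row_sums = [sum(row) for row in b]  # B @ ones, one value per row of B
    return [sum(x * s for x, s in zip(row, b_row_sums)) for row in a]

def detect_errors(c, expected):
    """Return indices of rows of C whose sum disagrees with the checksum."""
    return [i for i, row in enumerate(c) if sum(row) != expected[i]]

random.seed(0)
# Hypothetical int8-range operands, as in a quantized GEMM.
A = [[random.randint(-128, 127) for _ in range(8)] for _ in range(4)]
B = [[random.randint(-128, 127) for _ in range(5)] for _ in range(8)]

C = matmul(A, B)
expected = row_checksums(A, B)
clean = detect_errors(C, expected)      # no error injected yet

C[2][3] ^= 1 << 20                      # simulate a single bit flip in one output
corrupted = detect_errors(C, expected)  # row 2 should now be flagged

print(clean, corrupted)
```

Because integer arithmetic is exact, the comparison needs no tolerance, which is one reason checksum checks suit quantized operators. Python integers do not overflow; a real implementation with fixed-width accumulators would also have to account for wraparound.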


research
06/07/2017

Training Quantized Nets: A Deeper Understanding

Currently, deep neural networks are deployed on low-power portable devic...
research
11/29/2018

Soft-Output Detection Methods for Sparse Millimeter Wave MIMO Systems with Low-Precision ADCs

The use of low-precision analog-to-digital converters (ADCs) is a low-co...
research
01/17/2021

Acceleration of multiple precision matrix multiplication based on multi-component floating-point arithmetic using AVX2

In this paper, we report the results obtained from the acceleration of m...
research
09/12/2018

FINN-R: An End-to-End Deep-Learning Framework for Fast Exploration of Quantized Neural Networks

Convolutional Neural Networks have rapidly become the most successful ma...
research
09/20/2012

Speech Signal Filters based on Soft Computing Techniques: A Comparison

The paper presents a comparison of various soft computing techniques use...
research
03/18/2021

Reduced Precision Strategies for Deep Learning: A High Energy Physics Generative Adversarial Network Use Case

Deep learning is finding its way into high energy physics by replacing t...
research
02/18/2022

Lightweight Soft Error Resilience for In-Order Cores

Acoustic-sensor-based soft error resilience is particularly promising, s...
