Convolution is the most time-consuming operation in deep neural network
...
Understanding application resilience (or error tolerance) in the presenc...
Scaling supercomputers comes with an increase in failure rates due to th...
MPI has been ubiquitously deployed in flagship HPC systems aiming to
acc...
Tensor algebra is widely used in many applications, such as scientific
c...
Extreme-scale scientific applications can be more vulnerable to soft err...
As high-performance computing systems scale in size and computational po...