In today's machine learning (ML) models, any part of the training data c...
Pruning is a popular technique for reducing the model size and computati...
We revisit the design choices in Transformers, and propose methods to ad...
Cloud applications are increasingly shifting to interactive and
loosely-...
Neural network robustness has become a central topic in machine learning...
Cloud applications are increasingly shifting from large monolithic servi...
The minibatch stochastic gradient descent method (SGD) is widely applied...
This paper proposes GuardNN, a secure deep neural network (DNN) accelera...
In this paper, we propose MgX, a near-zero overhead memory protection sc...
We propose precision gating (PG), an end-to-end trainable dynamic
dual-p...
Employing deep neural networks to obtain state-of-the-art performance on...