We introduce the notion of a Patch Sampling Schedule (PSS), that varies ...
Weight pruning is a technique to make Deep Neural Network (DNN) inferenc...
Block Floating Point (BFP) can efficiently support quantization for Deep...
We present a novel technique, called Term Revealing (TR), for furthering...
We present a full-stack optimization framework for accelerating inferenc...
This paper describes a novel approach of packing sparse convolutional ne...
We propose the use of incomplete dot products (IDP) to dynamically adjus...
We study embedded Binarized Neural Networks (eBNNs) with the aim of allo...
Deep neural networks are state of the art methods for many learning task...
We propose distributed deep neural networks (DDNNs) over distributed
com...