We present accumulator-aware quantization (A2Q), a novel weight quantiza...
We introduce a quantization-aware training algorithm that guarantees avo...
The widespread adoption of deep neural networks in computer vision
appli...
Recent advancements in deep reinforcement learning have brought forth an...
GPU compilers are complex software programs with many optimizations spec...
Quantization and pruning are core techniques used to reduce the inferenc...
A novel energy-efficient edge computing paradigm is proposed for real-ti...
The use of Deep Learning hardware algorithms for embedded applications i...
When trained as generative models, Deep Learning algorithms have shown
e...
Stochastic-sampling-based Generative Neural Networks, such as Restricted...
The power budget for embedded hardware implementations of Deep Learning
...