NVIDIA Tensor Core is a mixed-precision matrix-matrix multiplication and...
Formula-driven supervised learning (FDSL) is a pre-training method that
...
Deep learning hardware achieves high throughput and low power consumptio...
Gradient preconditioning is a key technique to integrate the second-orde...
Random projection can reduce the dimension of data while capturing its
s...
Quantum circuit simulation provides the foundation for the development o...
Formula-driven supervised learning (FDSL) has been shown to be an effect...
Among various supervised deep metric learning methods proxy-based approa...
Modern deep learning systems are fragile and do not generalize well unde...
Factorization of large dense matrices are ubiquitous in engineering and ...
We present two new algorithms for Householder QR factorization of Block
...
In the present work, we show that the performance of formula-driven
supe...
Tensor Core is a mixed-precision matrix-matrix multiplication unit on NV...
Inverse Reinforcement Learning (IRL) is attractive in scenarios where re...
The use of iterative pose refinement is a critical processing step for 6...
This paper describes a viewpoint-robust object-based change detection ne...
Large-scale distributed training of deep neural networks results in mode...
Hierarchical Matrix (H-matrix) is an approximation technique which split...
Bayesian methods promise to fix many shortcomings of deep learning, but ...
Large-scale distributed training of deep neural networks suffer from the...
Algorithmic and architecture-oriented optimizations are essential for
ac...