Shampoo is an online and stochastic optimization algorithm belonging to ...
Recommendation models are very large, requiring terabytes (TB) of memory...
A key characteristic of deep recommendation models is the immense memory...
We explore a novel approach for building DNN training clusters using
com...
Neural architecture search (NAS) methods aim to automatically find the
o...
Deep Learning Recommendation Models (DLRM) are widespread, account for a...
Deep learning recommendation models (DLRMs) are used across many
busines...
Checkpoints play an important role in training recommendation systems at...
Large-scale training is important to ensure high performance and accurac...
Ensemble learning is a very prevalent method employed in machine learnin...
Personalized recommendation systems leverage deep learning models and ac...
In many real-world applications, e.g. recommendation systems, certain it...
Modern deep learning-based recommendation systems exploit hundreds to
th...
The widespread application of deep learning has changed the landscape of...
With the advent of deep learning, neural network-based recommendation mo...
This paper presents the first comprehensive empirical study demonstratin...
In this paper, we propose a Distributed Accumulated Newton Conjugate gra...
The standard L-BFGS method relies on gradient approximations that are no...
The state-of-the-art (SOTA) for mixed precision training is dominated by...
The exponential growth in use of large deep neural networks has accelera...
Imitation learning algorithms learn viable policies by imitating an expe...
Sub-8-bit representation of DNNs incur some discernible loss of accuracy...
We propose a novel fine-grained quantization (FGQ) method to ternarize
p...
We propose a cluster-based quantization method to convert pre-trained fu...
Many high performance-computing algorithms are bandwidth limited, hence ...