Models derived from other models are extremely common in machine learnin...
Large language models (LLMs) power many state-of-the-art systems in natu...
We present MegaBlocks, a system for efficient Mixture-of-Experts (MoE) t...
Language models (LMs) are becoming the foundation for almost all major l...
Resource allocation problems in many computer systems can be formulated ...
Resource allocation problems in many computer systems can be formulated ...
Large language models have led to state-of-the-art accuracies across a r...
We consider the problem of assigning or allocating resources to a set of...
Specialized accelerators such as GPUs, TPUs, FPGAs, and custom ASICs hav...
Many state-of-the-art results in domains such as NLP and computer vision...
Machine learning is experiencing an explosion of software and hardware s...
Machine learning (ML) has become increasingly important and performance-...
PipeDream is a Deep Neural Network (DNN) training system for GPUs that pa...
The deep learning community has proposed optimizations spanning hardware...
Data analytics applications combine multiple functions from different li...