-
Just Pick a Sign: Optimizing Deep Multitask Models with Gradient Sign Dropout
The vast majority of deep models use multiple gradient signals, typicall...
read it
-
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
Neural network scaling has been critical for improving the model quality...
read it
-
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Lingvo is a Tensorflow framework offering a complete solution for collab...
read it
-
GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism
GPipe is a scalable pipeline parallelism library that enables learning o...
read it
-
Regularized Evolution for Image Classifier Architecture Search
The effort devoted to hand-crafting image classifiers has motivated the ...
read it
-
Learning Efficient Representations for Reinforcement Learning
Markov decision processes (MDPs) are a well studied framework for solvin...
read it
-
Partitioning Large Scale Deep Belief Networks Using Dropout
Deep learning methods have shown great promise in many practical applica...
read it

Yanping Huang
is this you? claim profile