Optimistic Adaptive Acceleration for Optimization

03/04/2019

∙

We consider a new variant of AMSGrad. AMSGrad RKK18 is a popular adaptive gradient based optimization algorithm that is widely used in training deep neural networks. Our new variant of the algorithm assumes that mini-batch gradients in consecutive iterations have some underlying structure, which makes the gradients sequentially predictable. By exploiting the predictability and some ideas from the field of Optimistic Online learning, the new algorithm can accelerate the convergence and enjoy a tighter regret bound. We conduct experiments on training various neural networks on several datasets to show that the proposed method speeds up the convergence in practice.

READ FULL TEXT

Optimistic Adaptive Acceleration for Optimization

Sign in with Google

Consider DeepAI Pro