Nesterov's accelerated quasi-Newton (L)NAQ method has been shown to accel...
Recent studies incorporate Nesterov's accelerated gradient method for th...
Recently, algorithms incorporating second-order curvature information hav...
Incorporating second-order curvature information in gradient-based metho...
A common problem in training neural networks is the vanishing and/or
exp...