A survey of deep learning optimizers: first and second order methods

11/28/2022
by Rohan V Kashyap, et al.

Deep learning optimization involves minimizing a high-dimensional loss function in weight space, a task often perceived as difficult because of saddle points, local minima, ill-conditioning of the Hessian, and limited compute resources. In this paper, we provide a comprehensive review of 12 standard optimization methods successfully used in deep learning research, together with a theoretical assessment of the difficulties in numerical optimization drawn from the optimization literature.
