Learning to Initialize Gradient Descent Using Gradient Descent

12/22/2020
by   Kartik Ahuja, et al.
0

Non-convex optimization problems are challenging to solve; the success and computational expense of a gradient descent algorithm or variant depend heavily on the initialization strategy. Often, either random initialization is used or initialization rules are carefully designed by exploiting the nature of the problem class. As a simple alternative to hand-crafted initialization rules, we propose an approach for learning "good" initialization rules from previous solutions. We provide theoretical guarantees that establish conditions that are sufficient in all cases and also necessary in some under which our approach performs better than random initialization. We apply our methodology to various non-convex problems such as generating adversarial examples, generating post hoc explanations for black-box machine learning models, and allocating communication spectrum, and show consistent gains over other initialization techniques.

READ FULL TEXT
research
11/29/2022

Mirror descent of Hopfield model

Mirror descent is a gradient descent method that uses a dual space of pa...
research
01/28/2022

Improving Group Testing via Gradient Descent

We study the problem of group testing with non-identical, independent pr...
research
05/29/2017

Gradient Descent Can Take Exponential Time to Escape Saddle Points

Although gradient descent (GD) almost always escapes saddle points asymp...
research
12/02/2020

An algorithm for non-convex off-the-grid sparse spike estimation with a minimum separation constraint

Theoretical results show that sparse off-the-grid spikes can be estimate...
research
07/10/2018

Parallax Bundle Adjustment on Manifold with Convexified Initialization

Bundle adjustment (BA) with parallax angle based feature parameterizatio...
research
03/08/2018

Reptile: a Scalable Metalearning Algorithm

This paper considers metalearning problems, where there is a distributio...
research
04/04/2023

Machine Learning Discovery of Optimal Quadrature Rules for Isogeometric Analysis

We propose the use of machine learning techniques to find optimal quadra...

Please sign up or login with your details

Forgot password? Click here to reset