Implicit Bias of Gradient Descent on Reparametrized Models: On Equivalence to Mirror Descent

07/08/2022
by   Zhiyuan Li, et al.
8

As part of the effort to understand implicit bias of gradient descent in overparametrized models, several results have shown how the training trajectory on the overparametrized model can be understood as mirror descent on a different objective. The main result here is a characterization of this phenomenon under a notion termed commuting parametrization, which encompasses all the previous results in this setting. It is shown that gradient flow with any commuting parametrization is equivalent to continuous mirror descent with a related Legendre function. Conversely, continuous mirror descent with any Legendre function can be viewed as gradient flow with a related commuting parametrization. The latter result relies upon Nash's embedding theorem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/14/2018

Secondary gradient descent in higher codimension

In this paper, we analyze discrete gradient descent and ϵ-noisy gradient...
research
02/06/2023

Rethinking Gauss-Newton for learning over-parameterized models

Compared to gradient descent, Gauss-Newton's method (GN) and variants ar...
research
02/09/2022

On the Implicit Bias of Gradient Descent for Temporal Extrapolation

Common practice when using recurrent neural networks (RNNs) is to apply ...
research
05/26/2022

Learning to Reason with Neural Networks: Generalization, Unseen Data and Boolean Measures

This paper considers the Pointer Value Retrieval (PVR) benchmark introdu...
research
11/27/2020

Deep orthogonal linear networks are shallow

We consider the problem of training a deep orthogonal linear network, wh...
research
10/08/2021

On the Implicit Biases of Architecture Gradient Descent

Do neural networks generalise because of bias in the functions returned ...
research
03/05/2021

Autocalibration and Tweedie-dominance for Insurance Pricing with Machine Learning

Boosting techniques and neural networks are particularly effective machi...

Please sign up or login with your details

Forgot password? Click here to reset