Accelerating Natural Gradient with Higher-Order Invariance

03/04/2018
by Yang Song, et al.

An appealing property of the natural gradient is that it is invariant to arbitrary differentiable reparameterizations of the model. However, this invariance holds only for infinitesimal steps and is lost in practical implementations, which use small but finite step sizes. In this paper, we study invariance properties from a combined perspective of Riemannian geometry and numerical differential equation solving. We define the order of invariance of a numerical method to be its convergence order to an invariant solution. We propose using higher-order integrators and geodesic-based corrections to obtain more invariant optimization trajectories. We prove the numerical convergence properties of geodesic-corrected updates and show that they can be as computationally efficient as plain natural gradient. Experimentally, we demonstrate that invariance leads to faster training and that our techniques improve on traditional natural gradient in optimizing synthetic objectives as well as deep classifiers and autoencoders.
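The loss of invariance under finite steps can be illustrated with a toy sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the model is a Gaussian N(mu, 1) with loss 0.5 * (mu - TARGET)^2, the reparameterization is mu = exp(phi), and the midpoint rule stands in for "a higher-order integrator". The geodesic correction is not implemented here. The sketch takes one step of the natural gradient flow in each parameterization and measures how far apart the two results land when both are mapped back to mu.

```python
import math

TARGET = 1.0  # hypothetical optimum of the toy loss L(mu) = 0.5 * (mu - TARGET)**2


def flow_mu(mu):
    # For N(mu, 1) the Fisher information w.r.t. mu is 1,
    # so the natural gradient flow is d(mu)/dt = -(mu - TARGET).
    return -(mu - TARGET)


def flow_phi(phi):
    # With mu = exp(phi): Fisher = exp(2*phi), dL/dphi = (mu - TARGET) * exp(phi),
    # so d(phi)/dt = -(1 - TARGET * exp(-phi)). In continuous time this is the
    # same trajectory as flow_mu, expressed in different coordinates.
    return -(1.0 - TARGET * math.exp(-phi))


def euler_step(f, x, h):
    # First-order integrator: the plain natural gradient update with step size h.
    return x + h * f(x)


def midpoint_step(f, x, h):
    # Second-order (explicit midpoint) integrator.
    return x + h * f(x + 0.5 * h * f(x))


def invariance_gap(step, mu0, h):
    # Take one step in each parameterization and compare the results in mu.
    mu_direct = step(flow_mu, mu0, h)
    mu_via_phi = math.exp(step(flow_phi, math.log(mu0), h))
    return abs(mu_direct - mu_via_phi)


if __name__ == "__main__":
    for h in (0.1, 0.05):
        print(h, invariance_gap(euler_step, 2.0, h), invariance_gap(midpoint_step, 2.0, h))
```

In this toy setting, halving the step size shrinks the one-step gap by roughly 4x for the Euler update and roughly 8x for the midpoint update, which matches the intuition that a higher-order integrator tracks the invariant continuous-time solution more closely.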

research
06/30/2022

Invariance Properties of the Natural Gradient in Overparametrised Systems

The natural gradient field is a vector field that lives on a model equip...
research
02/14/2023

The Geometry of Neural Nets' Parameter Spaces Under Reparametrization

Model reparametrization – transforming the parameter space via a bijecti...
research
08/30/2018

A Coordinate-Free Construction of Scalable Natural Gradient

Most neural networks are trained using first-order optimization methods,...
research
12/09/2021

A More Stable Accelerated Gradient Method Inspired by Continuous-Time Perspective

Nesterov's accelerated gradient method (NAG) is widely used in problems ...
research
06/12/2020

Traversal-invariant characterizations of logarithmic space

We give a novel descriptive-complexity theoretic characterization of L a...
research
06/07/2022

Distributionally Invariant Learning: Rationalization and Practical Algorithms

The invariance property across environments is at the heart of invariant...
research
02/22/2023

A study on the invariance in security whatever the dimension of images for the steganalysis by deep-learning

In this paper, we study the performance invariance of convolutional neur...