Randomized K-FACs: Speeding up K-FAC with Randomized Numerical Linear Algebra

06/30/2022
by Constantin Octavian Puiu, et al.

K-FAC is a successful, tractable implementation of Natural Gradient (NG) for Deep Learning, but it requires computing the inverse of the Kronecker factors (through an eigen-decomposition). This can be very time-consuming, or even prohibitive, when these factors are large. In this paper, we show theoretically that, owing to the exponential-average construction of the Kronecker factors that is typically used, their eigen-spectrum must decay. We show numerically that in practice this decay is very rapid, which suggests that substantial computation can be saved by focusing only on the first few eigen-modes when inverting the Kronecker factors. Randomized Numerical Linear Algebra provides the necessary tools to do so. Numerical results show a ≈2.5× reduction in per-epoch time and a ≈3.3× reduction in time to target accuracy. We compare our proposed sped-up K-FAC versions with a more computationally efficient NG implementation, SENG, and observe that they perform on par with it.
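The idea in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the synthetic Gaussian data, the damping value, and the choice of a basic randomized range finder with power iterations are all assumptions made for illustration. The decay rate rho = 0.5 is chosen only so that the spectrum decay is easy to see in a small example, not as a recommended setting.

```python
import numpy as np

def ema_kronecker_factor(dim, batch, steps, rho, rng):
    """Build a synthetic factor as an exponential average of low-rank updates.

    Hypothetical stand-in for a K-FAC Kronecker factor: each step adds a
    PSD matrix of rank at most `batch`, and older terms are scaled by rho.
    """
    A = np.zeros((dim, dim))
    for _ in range(steps):
        X = rng.standard_normal((dim, batch))
        A = rho * A + (1.0 - rho) * (X @ X.T) / batch
    return A

def randomized_eigh(A, rank, oversample, rng, power_iters=2):
    """Approximate the top `rank` eigenpairs of a symmetric PSD matrix A."""
    dim = A.shape[0]
    # Sketch the range of A with a Gaussian test matrix.
    omega = rng.standard_normal((dim, rank + oversample))
    Q, _ = np.linalg.qr(A @ omega)
    # Power iterations sharpen the captured subspace.
    for _ in range(power_iters):
        Q, _ = np.linalg.qr(A @ Q)
    # Solve the small projected eigenproblem, then lift back.
    B = Q.T @ A @ Q
    w, V = np.linalg.eigh(B)
    top = np.argsort(w)[::-1][:rank]
    return w[top], Q @ V[:, top]

def apply_damped_inverse(w, U, damping, G):
    """Apply (U diag(w) U^T + damping * I)^{-1} to G without forming it."""
    coeff = 1.0 / (w + damping) - 1.0 / damping
    return U @ (coeff[:, None] * (U.T @ G)) + G / damping

# Small demo: compare against an exact damped solve.
rng = np.random.default_rng(0)
dim, damping = 200, 0.1
A = ema_kronecker_factor(dim, batch=2, steps=100, rho=0.5, rng=rng)
w, U = randomized_eigh(A, rank=40, oversample=10, rng=rng)
G = rng.standard_normal((dim, 5))
approx = apply_damped_inverse(w, U, damping, G)
exact = np.linalg.solve(A + damping * np.eye(dim), G)
rel_err = np.linalg.norm(approx - exact) / np.linalg.norm(exact)
print("relative error of low-rank damped inverse:", rel_err)
```

The damped inverse is applied through the identity (U diag(w) U^T + λI)^{-1} = U diag(1/(w + λ) − 1/λ) U^T + I/λ, which holds when U has orthonormal columns, so only a dim × rank matrix is stored instead of a full dim × dim inverse. With a decay rate closer to 1, the spectrum decays more slowly and a larger rank would be needed for the same accuracy.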

research
10/16/2022

Brand New K-FACs: Speeding up K-FAC with Online Decomposition Updates

K-FAC (arXiv:1503.05671, arXiv:1602.01407) is a tractable implementation...
research
12/24/2017

Lectures on Randomized Numerical Linear Algebra

This chapter is based on lectures on Randomized Numerical Linear Algebra...
research
04/29/2021

Photonic co-processors in HPC: using LightOn OPUs for Randomized Numerical Linear Algebra

Randomized Numerical Linear Algebra (RandNLA) is a powerful class of met...
research
05/19/2023

A randomized algorithm for the QR decomposition-based approximate SVD

Matrix decomposition is a very important mathematical tool in numerical ...
research
01/13/2021

A Tail Estimate with Exponential Decay for the Randomized Incremental Construction of Search Structures

We revisit the randomized incremental construction of the Trapezoidal Se...
research
11/22/2021

A Novel Randomized XR-Based Preconditioned CholeskyQR Algorithm

CholeskyQR is a simple and fast QR decomposition via Cholesky decomposit...