Can Single-Shuffle SGD be Better than Reshuffling SGD and GD?

03/12/2021
by   Chulhee Yun, et al.
7

We propose matrix norm inequalities that extend the Recht-Ré (2012) conjecture on a noncommutative AM-GM inequality by supplementing it with another inequality that accounts for single-shuffle, which is a widely used without-replacement sampling scheme that shuffles only once in the beginning and is overlooked in the Recht-Ré conjecture. Instead of general positive semidefinite matrices, we restrict our attention to positive definite matrices with small enough condition numbers, which are more relevant to matrices that arise in the analysis of SGD. For such matrices, we conjecture that the means of matrix products corresponding to with- and without-replacement variants of SGD satisfy a series of spectral norm inequalities that can be summarized as: "single-shuffle SGD converges faster than random-reshuffle SGD, which is in turn faster than with-replacement SGD." We present theorems that support our conjecture by proving several special cases.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/02/2020

Recht-Ré Noncommutative Arithmetic-Geometric Mean Conjecture is False

Stochastic optimization algorithms have become indispensable in modern m...
research
06/12/2021

Random Shuffling Beats SGD Only After Many Epochs on Ill-Conditioned Problems

Recently, there has been much interest in studying the convergence rates...
research
05/31/2023

A family of Counterexamples on Inequality among Symmetric Functions

Inequalities among symmetric functions are fundamental questions in math...
research
04/18/2020

On Tight Convergence Rates of Without-replacement SGD

For solving finite-sum optimization problems, SGD without replacement sa...
research
06/26/2018

Random Shuffling Beats SGD after Finite Epochs

A long-standing problem in the theory of stochastic gradient descent (SG...
research
09/04/2023

Partial Proof of a Conjecture with Implications for Spectral Majorization

In this paper we report on new results relating to a conjecture regardin...
research
08/28/2018

Exponential inequality for chaos based on sampling without replacement

We are interested in the behavior of particular functionals, in a framew...

Please sign up or login with your details

Forgot password? Click here to reset