research
∙
05/20/2021
Logarithmic landscape and power-law escape rate of SGD
Stochastic gradient descent (SGD) undergoes complicated multiplicative n...
research
∙
02/10/2021
On Minibatch Noise: Discrete-Time SGD, Overparametrization, and Bayes
The noise in stochastic gradient descent (SGD), caused by minibatch samp...
research
∙
12/07/2020