It has been widely observed in training of neural networks that when app...
We introduce ProxSkip – a surprisingly simple and provably
efficient met...
While SGD, which samples from the data with replacement is widely studie...
We present the first accelerated randomized algorithm for solving linear...
We consider a stochastic continuum armed bandit problem where the arms a...