Inspired by the remarkable success of deep neural networks, there has be...
Statistical learning theory and high dimensional statistics have had a
t...
Motivated by the striking ability of transformers for in-context learnin...
Modern machine learning applications have seen a remarkable success of
o...
Existing analyses of neural network training often operate under the
unr...
In this paper, we leverage the rapid advances in imitation learning, a t...
While deep neural networks are capable of achieving state-of-the-art
per...
Driven by the empirical success and wide use of deep neural networks,
un...
Most existing analyses of (stochastic) gradient descent rely on the cond...
We initiate a formal study of reproducibility in optimization. We define...
We investigate approximation guarantees provided by logistic regression ...
We study the non-convex matrix factorization approach to matrix completi...
Conventional wisdom in the sampling literature, backed by a popular diff...
We propose a new discretization of the mirror-Langevin diffusion and giv...
Strong refutation of random CSPs is a fundamental question in theoretica...
We study without-replacement SGD for solving finite-sum optimization
pro...
The proximal point method (PPM) is a fundamental method in optimization ...
For solving finite-sum optimization problems, SGD without replacement
sa...
We propose the first global accelerated gradient method for Riemannian
m...
Spectral clustering is a celebrated algorithm that partitions objects ba...
The maximum matching width is a graph width parameter that is defined on...
Community recovery is a central problem that arises in a wide variety of...