Following the success of GPT4, there has been a surge in interest in
mul...
With the ever-growing model size and the limited availability of labeled...
While transferring a pretrained language model, common approaches
conven...
While cross entropy (CE) is the most commonly used loss to train deep ne...
When training overparameterized deep networks for classification tasks, ...
When training deep neural networks for classification tasks, an intrigui...
Recently, over-parameterized deep networks, with increasingly more netwo...
We study the robust recovery of a low-rank matrix from sparsely and gros...
We provide the first global optimization landscape analysis of
Neural Co...
Normalization techniques have become a basic component in modern
convolu...
As science and engineering have become increasingly data-driven, the rol...
Recent advances have shown that implicit bias of gradient descent on
ove...
The problem of finding the sparsest vector (direction) in a low dimensio...
We study nonconvex optimization landscapes for learning overcomplete
rep...
Nonsmooth Riemannian optimization is a still under explored subfield of
...
Short-and-sparse deconvolution (SaSD) is the problem of extracting local...
We study the multi-channel sparse blind deconvolution (MCS-BD) problem, ...
We study the convolutional phase retrieval problem, which considers
reco...
Can we recover a complex signal from its Fourier magnitudes? More genera...
We consider the problem of recovering a complete (i.e., square and
inver...
We consider the problem of recovering a complete (i.e., square and
inver...
In this note, we focus on smooth nonconvex optimization problems that ob...
We consider the problem of recovering a complete (i.e., square and
inver...
Is it possible to find the sparsest vector (direction) in a generic subs...
Pixel-wise classification, where each pixel is assigned to a predefined
...