Threshold activation functions are highly preferable in neural networks ...
Understanding the fundamental mechanism behind the success of transforme...
Unrolled neural networks have recently achieved state-of-the-art acceler...
Vision transformers using self-attention or its proposed alternatives ha...
Despite several attempts, the fundamental mechanisms behind the success ...
Training deep neural networks is a well-known highly non-convex problem....
Understanding the fundamental mechanism behind the success of deep neura...
Generative Adversarial Networks (GANs) are commonly used for modeling co...
Batch Normalization (BN) is a commonly used technique to accelerate and
...
We describe the convex semi-infinite dual of the two-layer vector-output...
We study training of Convolutional Neural Networks (CNNs) with ReLU
acti...
We develop a convex analytic framework for ReLU neural networks which
el...
We develop exact representations of two layer neural networks with recti...
We study regularized deep neural networks and introduce an analytic fram...
We investigate anomaly detection in an unsupervised framework and introd...