
Fast Margin Maximization via Dual Acceleration
We present and analyze a momentum-based gradient method for training lin...

Early-stopped neural networks are consistent
This work studies the behavior of neural networks trained with the logis...

Generalization bounds via distillation
This paper theoretically investigates the following empirical phenomenon...

Model Generalization on COVID-19 Fake News Detection
Amid the pandemic COVID-19, the world is facing unprecedented infodemic ...

CrossNER: Evaluating Cross-Domain Named Entity Recognition
Cross-domain named entity recognition (NER) models are able to cope with...

Multi-hop Question Generation with Graph Convolutional Network
Multi-hop Question Generation (QG) aims to generate answer-related quest...

Gradient descent follows the regularization path for general losses
Recent work across many machine learning disciplines has highlighted tha...

Directional convergence and alignment in deep learning
In this paper, we show that although the minimizers of cross-entropy and...

Neural tangent kernels, transportation mappings, and universal approximation
This paper establishes rates of universal approximation for the shallow ...

Polylogarithmic width suffices for gradient descent to achieve arbitrarily small test error with shallow ReLU networks
Recent work has revealed that overparameterized networks trained by grad...

Approximation power of random neural networks
This paper investigates the approximation power of three types of random...

A refined primal-dual analysis of the implicit bias
Recent work shows that gradient descent on linearly separable data is im...

Gradient descent aligns the layers of deep linear networks
This paper establishes risk convergence and asymptotic weight matrix ali...

Risk and parameter convergence of logistic regression
The logistic loss is strictly convex and does not attain its infimum; co...

Wikidata Vandalism Detection - The Loganberry Vandalism Detector at WSDM Cup 2017
Wikidata is the new, large-scale knowledge base of the Wikimedia Foundat...

Social Welfare and Profit Maximization from Revealed Preferences
Consider the seller's problem of finding "optimal" prices for her (divis...
Ziwei Ji