
-
Deformable DETR: Deformable Transformers for End-to-End Object Detection
DETR has been recently proposed to eliminate the need for many hand-desi...
read it
-
Benign Overfitting and Noisy Features
Modern machine learning often operates in the regime where the number of...
read it
-
Robust Learning Rate Selection for Stochastic Optimization via Splitting Diagnostic
This paper proposes SplitSGD, a new stochastic optimization algorithm wi...
read it
-
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
We introduce a new pre-trainable generic representation for visual-lingu...
read it
-
Algorithmic Analysis and Statistical Estimation of SLOPE via Approximate Message Passing
SLOPE is a relatively new convex optimization procedure for high-dimensi...
read it
-
Quantifying Intrinsic Uncertainty in Classification via Deep Dirichlet Mixture Networks
With the widespread success of deep neural networks in science and techn...
read it
-
Statistical Inference for Online Learning and Stochastic Approximation via Hierarchical Incremental Gradient Descent
Stochastic gradient descent (SGD) is an immensely popular approach for o...
read it
-
Statistical Inference for the Population Landscape via Moment Adjusted Stochastic Gradients
Modern statistical inference tasks often require iterative optimization ...
read it
-
When Does the First Spurious Variable Get Selected by Sequential Regression Procedures?
Applied statisticians use sequential regression procedures to produce a ...
read it
-
Private False Discovery Rate Control
We provide the first differentially private algorithms for controlling t...
read it
-
False Discoveries Occur Early on the Lasso Path
In regression settings where explanatory variables have very low correla...
read it
-
Communication-Efficient False Discovery Rate Control via Knockoff Aggregation
The false discovery rate (FDR)---the expected fraction of spurious disco...
read it
-
A Differential Equation for Modeling Nesterov's Accelerated Gradient Method: Theory and Insights
We derive a second-order ordinary differential equation (ODE) which is t...
read it