
-
Particle Dual Averaging: Optimization of Mean Field Neural Networks with Global Convergence Rate Analysis
We propose the particle dual averaging (PDA) method, which generalizes t...
read it
-
When Does Preconditioning Help or Hurt Generalization?
While second order optimizers such as natural gradient descent (NGD) oft...
read it
-
On the Optimal Weighted ℓ_2 Regularization in Overparameterized Linear Regression
We consider the linear model 𝐲 = 𝐗β_⋆ + ϵ with 𝐗∈ℝ^n× p in the overparam...
read it
-
Stochastic Runge-Kutta Accelerates Langevin Monte Carlo and Beyond
Sampling with Markov chain Monte Carlo methods typically amounts to disc...
read it
-
Modeling the Biological Pathology Continuum with HSIC-regularized Wasserstein Auto-encoders
A crucial challenge in image-based modeling of biomedical data is to ide...
read it
-
Post Selection Inference with Incomplete Maximum Mean Discrepancy Estimator
Measuring divergence between two distributions is essential in machine l...
read it
-
Selecting the Best in GANs Family: a Post Selection Inference Framework
"Which Generative Adversarial Networks (GANs) generates the most plausib...
read it
-
"Dependency Bottleneck" in Auto-encoding Architectures: an Empirical Study
Recent works investigated the generalization properties in deep neural n...
read it