Proximal Mean Field Learning in Shallow Neural Networks

10/25/2022
by   Alexis Teter, et al.
0

Recent mean field interpretations of learning dynamics in over-parameterized neural networks offer theoretical insights on the empirical success of first order optimization algorithms in finding global minima of the nonconvex risk landscape. In this paper, we explore applying mean field learning dynamics as a computational algorithm, rather than as an analytical tool. Specifically, we design a Sinkhorn regularized proximal algorithm to approximate the distributional flow from the learning dynamics in the mean field regime over weighted point clouds. In this setting, a contractive fixed point recursion computes the time-varying weights, numerically realizing the interacting Wasserstein gradient flow of the parameter distribution supported over the neuronal ensemble. An appealing aspect of the proposed algorithm is that the measure-valued recursions allow meshless computation. We demonstrate the proposed computational framework of interacting weighted particle evolution on binary and multi-class classification. Our algorithm performs gradient descent of the free energy associated with the risk functional.

READ FULL TEXT
research
12/06/2022

Uniform-in-Time Propagation of Chaos for Mean Field Langevin Dynamics

We study the uniform-in-time propagation of chaos for mean field Langevi...
research
01/25/2022

Convex Analysis of the Mean Field Langevin Dynamics

As an example of the nonlinear Fokker-Planck equation, the mean field La...
research
02/12/2023

Mean Field Optimization Problem Regularized by Fisher Information

Recently there is a rising interest in the research of mean field optimi...
research
12/11/2018

Proximal Mean-field for Neural Network Quantization

Compressing large neural networks by quantizing the parameters, while ma...
research
06/05/2021

The Four Levels of Fixed-Points in Mean-Field Models

The fixed-point analysis refers to the study of fixed-points that arise ...
research
05/14/2022

Trajectory Inference via Mean-field Langevin in Path Space

Trajectory inference aims at recovering the dynamics of a population fro...
research
01/09/2023

On adversarial robustness and the use of Wasserstein ascent-descent dynamics to enforce it

We propose iterative algorithms to solve adversarial problems in a varie...

Please sign up or login with your details

Forgot password? Click here to reset