Mean-field methods and algorithmic perspectives for high-dimensional machine learning

03/10/2021
by   Benjamin Aubin, et al.
0

The main difficulty that arises in the analysis of most machine learning algorithms is to handle, analytically and numerically, a large number of interacting random variables. In this Ph.D manuscript, we revisit an approach based on the tools of statistical physics of disordered systems. Developed through a rich literature, they have been precisely designed to infer the macroscopic behavior of a large number of particles from their microscopic interactions. At the heart of this work, we strongly capitalize on the deep connection between the replica method and message passing algorithms in order to shed light on the phase diagrams of various theoretical models, with an emphasis on the potential differences between statistical and algorithmic thresholds. We essentially focus on synthetic tasks and data generated in the teacher-student paradigm. In particular, we apply these mean-field methods to the Bayes-optimal analysis of committee machines, to the worst-case analysis of Rademacher generalization bounds for perceptrons, and to empirical risk minimization in the context of generalized linear models. Finally, we develop a framework to analyze estimation models with structured prior informations, produced for instance by deep neural networks based generative models with random weights.

READ FULL TEXT
research
11/03/2019

Mean-field inference methods for neural networks

Machine learning algorithms relying on deep neural networks recently all...
research
11/23/2020

Restricted Boltzmann Machine, recent advances and mean-field theory

This review deals with Restricted Boltzmann Machine (RBM) under the ligh...
research
06/20/2019

High-temperature Expansions and Message Passing Algorithms

Improved mean-field technics are a central theme of statistical physics ...
research
07/08/2019

Mean field models for large data-clustering problems

We consider mean-field models for data--clustering problems starting fro...
research
06/11/2020

Asymptotic Errors for Teacher-Student Convex Generalized Linear Models (or : How to Prove Kabashima's Replica Formula)

There has been a recent surge of interest in the study of asymptotic rec...
research
05/15/2017

A statistical physics approach to learning curves for the Inverse Ising problem

Using methods of statistical physics, we analyse the error of learning c...
research
09/13/2021

On the regularized risk of distributionally robust learning over deep neural networks

In this paper we explore the relation between distributionally robust le...

Please sign up or login with your details

Forgot password? Click here to reset