Mixed Regression via Approximate Message Passing

04/05/2023
by   Nelvin Tan, et al.
0

We study the problem of regression in a generalized linear model (GLM) with multiple signals and latent variables. This model, which we call a matrix GLM, covers many widely studied problems in statistical learning, including mixed linear regression, max-affine regression, and mixture-of-experts. In mixed linear regression, each observation comes from one of L signal vectors (regressors), but we do not know which one; in max-affine regression, each observation comes from the maximum of L affine functions, each defined via a different signal vector. The goal in all these problems is to estimate the signals, and possibly some of the latent variables, from the observations. We propose a novel approximate message passing (AMP) algorithm for estimation in a matrix GLM and rigorously characterize its performance in the high-dimensional limit. This characterization is in terms of a state evolution recursion, which allows us to precisely compute performance measures such as the asymptotic mean-squared error. The state evolution characterization can be used to tailor the AMP algorithm to take advantage of any structural information known about the signals. Using state evolution, we derive an optimal choice of AMP `denoising' functions that minimizes the estimation error in each iteration. The theoretical results are validated by numerical simulations for mixed linear regression, max-affine regression, and mixture-of-experts. For max-affine regression, we propose an algorithm that combines AMP with expectation-maximization to estimate intercepts of the model along with the signals. The numerical results show that AMP significantly outperforms other estimators for mixed linear regression and max-affine regression in most parameter regimes.

READ FULL TEXT
research
09/15/2023

Bayes-Optimal Estimation in Generalized Linear Models via Spatial Coupling

We consider the problem of signal estimation in a generalized linear mod...
research
11/21/2022

Precise Asymptotics for Spectral Methods in Mixed Generalized Linear Models

In a mixed generalized linear model, the objective is to learn multiple ...
research
06/09/2023

Bayes optimal learning in high-dimensional linear regression with network side information

Supervised learning problems with side information in the form of a netw...
research
05/10/2019

Analysis of Approximate Message Passing with Non-Separable Denoisers and Markov Random Field Priors

Approximate message passing (AMP) is a class of low-complexity, scalable...
research
07/02/2021

Asymptotic Statistical Analysis of Sparse Group LASSO via Approximate Message Passing Algorithm

Sparse Group LASSO (SGL) is a regularized model for high-dimensional lin...
research
01/27/2021

The fundamental limits of sparse linear regression with sublinear sparsity

We establish exact asymptotic expressions for the normalized mutual info...
research
01/02/2018

Performance Limits with Additive Error Metrics in Noisy Multi-Measurement Vector Problem

Real-world applications such as magnetic resonance imaging with multiple...

Please sign up or login with your details

Forgot password? Click here to reset