Learning Mixtures of Linear Regressions with Nearly Optimal Complexity

02/22/2018
by   Yuanzhi Li, et al.
0

Mixtures of Linear Regressions (MLR) is an important mixture model with many applications. In this model, each observation is generated from one of the several unknown linear regression components, where the identity of the generated component is also unknown. Previous works either assume strong assumptions on the data distribution or have high complexity. This paper proposes a fixed parameter tractable algorithm for the problem under general conditions, which achieves global convergence and the sample complexity scales nearly linearly in the dimension. In particular, different from previous works that require the data to be from the standard Gaussian, the algorithm allows the data from Gaussians with different covariances. When the conditional number of the covariances and the number of components are fixed, the algorithm has nearly optimal sample complexity N = Õ(d) as well as nearly optimal computational complexity Õ(Nd), where d is the dimension of the data space. To the best of our knowledge, this approach provides the first such recovery guarantee for this general setting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/09/2014

Max vs Min: Tensor Decomposition and ICA with nearly Linear Sample Complexity

We present a simple, general technique for reducing the sample complexit...
research
08/16/2022

Private Estimation with Public Data

We initiate the study of differentially private (DP) estimation with acc...
research
10/04/2016

The Search Problem in Mixture Models

We consider the task of learning the parameters of a single component o...
research
02/24/2022

On Learning Mixture Models with Sparse Parameters

Mixture models are widely used to fit complex and multimodal datasets. I...
research
06/14/2019

Disentangling Mixtures of Epidemics on Graphs

We consider the problem of learning the weighted edges of a mixture of t...
research
08/19/2016

Solving a Mixture of Many Random Linear Equations by Tensor Decomposition and Alternating Minimization

We consider the problem of solving mixed random linear equations with k ...
research
10/14/2022

Provable Subspace Identification Under Post-Nonlinear Mixtures

Unsupervised mixture learning (UML) aims at identifying linearly or nonl...

Please sign up or login with your details

Forgot password? Click here to reset