Stochastic EM for Shuffled Linear Regression

04/02/2018
by   Abubakar Abid, et al.
0

We consider the problem of inference in a linear regression model in which the relative ordering of the input features and output labels is not known. Such datasets naturally arise from experiments in which the samples are shuffled or permuted during the protocol. In this work, we propose a framework that treats the unknown permutation as a latent variable. We maximize the likelihood of observations using a stochastic expectation-maximization (EM) approach. We compare this to the dominant approach in the literature, which corresponds to hard EM in our framework. We show on synthetic data that the stochastic EM algorithm we develop has several advantages, including lower parameter error, less sensitivity to the choice of initialization, and significantly better performance on datasets that are only partially shuffled. We conclude by performing two experiments on real datasets that have been partially shuffled, in which we show that the stochastic EM algorithm can recover the weights with modest error.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/12/2018

Global Convergence of EM Algorithm for Mixtures of Two Component Linear Regression

The Expectation-Maximization algorithm is perhaps the most broadly used ...
research
10/12/2018

An Algebraic-Geometric Approach to Shuffled Linear Regression

Shuffled linear regression is the problem of performing a linear regress...
research
05/03/2017

Linear Regression with Shuffled Labels

Is it possible to perform linear regression on datasets whose labels are...
research
05/15/2019

A New Estimation Algorithm for Box-Cox Transformation Cure Rate Model and Comparison With EM Algorithm

In this paper, we develop a new estimation procedure based on the non-li...
research
06/30/2020

Sinkhorn EM: An Expectation-Maximization algorithm based on entropic optimal transport

We study Sinkhorn EM (sEM), a variant of the expectation maximization (E...
research
06/17/2013

Spectral Experts for Estimating Mixtures of Linear Regressions

Discriminative latent-variable models are typically learned using EM or ...
research
12/29/2021

Time varying regression with hidden linear dynamics

We revisit a model for time-varying linear regression that assumes the u...

Please sign up or login with your details

Forgot password? Click here to reset