Permuted and Unlinked Monotone Regression in ℝ^d: an approach based on mixture modeling and optimal transport

01/10/2022
by   Martin Slawski, et al.
0

Suppose that we have a regression problem with response variable Y in ℝ^d and predictor X in ℝ^d, for d ≥ 1. In permuted or unlinked regression we have access to separate unordered data on X and Y, as opposed to data on (X,Y)-pairs in usual regression. So far in the literature the case d=1 has received attention, see e.g., the recent papers by Rigollet and Weed [Information Inference, 8, 619–717] and Balabdaoui et al. [J. Mach. Learn. Res., 22(172), 1–60]. In this paper, we consider the general multivariate setting with d ≥ 1. We show that the notion of cyclical monotonicity of the regression function is sufficient for identification and estimation in the permuted/unlinked regression model. We study permutation recovery in the permuted regression setting and develop a computationally efficient and easy-to-use algorithm for denoising based on the Kiefer-Wolfowitz [Ann. Math. Statist., 27, 887–906] nonparametric maximum likelihood estimator and techniques from the theory of optimal transport. We provide explicit upper bounds on the associated mean squared denoising error for Gaussian noise. As in previous work on the case d = 1, the permuted/unlinked setting involves slow (logarithmic) rates of convergence rooting in the underlying deconvolution problem. Numerical studies corroborate our theoretical analysis and show that the proposed approach performs at least on par with the methods in the aforementioned prior work in the case d = 1 while achieving substantial reductions in terms of computational complexity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/27/2022

On the sample complexity of entropic optimal transport

We study the sample complexity of entropic optimal transport in high dim...
research
09/14/2018

Entropic optimal transport is maximum-likelihood deconvolution

We give a statistical interpretation of entropic optimal transport by sh...
research
06/27/2018

Uncoupled isotonic regression via minimum Wasserstein deconvolution

Isotonic regression is a standard problem in shape-constrained estimatio...
research
04/19/2021

Distribution-on-Distribution Regression via Optimal Transport Maps

We present a framework for performing regression when both covariate and...
research
12/04/2021

Nonparametric mixture MLEs under Gaussian-smoothed optimal transport distance

The Gaussian-smoothed optimal transport (GOT) framework, pioneered in Go...
research
07/13/2022

Linear regression with unmatched data: a deconvolution perspective

Consider the regression problem where the response Y∈ℝ and the covariate...
research
04/24/2017

Denoising Linear Models with Permuted Data

The multivariate linear regression model with shuffled data and additive...

Please sign up or login with your details

Forgot password? Click here to reset