Reweighting samples under covariate shift using a Wasserstein distance criterion

10/19/2020
by   Julien Reygner, et al.
0

Considering two random variables with different laws to which we only have access through finite size iid samples, we address how to reweight the first sample so that its empirical distribution converges towards the true law of the second sample as the size of both samples goes to infinity. We study an optimal reweighting that minimizes the Wasserstein distance between the empirical measures of the two samples, and leads to an expression of the weights in terms of Nearest Neighbors. The consistency and some asymptotic convergence rates in terms of expected Wasserstein distance are derived, and do not need the assumption of absolute continuity of one random variable with respect to the other. These results have some application in Uncertainty Quantification for decoupled estimation and in the bound of the generalization error for the Nearest Neighbor Regression under covariate shift.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/27/2020

Exact rate of convergence of the mean Wasserstein distance between the empirical and true Gaussian distribution

We study the Wasserstein distance W_2 for Gaussian samples. We establish...
research
11/05/2021

Why the 1-Wasserstein distance is the area between the two marginal CDFs

We elucidate why the 1-Wasserstein distance W_1 coincides with the area ...
research
10/18/2022

Bagged k-Distance for Mode-Based Clustering Using the Probability of Localized Level Sets

In this paper, we propose an ensemble learning algorithm named bagged k-...
research
06/10/2021

Distributionally Robust Prescriptive Analytics with Wasserstein Distance

In prescriptive analytics, the decision-maker observes historical sample...
research
05/10/2021

Budget-limited distribution learning in multifidelity problems

Multifidelity methods are widely used for statistical estimation of quan...
research
12/02/2022

Stable Learning via Sparse Variable Independence

The problem of covariate-shift generalization has attracted intensive re...

Please sign up or login with your details

Forgot password? Click here to reset