Outlier-Robust Optimal Transport

12/14/2020
by Debarghya Mukherjee, et al.

Optimal transport (OT) measures distances between distributions in a way that depends on the geometry of the sample space. In light of recent advances in solving the OT problem, OT distances are widely used as loss functions in minimum distance estimation. Despite its prevalence and advantages, OT is extremely sensitive to outliers: a single adversarially chosen outlier can increase the OT distance arbitrarily. To address this issue, we propose an outlier-robust OT formulation. Our formulation is convex but, at first glance, challenging to scale. We therefore derive an equivalent formulation based on cost truncation that is easy to incorporate into modern stochastic algorithms for regularized OT. We demonstrate the formulation on mean estimation under the Huber contamination model in simulation and on outlier detection on real data.
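
To make the cost-truncation idea concrete, here is a minimal sketch in plain Python. It is not the authors' algorithm: it uses a textbook log-domain Sinkhorn iteration for entropy-regularized OT rather than a stochastic solver, and it assumes that "cost truncation" means capping the ground cost at a threshold. The names `sinkhorn_cost` and `truncated`, the threshold `CAP`, the regularization strength `eps`, and the toy samples are all hypothetical choices made for illustration.

```python
import math

def logsumexp(values):
    """Numerically stable log(sum(exp(v))) for a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))

def sinkhorn_cost(xs, ys, cost, eps=0.1, iters=200):
    """Transport cost <P, C> of the entropic OT plan between two uniform
    empirical measures, computed with log-domain Sinkhorn updates."""
    n, m = len(xs), len(ys)
    C = [[cost(x, y) for y in ys] for x in xs]
    log_a, log_b = -math.log(n), -math.log(m)  # uniform weights
    f, g = [0.0] * n, [0.0] * m                # dual potentials
    for _ in range(iters):
        # Each update enforces one marginal of P_ij = exp((f_i + g_j - C_ij) / eps).
        f = [eps * (log_a - logsumexp([(g[j] - C[i][j]) / eps for j in range(m)]))
             for i in range(n)]
        g = [eps * (log_b - logsumexp([(f[i] - C[i][j]) / eps for i in range(n)]))
             for j in range(m)]
    return sum(math.exp((f[i] + g[j] - C[i][j]) / eps) * C[i][j]
               for i in range(n) for j in range(m))

def abs_cost(x, y):
    return abs(x - y)

CAP = 2.0  # hypothetical truncation threshold (an assumption, not from the source)

def truncated(x, y):
    # Assumed form of cost truncation: the ground cost is capped at CAP.
    return min(abs_cost(x, y), CAP)

clean = [0.0, 1.0, 2.0, 3.0, 4.0]
contaminated = [0.0, 1.0, 2.0, 3.0, 1000.0]  # one outlier
farther = [0.0, 1.0, 2.0, 3.0, 1_000_000.0]  # same outlier, pushed farther away

plain = sinkhorn_cost(clean, contaminated, abs_cost)
plain_far = sinkhorn_cost(clean, farther, abs_cost)
robust = sinkhorn_cost(clean, contaminated, truncated)
robust_far = sinkhorn_cost(clean, farther, truncated)

print("plain cost:    ", plain, plain_far)
print("truncated cost:", robust, robust_far)
```

In this sketch the untruncated cost grows with the position of the single outlier, while the truncated cost can never exceed `CAP`, because every entry of the cost matrix is at most `CAP` and the plan has total mass one. Swapping the cost function is the only change to the solver, which is why truncation slots easily into existing regularized-OT code.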

research
11/02/2021

Outlier-Robust Optimal Transport: Duality, Structure, and Statistical Analysis

The Wasserstein distance, rooted in optimal transport (OT) theory, is a ...
research
06/23/2022

On making optimal transport robust to all outliers

Optimal transport (OT) is known to be sensitive against outliers because...
research
11/01/2022

Meta-Learning for Unsupervised Outlier Detection with Optimal Transport

Automated machine learning has been widely researched and adopted in the...
research
10/18/2022

Multivariate outlier explanations using Shapley values and Mahalanobis distances

For the purpose of explaining multivariate outlyingness, it is shown tha...
research
06/22/2006

Outlier Robust ICP for Minimizing Fractional RMSD

We describe a variation of the iterative closest point (ICP) algorithm f...
research
02/17/2021

Robust Mean Estimation in High Dimensions via Global Outlier Pursuit

We study the robust mean estimation problem in high dimensions, where le...
research
03/03/2020

Online Sinkhorn: optimal transportation distances from sample streams

Optimal Transport (OT) distances are now routinely used as loss function...
