Approximating 1-Wasserstein Distance with Trees

06/24/2022
by   Makoto Yamada, et al.
9

Wasserstein distance, which measures the discrepancy between distributions, shows efficacy in various types of natural language processing (NLP) and computer vision (CV) applications. One of the challenges in estimating Wasserstein distance is that it is computationally expensive and does not scale well for many distribution comparison tasks. In this paper, we aim to approximate the 1-Wasserstein distance by the tree-Wasserstein distance (TWD), where TWD is a 1-Wasserstein distance with tree-based embedding and can be computed in linear time with respect to the number of nodes on a tree. More specifically, we propose a simple yet efficient L1-regularized approach to learning the weights of the edges in a tree. To this end, we first show that the 1-Wasserstein approximation problem can be formulated as a distance approximation problem using the shortest path distance on a tree. We then show that the shortest path distance can be represented by a linear model and can be formulated as a Lasso-based regression problem. Owing to the convex formulation, we can obtain a globally optimal solution efficiently. Moreover, we propose a tree-sliced variant of these methods. Through experiments, we demonstrated that the weighted TWD can accurately approximate the original 1-Wasserstein distance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/08/2021

Fixed Support Tree-Sliced Wasserstein Barycenter

The Wasserstein barycenter has been widely studied in various fields, in...
research
01/27/2021

Supervised Tree-Wasserstein Distance

To measure the similarity of documents, the Wasserstein distance is a po...
research
01/17/2018

Brenier approach for optimal transportation between a quasi-discrete measure and a discrete measure

Correctly estimating the discrepancy between two data distributions has ...
research
12/28/2018

Hybrid Wasserstein Distance and Fast Distribution Clustering

We define a modified Wasserstein distance for distribution clustering wh...
research
03/01/2021

Computationally Efficient Wasserstein Loss for Structured Labels

The problem of estimating the probability distribution of labels has bee...
research
10/10/2019

Computationally Efficient Tree Variants of Gromov-Wasserstein

We propose two novel variants of Gromov-Wasserstein (GW) between probabi...
research
01/09/2021

An Optimization Framework for Power Infrastructure Planning

The ubiquitous expansion and transformation of the energy supply system ...

Please sign up or login with your details

Forgot password? Click here to reset