On Efficient Multilevel Clustering via Wasserstein Distances

09/19/2019
by Viet Huynh, et al.

We propose a novel approach to the problem of multilevel clustering, which aims to simultaneously partition data in each group and discover grouping patterns among groups in a potentially large hierarchically structured corpus of data. Our method involves a joint optimization formulation over several spaces of discrete probability measures, which are endowed with Wasserstein distance metrics. We propose several variants of this problem that admit fast optimization algorithms by exploiting the connection to the problem of finding Wasserstein barycenters. Consistency properties are established for the estimates of both local and global clusters. Finally, experimental results on both synthetic and real data demonstrate the flexibility and scalability of the proposed approach.
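
To make the two building blocks named in the abstract concrete, the following is a minimal sketch of a Wasserstein distance and a Wasserstein barycenter in the simplest possible setting. It is not the authors' algorithm: it assumes one-dimensional data, uniform weights, and the same number of atoms in every group, in which case the optimal transport plan simply matches sorted atoms. The function names `w2_1d` and `barycenter_1d` are hypothetical and chosen only for illustration.

```python
import numpy as np


def w2_1d(x, y):
    """2-Wasserstein distance between two uniform discrete measures on the
    real line with the same number of atoms (assumption of this sketch).
    In 1D the optimal coupling pairs the sorted atoms."""
    x = np.sort(np.asarray(x, dtype=float))
    y = np.sort(np.asarray(y, dtype=float))
    if x.shape != y.shape:
        raise ValueError("this sketch assumes equal numbers of atoms")
    return float(np.sqrt(np.mean((x - y) ** 2)))


def barycenter_1d(groups, weights=None):
    """Atoms of the 2-Wasserstein barycenter of several uniform discrete
    measures in 1D: the (weighted) average of their sorted atoms."""
    sorted_atoms = np.sort(np.asarray(groups, dtype=float), axis=1)
    return np.average(sorted_atoms, axis=0, weights=weights)


# Three toy groups, each summarized by three atoms (made-up numbers).
groups = [[0.0, 1.0, 2.0], [4.0, 2.0, 3.0], [10.0, 8.0, 9.0]]
bary = barycenter_1d(groups)
distances = [w2_1d(g, bary) for g in groups]
print(bary)
print(distances)
```

In this toy setting the barycenter plays the role of a single "global" summary of the groups, and the distances show how far each group sits from it. The general problem over multivariate measures with unequal weights requires a proper optimal transport solver.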

research
06/13/2017

Multilevel Clustering via Wasserstein Means

We propose a novel approach to the problem of multilevel clustering, whi...
research
10/29/2018

Probabilistic Multilevel Clustering via Composite Transportation Distance

We propose a novel probabilistic approach to multilevel clustering probl...
research
10/10/2019

On Scalable Variant of Wasserstein Barycenter

We study a variant of Wasserstein barycenter problem, which we refer to ...
research
05/27/2022

Efficient Forecasting of Large Scale Hierarchical Time Series via Multilevel Clustering

We propose a novel approach to the problem of clustering hierarchically ...
research
06/23/2018

Variational Wasserstein Clustering

We propose a new clustering method based on optimal transportation. We s...
research
12/26/2022

Covariance-based soft clustering of functional data based on the Wasserstein-Procrustes metric

We consider the problem of clustering functional data according to their...
research
09/30/2015

Fast Discrete Distribution Clustering Using Wasserstein Barycenter with Sparse Support

In a variety of research areas, the weighted bag of vectors and the hist...
