Multilevel Clustering via Wasserstein Means

06/13/2017
by Nhat Ho, et al.

We propose a novel approach to the problem of multilevel clustering, which aims to simultaneously partition the data within each group and discover grouping patterns among the groups in a potentially large, hierarchically structured corpus of data. Our method involves a joint optimization formulation over several spaces of discrete probability measures, which are endowed with Wasserstein distance metrics. We propose a number of variants of this problem, which admit fast optimization algorithms by exploiting the connection to the problem of finding Wasserstein barycenters. Consistency properties are established for the estimates of both local and global clusters. Finally, experimental results with both synthetic and real data are presented to demonstrate the flexibility and scalability of the proposed approach.
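
To make the two building blocks named in the abstract concrete, here is a minimal sketch of a Wasserstein distance and a Wasserstein barycenter in the simplest possible setting. It is an illustration only, not the authors' algorithm: it assumes one-dimensional data, uniform weights, and groups of equal size, where both quantities have closed forms in terms of sorted samples. The function names `wasserstein2_1d` and `barycenter_1d` are hypothetical.

```python
from typing import List, Sequence


def wasserstein2_1d(xs: Sequence[float], ys: Sequence[float]) -> float:
    """W2 distance between two uniform empirical measures on the real line.

    Assumption of this sketch: both samples have the same number of atoms,
    so the optimal coupling simply matches sorted values in order.
    """
    if len(xs) != len(ys):
        raise ValueError("this sketch assumes equal sample sizes")
    a, b = sorted(xs), sorted(ys)
    mean_sq = sum((u - v) ** 2 for u, v in zip(a, b)) / len(a)
    return mean_sq ** 0.5


def barycenter_1d(groups: Sequence[Sequence[float]]) -> List[float]:
    """Atoms of the equal-weight W2 barycenter of 1D empirical measures.

    In one dimension the barycenter's quantile function is the average of
    the groups' quantile functions, i.e. the average of the sorted samples.
    """
    sorted_groups = [sorted(g) for g in groups]
    n = len(sorted_groups[0])
    if any(len(g) != n for g in sorted_groups):
        raise ValueError("this sketch assumes equal sample sizes")
    return [sum(col) / len(sorted_groups) for col in zip(*sorted_groups)]


# Two toy groups of observations (made-up numbers for illustration).
group_a = [0.0, 1.0, 2.0]
group_b = [4.0, 2.0, 3.0]
center = barycenter_1d([group_a, group_b])
dist_ab = wasserstein2_1d(group_a, group_b)
dist_a_center = wasserstein2_1d(group_a, center)
```

Here the barycenter sits halfway between the two groups in Wasserstein distance. In higher dimensions, or with unequal weights and supports, no such closed form exists and an optimal transport solver is needed.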

research
09/19/2019

On Efficient Multilevel Clustering via Wasserstein Distances

We propose a novel approach to the problem of multilevel clustering, whi...
research
10/29/2018

Probabilistic Multilevel Clustering via Composite Transportation Distance

We propose a novel probabilistic approach to multilevel clustering probl...
research
10/10/2019

On Scalable Variant of Wasserstein Barycenter

We study a variant of Wasserstein barycenter problem, which we refer to ...
research
05/27/2022

Efficient Forecasting of Large Scale Hierarchical Time Series via Multilevel Clustering

We propose a novel approach to the problem of clustering hierarchically ...
research
09/09/2021

On the use of Wasserstein metric in topological clustering of distributional data

This paper deals with a clustering algorithm for histogram data based on...
research
12/26/2022

Covariance-based soft clustering of functional data based on the Wasserstein-Procrustes metric

We consider the problem of clustering functional data according to their...
research
10/22/2021

Clustering Market Regimes using the Wasserstein Distance

The problem of rapid and automated detection of distinct market regimes ...
