Optimal Transport vs. Fisher-Rao distance between Copulas for Clustering Multivariate Time Series

by Gautier Marti, et al.
ENS Lyon
Hellebore Capital Ltd

We present a methodology for clustering N objects that are each described by a multivariate time series, i.e. several sequences of real-valued random variables. The methodology leverages copulas, which are distributions encoding the dependence structure between several random variables. To take the dependence information fully into account while clustering, we need a distance between copulas. In this work, we compare well-known distances between distributions (the Fisher-Rao geodesic distance, related divergences, and optimal transport) and discuss their advantages and disadvantages. One application of this methodology is the clustering of financial assets. A tutorial, experiments, and an implementation for reproducible research can be found at www.datagrapple.com/Tech.
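To make the idea of a distance between copulas concrete, here is a minimal sketch of one of the compared options, optimal transport, applied to empirical copulas. It is an illustration under stated assumptions, not the authors' implementation: the function names `empirical_copula_sample` and `copula_wasserstein` are hypothetical, both samples are assumed to have the same length T, the ground cost is assumed to be squared Euclidean, and the transport problem is solved as an assignment problem (which is exact only for equal-size, uniformly weighted samples).

```python
import numpy as np
from scipy.optimize import linear_sum_assignment
from scipy.spatial.distance import cdist
from scipy.stats import rankdata


def empirical_copula_sample(x):
    """Map a (T, d) sample to pseudo-observations in (0, 1)^d.

    Each column is replaced by its normalized ranks, which removes the
    marginal distributions and keeps only the dependence structure.
    """
    x = np.asarray(x, dtype=float)
    n_obs = x.shape[0]
    return rankdata(x, axis=0) / (n_obs + 1)


def copula_wasserstein(x, y):
    """2-Wasserstein distance between the empirical copulas of x and y.

    Assumption: x and y are (T, d) arrays with the same T, so the optimal
    coupling between the two uniform point clouds is a permutation.
    """
    u = empirical_copula_sample(x)
    v = empirical_copula_sample(y)
    if u.shape != v.shape:
        raise ValueError("x and y must have the same shape")
    cost = cdist(u, v, metric="sqeuclidean")  # pairwise squared distances
    rows, cols = linear_sum_assignment(cost)  # minimum-cost matching
    return float(np.sqrt(cost[rows, cols].mean()))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    z = rng.normal(size=(200, 2))
    # Two bivariate series: one positively, one negatively dependent.
    pos = np.column_stack([z[:, 0], z[:, 0] + 0.1 * z[:, 1]])
    neg = np.column_stack([z[:, 0], -z[:, 0] + 0.1 * z[:, 1]])
    print(copula_wasserstein(pos, np.exp(pos)))  # same copula, new margins
    print(copula_wasserstein(pos, neg))          # different dependence
```

Because ranks are unchanged by strictly increasing transformations of each margin, the first comparison yields a zero distance, while the second is clearly positive. A pairwise matrix of such distances over the N objects could then be passed to any distance-based clustering algorithm.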




