Soft and subspace robust multivariate rank tests based on entropy regularized optimal transport

03/16/2021
by   Shoaib Bin Masud, et al.
0

In this paper, we extend the recently proposed multivariate rank energy distance, based on the theory of optimal transport, for statistical testing of distributional similarity, to soft rank energy distance. Being differentiable, this in turn allows us to extend the rank energy to a subspace robust rank energy distance, dubbed Projected soft-Rank Energy distance, which can be computed via optimization over the Stiefel manifold. We show via experiments that using projected soft rank energy one can trade-off the detection power vs the false alarm via projections onto an appropriately selected low dimensional subspace. We also show the utility of the proposed tests on unsupervised change point detection in multivariate time series data. All codes are publicly available at the link provided in the experiment section.

READ FULL TEXT
research
02/15/2023

On Rank Energy Statistics via Optimal Transport: Continuity, Convergence, and Change Point Detection

This paper considers the use of recently proposed optimal transport-base...
research
10/29/2021

Robust and efficient change point detection using novel multivariate rank-energy GoF test

In this paper, we use and further develop upon a recently proposed multi...
research
10/29/2021

Learning generative models for valid knockoffs using novel multivariate-rank based statistics

We consider the problem of generating valid knockoffs for knockoff filte...
research
10/08/2021

Subspace Change-Point Detection via Low-Rank Matrix Factorisation

Multivariate time series can often have a large number of dimensions, wh...
research
08/12/2021

Change Point Analysis of Multivariate Data via Multivariate Rank-based Distribution-free Nonparametric Testing Using Measure Transportation

In this paper, I propose a general algorithm for multiple change point a...
research
10/09/2019

Spatio-Temporal Alignments: Optimal transport through space and time

Comparing data defined over space and time is notoriously hard, because ...
research
09/02/2021

dbcsp: User-friendly R package for Distance-Based Common Spacial Patterns

Common Spacial Patterns (CSP) is a widely used method to analyse electro...

Please sign up or login with your details

Forgot password? Click here to reset