Random projections for curves in high dimensions

07/15/2022
by   Ioannis Psarros, et al.
0

Modern time series analysis requires the ability to handle datasets that are inherently high-dimensional; examples include applications in climatology, where measurements from numerous sensors must be taken into account, or inventory tracking of large shops, where the dimension is defined by the number of tracked items. The standard way to mitigate computational issues arising from the high dimensionality of the data is by applying some dimension reduction technique that preserves the structural properties of the ambient space. The dissimilarity between two time series is often measured by “discrete” notions of distance, e.g. the dynamic time warping or the discrete Fréchet distance. Since all these distance functions are computed directly on the points of a time series, they are sensitive to different sampling rates or gaps. The continuous Fréchet distance offers a popular alternative which aims to alleviate this by taking into account all points on the polygonal curve obtained by linearly interpolating between any two consecutive points in a sequence. We study the ability of random projections à la Johnson and Lindenstrauss to preserve the continuous Fréchet distance of polygonal curves by effectively reducing the dimension. In particular, we show that one can reduce the dimension to O(ϵ^-2log N), where N is the total number of input points while preserving the continuous Fréchet distance between any two determined polygonal curves within a factor of 1±ϵ. We conclude with applications on clustering.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/09/2022

Computing Continuous Dynamic Time Warping of Time Series in Polynomial Time

Dynamic Time Warping is arguably the most popular similarity measure for...
research
07/23/2018

SETH Says: Weak Fréchet Distance is Faster, but only if it is Continuous and in One Dimension

We show by reduction from the Orthogonal Vectors problem that algorithms...
research
05/27/2018

Measuring Congruence on High Dimensional Time Series

A time series is a sequence of data items; typical examples are videos, ...
research
05/19/2020

A reduction of the dynamic time warping distance to the longest increasing subsequence length

The similarity between a pair of time series, i.e., sequences of indexed...
research
07/16/2019

Random projections and sampling algorithms for clustering of high-dimensional polygonal curves

We study the center and median clustering problems for high-dimensional ...
research
06/05/2021

Discrete Frechet distance for closed curves

The paper presents a discrete variation of the Frechet distance between ...
research
12/14/2020

Clustering high dimensional meteorological scenarios: results and performance index

The Reseau de Transport d'Electricité (RTE) is the French main electrici...

Please sign up or login with your details

Forgot password? Click here to reset