Early Abandoning and Pruning for Elastic Distances

02/10/2021
by   Matthieu Herrmann, et al.
0

Elastic distances are key tools for time series analysis. Straightforward implementations require O(n2)space and time complexities, preventing many applications from scaling to long series. Much work hasbeen devoted in speeding up these applications, mostly with the development of lower bounds, allowing to avoid costly distance computations when a given threshold is exceeded. This threshold also allows to early abandon the computation of the distance itself. Another approach, developed for DTW, is to prune parts of the computation. All these techniques are orthogonal to each other. In this work, we develop a new generic strategy, "EAPruned", that tightly integrates pruning with early abandoning. We apply it to DTW, CDTW, WDTW, ERP, MSM and TWE, showing substantial speedup in NN1-like scenarios. Pruning also shows substantial speedup for some distances, benefiting applications such as clustering where all pairwise distances are required and hence early abandoning is not applicable. We release our implementation as part of a new C++ library for time series classification, along with easy to usePython/Numpy bindings.

READ FULL TEXT

page 4

page 5

page 12

page 13

page 17

page 18

page 20

research
10/11/2020

Early Abandoning PrunedDTW and its application to similarity search

The Dynamic Time Warping ("DTW") distance is widely used in time series ...
research
02/07/2020

Equivalence relations and L^p distances between time series

We introduce a general framework for defining equivalence and measuring ...
research
01/05/2023

Exact and Heuristic Approaches to Speeding Up the MSM Time Series Distance Computation

The computation of the distance of two time series is time-consuming for...
research
09/23/2020

Segmented Pairwise Distance for Time Series with Large Discontinuities

Time series with large discontinuities are common in many scenarios. How...
research
08/29/2018

Elastic bands across the path: A new framework and methods to lower bound DTW

There has been renewed recent interest in developing effective lower bou...
research
01/04/2022

Elastic Product Quantization for Time Series

Analyzing numerous or long time series is difficult in practice due to t...
research
05/06/2021

Exact Acceleration of K-Means++ and K-Means

K-Means++ and its distributed variant K-Means have become de facto tools...

Please sign up or login with your details

Forgot password? Click here to reset