DeepAI AI Chat
Log In Sign Up

Early Abandoning and Pruning for Elastic Distances

by   Matthieu Herrmann, et al.

Elastic distances are key tools for time series analysis. Straightforward implementations require O(n2)space and time complexities, preventing many applications from scaling to long series. Much work hasbeen devoted in speeding up these applications, mostly with the development of lower bounds, allowing to avoid costly distance computations when a given threshold is exceeded. This threshold also allows to early abandon the computation of the distance itself. Another approach, developed for DTW, is to prune parts of the computation. All these techniques are orthogonal to each other. In this work, we develop a new generic strategy, "EAPruned", that tightly integrates pruning with early abandoning. We apply it to DTW, CDTW, WDTW, ERP, MSM and TWE, showing substantial speedup in NN1-like scenarios. Pruning also shows substantial speedup for some distances, benefiting applications such as clustering where all pairwise distances are required and hence early abandoning is not applicable. We release our implementation as part of a new C++ library for time series classification, along with easy to usePython/Numpy bindings.


page 4

page 5

page 12

page 13

page 17

page 18

page 20


Early Abandoning PrunedDTW and its application to similarity search

The Dynamic Time Warping ("DTW") distance is widely used in time series ...

Equivalence relations and L^p distances between time series

We introduce a general framework for defining equivalence and measuring ...

Exact and Heuristic Approaches to Speeding Up the MSM Time Series Distance Computation

The computation of the distance of two time series is time-consuming for...

Segmented Pairwise Distance for Time Series with Large Discontinuities

Time series with large discontinuities are common in many scenarios. How...

Elastic bands across the path: A new framework and methods to lower bound DTW

There has been renewed recent interest in developing effective lower bou...

Elastic Product Quantization for Time Series

Analyzing numerous or long time series is difficult in practice due to t...

Exact Acceleration of K-Means++ and K-Means

K-Means++ and its distributed variant K-Means have become de facto tools...