Data Smashing 2.0: Sequence Likelihood (SL) Divergence For Fast Time Series Comparison

09/26/2019
by   Yi Huang, et al.
17

Recognizing subtle historical patterns is central to modeling and forecasting problems in time series analysis. Here we introduce and develop a new approach to quantify deviations in the underlying hidden generators of observed data streams, resulting in a new efficiently computable universal metric for time series. The proposed metric is in the sense that we can compare and contrast data streams regardless of where and how they are generated and without any feature engineering step. The approach proposed in this paper is conceptually distinct from our previous work on data smashing, and vastly improves discrimination performance and computing speed. The core idea here is the generalization of the notion of KL divergence often used to compare probability distributions to a notion of divergence in time series. We call this the sequence likelihood (SL) divergence, which may be used to measure deviations within a well-defined class of discrete-valued stochastic processes. We devise efficient estimators of SL divergence from finite sample paths and subsequently formulate a universal metric useful for computing distance between time series produced by hidden stochastic generators.

READ FULL TEXT

page 9

page 10

page 11

05/09/2018

Foundations of Sequence-to-Sequence Modeling for Time Series

The availability of large amounts of time series data, paired with the p...
03/30/2021

Historical Inertia: An Ignored but Powerful Baseline for Long Sequence Time-series Forecasting

Long sequence time-series forecasting (LSTF) has become increasingly pop...
05/27/2018

Measuring Congruence on High Dimensional Time Series

A time series is a sequence of data items; typical examples are videos, ...
10/16/2020

Differentiable Divergences Between Time Series

Computing the discrepancy between time series of variable sizes is notor...
11/29/2018

Recurrent Deep Divergence-based Clustering for simultaneous feature learning and clustering of variable length time series

The task of clustering unlabeled time series and sequences entails a par...
07/08/2019

Routine Modeling with Time Series Metric Learning

Traditionally, the automatic recognition of human activities is performe...
10/11/2018

Measuring Sample Path Causal Influences with Relative Entropy

We present a sample path dependent measure of causal influence between t...